Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3292500.3330774acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article
Open access

Blending Noisy Social Media Signals with Traditional Movement Variables to Predict Forced Migration

Published: 25 July 2019 Publication History

Abstract

Worldwide displacement due to war and conflict is at all-time high. Unfortunately, determining if, when, and where people will move is a complex problem. This paper proposes integrating both publicly available organic data from social media and newspapers with more traditional indicators of forced migration to determine when and where people will move. We combine movement and organic variables with spatial and temporal variation within different Bayesian models and show the viability of our method using a case study involving displacement in Iraq. Our analysis shows that incorporating open-source generated conversation and event variables maintains or improves predictive accuracy over traditional variables alone. This work is an important step toward understanding how to leverage organic big data for societal--scale problems.

References

[1]
P. Adhikari. 2013. Conflict-Induced Displacement, Understanding the Causes of Flight. American Journal of Political Science, Vol. 57, 1 (2013), 82--89.
[2]
A. Agrawal and A. An. 2016. Selective Co-occurrences for Word-Emotion Association. In International Conference on Computational Linguistics (COLING).
[3]
L. Balcells and A. Steele. 2016. Warfare, political identities, and displacement in Spain and Colombia. Political Geography, Vol. 51 (2016), 15--29.
[4]
L. Bengtsson, X. Lu, A. Thorson, R. Garfield, and J. Von Schreeb. 2011. Improved response to disasters and outbreaks by tracking population movements with mobile phone network data: a post-earthquake geospatial study in Haiti., Vol. 8 (2011).
[5]
M. Betancourt. 2017. Diagnosing biased inference with divergences. http://mc-stan.org/documentation/case-studies/divergences_and_bias.html . (2017).
[6]
J. Blumenstock. 2012. Inferring patterns of internal migration from mobile phone call records: evidence from Rwanda. Information Technology for Development, Vol. 18 (2012), 107--125.
[7]
J. Borokowski and Z. Bulutgil. 2012. Ethnicity Study : Ethnic Composition at District Level. (2012). https://esoc.princeton.edu/files/ethnicity-study-ethnic-composition-district-level The Empirical Studies of Conflict (ESOC).
[8]
E. Boschee, J. Lautenschlager, S. O'Brien, S. Shellman, and et al. 2015. ICEWS Coded Event Data. (2015).
[9]
M. Bylander. 2015. Depending on the sky: Environmental distress, migration, and coping in rural Cambodia. International Migration, Vol. 53, 5 (2015), 135--147.
[10]
D. Carter and C. Signorino. 2010. Back to the future: Modeling time dependence in binary data. Political Analysis, Vol. 18, 3 (2010), 271--292.
[11]
R. Churchill, L. Singh, and C. Kirov. 2018. A Temporal Topic Model for Noisy Mediums. In Pasific Asian Conference on Knowledge Discovery and Data Mining .
[12]
Norwegian Refugee Council and Chemin de Balexert. 2018. Internal Displacement Monitoring Centre. Guidelines on Profiling Internally Displaced Persons (2018).
[13]
C. Davenport, W. Moore, and S. Poe. 2003. Sometimes you just have to leave: Domestic threats and forced migration, 1964--1989. International Interactions, Vol. 29, 1 (2003), 27--55.
[14]
C. Davenport and R. Talibova. 2019. Counting Battle Deaths: Shaping Outcomes in Civil Wars. (2019). Unpublished Manuscript.
[15]
P. Deville, C. Linard, S. Martin, M. Gilbert, Forrest Stevens, Andrea Gaughan, Vincent Blondel, and Andrew Tatem. 2014. Dynamic population mapping using mobile phone data. Proceedings of the National Academy of Sciences(PNAS), Vol. 111 (2014), 15888--15893.
[16]
K. Donato and E. Ferris. 2017. A Dynamic View of Forced Migration: Revisiting the Push-Pull Framework. TransAtlantic Council on Migration (2017).
[17]
P. Earle, D. Bowden, and M. Guy. 2012. Twitter earthquake detection: earthquake monitoring in a social world. Annals of Geophysics, Vol. 54 (2012). Issue 6.
[18]
United Nations High Commissioner for Refugees. 2016. UNHCR Global Trends: Forced Displacement in 2015. (2016). Technical Report.
[19]
I. Foster, R. Ghani, R. Jarmin, F. Kreuter, and J. Lane. 2016. Big data and social science: A practical guide to methods and tools .CRC Press.
[20]
E. Frias-Martinez, G. Williamson, and V. Frias-Martinez. An agent-based model of epidemic spread using human mobility and social network information. In IEEE International Conference on Social Computing (2011).
[21]
J. Hockett, Y. Liu, Y. Wei, L. Singh, and N. Schneider. Detecting and using buzz from newspapers to understand patterns of movement. In IEEE International Conference on Big Data (2018).
[22]
M. Hoffman and A. Gelman. 2014. The No-U-turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo. Journal of Machine Learning Research, Vol. 15, 1 (2014), 1593--1623.
[23]
L. Hunter. 2005. Migration and environmental hazards. Population and environment, Vol. 26, 4 (2005), 273--302.
[24]
D. Karemera, V. Oguledo, and B. Davis. 2000. A gravity model analysis of international migration to North America. Applied Economics, Vol. 32, 13 (2000), 1745--1755.
[25]
N. Lozano-Gracia, G. Piras, A. M. Ibá nez, and G. Hewings. 2010. The journey to safety: conflict-driven migration flows in Colombia. International Regional Science Review, Vol. 33, 2 (2010), 157--180.
[26]
S. Martin and L. Singh. 2018. Data Analytics and Displacement: Using Big Data to Forecast Mass Movements of People. (2018).
[27]
S. Martin, L. Singh, A. Taylor, and Wahedi L. 2019. What drives forced migration?: A dynamic factor analysis. (2019). Working Paper.
[28]
D. Massey. 2015. A missing element in migration theories. international Journal of Migration Studies (2015), 279.
[29]
E. Melander and M. Öberg. 2006. Time to go? Duration dependence in forced migration. International Interactions, Vol. 32, 2 (2006), 129--152.
[30]
E. Melander and M. Öberg. 2007. The threat of violence and forced migration: Geographical scope trumps intensity of fighting. Civil Wars, Vol. 9, 2 (2007), 156--173.
[31]
S. Mohammad. 2016. A practical guide to sentiment annotation: Challenges and solutions. In Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis. 174--179.
[32]
W. Moore and S. Shellman. 2004. Fear of persecution: Forced migration, 1952--1995. Journal of Conflict Resolution, Vol. 48, 5 (2004), 723--745.
[33]
N. Ramakrishnan, P. Butler, S. Muthiah, N. Self, and et al. 2014. 'Beating the News' with EMBERS: Forecasting Civil Unrest Using Open Source Indicators. In ACM International Conference on Knowledge Discovery and Data Mining (KDD) .
[34]
R. Reuveny. 2007. Climate change-induced migration and violent conflict. Political Geography, Vol. 26, 6 (2007), 656--673.
[35]
J. Rubin and W. Moore. 2007. Risk factors for forced migrant flight. Conflict Management and Peace Science, Vol. 24, 2 (2007), 85--104.
[36]
I. Salehyan and K. Gleditsch. 2006. Refugees and the spread of civil war. International Organization, Vol. 60, 2 (2006), 335--366.
[37]
J. Salvatier, T. Wiecki, and C. Fonnesbeck. 2016. Probabilistic programming in Python using PyMC3. PeerJ Computer Science, Vol. 2 (2016), e55.
[38]
J. Schon. 2015. Focus on the forest, not the trees: A changepoint model of forced displacement. Journal of Refugee Studies, Vol. 28, 4 (2015), 437--467.
[39]
L. Singh and DataLab. Forced Migration Interactive Visualizations of Big Data Variables. http://forcedmigration.cs.georgetown.edu . ( ????).
[40]
L. Singh and R. Pemmaraju. 2017. EOS: A multilingual text archive of international newspaper & blog articles. In IEEE International Conference on Big Data .
[41]
B. State, M. Rodriguez, D. Helbing, and E. Zagheni. 2014. Migration of professionals to the US: evidence from linkedin data. In International Conference on Social Informatics. Springer.
[42]
J. Tarver. 1961. Predicting migration. Social Forces, Vol. 39, 3 (1961), 207--213.
[43]
A. Tollefsen, H. Strand, and H. Buhaug. 2012. PRIO-GRID: A unified spatial data structure. Journal of Peace Research, Vol. 49, 2 (2012), 363--374.
[44]
X. Wang and A. McCallum. 2006. Topics over time: a non-Markov continuous-time model of topical trends. In ACM International Conference on Knowledge Discovery and Data Mining (KDD) .
[45]
Y. Wei, L. Singh, D. Buttler, and B. Gallagher. 2018. Using semantic graphs to detect overlapping target events and story lines from newspaper articles. International Journal of Data Science and Analytics, Vol. 5, 1 (2018), 41--60.

Cited By

View all
  • (2024)The digital trail of Ukraine’s 2022 refugee exodusJournal of Computational Social Science10.1007/s42001-024-00304-47:2(2147-2193)Online publication date: 16-Jul-2024
  • (2023)Forecasting Ukrainian Refugee Flows With Organic Data SourcesInternational Migration Review10.1177/01979183231203931Online publication date: 9-Oct-2023
  • (2023)Big Data for the Prediction of Forced DisplacementInternational Migration Review10.1177/01979183231195296Online publication date: 23-Aug-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '19: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
July 2019
3305 pages
ISBN:9781450362016
DOI:10.1145/3292500
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 July 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. bayesian
  2. big data
  3. forced migration
  4. open source data
  5. social media
  6. text mining

Qualifiers

  • Research-article

Funding Sources

Conference

KDD '19
Sponsor:

Acceptance Rates

KDD '19 Paper Acceptance Rate 110 of 1,200 submissions, 9%;
Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)216
  • Downloads (Last 6 weeks)19
Reflects downloads up to 09 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)The digital trail of Ukraine’s 2022 refugee exodusJournal of Computational Social Science10.1007/s42001-024-00304-47:2(2147-2193)Online publication date: 16-Jul-2024
  • (2023)Forecasting Ukrainian Refugee Flows With Organic Data SourcesInternational Migration Review10.1177/01979183231203931Online publication date: 9-Oct-2023
  • (2023)Big Data for the Prediction of Forced DisplacementInternational Migration Review10.1177/01979183231195296Online publication date: 23-Aug-2023
  • (2023)All Translation Tools Are Not Equal: Investigating the Quality of Language Translation for Forced Migration2023 IEEE 10th International Conference on Data Science and Advanced Analytics (DSAA)10.1109/DSAA60987.2023.10302481(1-10)Online publication date: 9-Oct-2023
  • (2023)A Bayesian Approach of Predicting the Movement of Internally Displaced PersonsSocial, Cultural, and Behavioral Modeling10.1007/978-3-031-43129-6_24(241-250)Online publication date: 16-Sep-2023
  • (2022)Environmental change and human mobility: Opportunities and challenges of big dataInternational Migration10.1111/imig.1300261:5(29-46)Online publication date: 10-Apr-2022
  • (2022)Predictive modelling of movements of refugees and internally displaced people: towards a computational frameworkJournal of Ethnic and Migration Studies10.1080/1369183X.2022.210054649:2(408-444)Online publication date: 16-Aug-2022
  • (2022)Mobile phone data reveal the effects of violence on internal displacement in AfghanistanNature Human Behaviour10.1038/s41562-022-01336-46:5(624-634)Online publication date: 12-May-2022
  • (2021)The Role of Emerging Predictive IT Tools in Effective Migration GovernancePolitics and Governance10.17645/pag.v9i4.44369:4(133-145)Online publication date: 28-Oct-2021
  • (2021)Text Analytic Research Portals: Supporting Large-Scale Social Science Research2021 IEEE International Conference on Big Data (Big Data)10.1109/BigData52589.2021.9671696(6020-6022)Online publication date: 15-Dec-2021
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media