Proposing a Broader Scope of Predictive Features for Modeling Refugee Counts

The world-wide refugee problem has a long history, but continues to this day, and will unfortunately continue into the foreseeable future. Efforts to anticipate, mitigate and prepare for refugee counts, however, are still lacking. There are many potential causes, but the published research has primarily focused on identifying ways to integrate already existing refugees into the various communities wherein they ultimately reside, rather than on preventive measures. The work proposed herein uses a set of features that can be divided into three basic categories: 1) sociocultural, 2) socioeconomic, and 3) economic, which refer to the nature of each proposed predictive feature. For example, corruption perception is a sociocultural feature, access to healthcare is a socioeconomic feature, and inflation is an economic feature. Forty-five predictive features were collected for various years and countries of interest. As may seem intuitive, the features that fell under the category of "economic" produced the highest predictive value from the regression technique employed. However, additional potential predictive features that have not been previously addressed stood out in our experiments. These include: the global peace index (gpi), freedom of expression (fe), internet users (iu), access to healthcare (hc), cost of living index (coli), local purchasing power index (lppi), homicide rate (hr), access to justice (aj), and women's property rights (wpr). Many of these features are nascent in terms of both their development and collection, as well as the fact that some of these features are not yet collected at a universal level, meaning that the data is missing for some countries and years. Ongoing work regarding these datasets for predicting refugee counts is also discussed in this work.

[1]  Jurij Paraszczak,et al.  A machine learning approach to scenario analysis and forecasting of mixed migration , 2020, IBM J. Res. Dev..

[2]  Will Moore,et al.  Sometimes You Just Have to Leave: Domestic Threats and Forced Migration, 1964-1989 , 2003 .

[3]  Liliana Pérez,et al.  An Agent-Based Model to Identify Migration Pathways of Refugees: The Case of Syria , 2018 .

[4]  A. Richmond Reactive Migration: Sociological Perspectives On Refugee Movements , 1993 .

[5]  J. Bijak Bayesian Methods in International Migration Forecasting , 2008 .

[6]  Joshua S. Martineau Red Flags: A Model for the Early Warning of Refugee Outflows , 2010 .

[7]  C. Apodaca,et al.  Human Rights Abuses: Precursor to Refugee Flight? , 1998 .

[8]  Derek Groen,et al.  A Coupled Food Security and Refugee Movement Model for the South Sudan Conflict , 2019, ICCS.

[9]  Jian Jiang,et al.  Boosting Tree-Assisted Multitask Deep Learning for Small Scientific Datasets , 2020, J. Chem. Inf. Model..

[10]  D. F. Andrews,et al.  A Robust Method for Multiple Linear Regression , 1974 .

[11]  J. Jenkins,et al.  Issues in Quantitative Modelling in theEarly Warning of Refugee Migration , 1996 .

[12]  Brandon M Stewart,et al.  Predicting Risk Factors Associated with Forced Migration: An Early Warning Model of Haitian Flight , 2007 .

[14]  Huiyu Zhou,et al.  Using deep neural network with small dataset to predict material defects , 2019, Materials & Design.

[15]  W. Adger,et al.  The effect of environmental change on human migration , 2011 .

[16]  Ying Zhang,et al.  A strategy to apply machine learning to small datasets in materials science , 2018, npj Computational Materials.

[17]  Muhammad Nihal Hussain,et al.  Voices from Venezuela: Examining Blogs to Study the Socio-Politico-Economic Crisis and Consequent Emigration , 2018 .

[18]  Susanne Schmeidl,et al.  Exploring the causes of forced migration: a pooled time-series analysis 1971-1990. , 1997 .