Forecasting asylum applications in the European Union with machine learning and data at scale

The effects of the so-called "refugee crisis" of 2015-16 continue to dominate much of the European political agenda. Migration flows were sudden and unexpected, exposing significant shortcomings in the field of migration forecasting and leaving governments and NGOs unprepared. Migration is a complex system typified by episodic variation, underpinned by causal factors that are interacting, highly context dependent and short-lived. Correspondingly, migration nowcasts rely on scattered low-quality data and much-needed forecasts are local and inconsistent. Here we describe a data-driven adaptive system for forecasting asylum applications in the European Union (EU), built on machine learning algorithms that combine administrative data with non-traditional data sources at scale. We exploit three tiers of data: geolocated events and internet searches in countries of origin, detections at the EU external border, and asylum recognition rates in the EU, to effectively forecast individual asylum-migration flows up to four weeks ahead with high accuracy. Uniquely our approach a) models individual country-to-country migration flows; b) detects migration drivers early onset; c) anticipates lagged effects; d) estimates the effect of individual drivers; and e) describes how patterns of drivers shift over time. This is, to our knowledge, the first comprehensive system for forecasting asylum applications based on an unsupervised algorithm and data at scale. Importantly, this approach can be extended to forecast other migration social-economic indicators.

[1]  Ingmar Weber,et al.  Inferring Migrations: Traditional Methods and New Approaches based on Mobile Phone, Social Media, and other Big Data: Feasibility study on Inferring (labour) mobility and migration in the European Union from big data and social media data , 2016 .

[2]  Hideitsu Hino,et al.  The YUIMA Project: A Computational Framework for Simulation and Inference of Stochastic Differential Equations , 2014 .

[3]  E. Neumayer Bogus Refugees? The Determinants of Asylum Migration to Western Europe , 2004 .

[4]  Jakub Bijak,et al.  Assessing time series models for forecasting international migration: Lessons from the United Kingdom , 2019, Journal of Forecasting.

[5]  Nakahiro Yoshida,et al.  Nonsynchronous covariation process and limit theorems , 2011 .

[6]  M. White International Handbook of Migration and Population Distribution , 2015 .

[7]  M. Czaika,et al.  Do restrictive asylum and visa policies increase irregular migration into Europe? , 2016 .

[8]  Jangkoo Kang,et al.  An Empirical Investigation of the Lead-Lag Relations of Returns and Volatilities among the KOSPI200 Spot, Futures and Options Markets and their Explanations , 2006 .

[9]  H. Haas,et al.  Trends, determinants and policy effects , 2018 .

[10]  Marcus H. Böhme,et al.  Searching for a better life: Predicting international migration with online search keywords , 2020, Journal of Development Economics.

[11]  F. Barthel,et al.  Spatial Dependence in Asylum Migration , 2014 .

[12]  M. Røed,et al.  Reduction or deflection? The effect of asylum policy on interconnected asylum flows , 2017 .

[13]  Asymptotic normality of a covariance estimator for nonsynchronously observed diffusion processes , 2008 .

[14]  Mathieu Rosenbaum,et al.  A New Approach for the Dynamics of Ultra-High-Frequency Data: The Model with Uncertainty Zones , 2011 .

[15]  A. Singleton Migration and Asylum Data for Policy-Making in the European Union – The Problem with Numbers , 2016 .

[16]  Lan Zhang Estimating Covariation: Epps Effect, Microstructure Noise , 2006 .

[17]  H. Haas,et al.  International Migration: Trends, Determinants, and Policy Effects , 2019, Population and Development Review.

[18]  Graeme Hugo,et al.  Theories of international migration: a review and appraisal. , 1993 .

[19]  M. Czaika,et al.  Drivers of migration: A synthesis of knowledge , 2020 .

[20]  N. Yoshida,et al.  On covariance estimation of non-synchronously observed diffusion processes , 2005 .

[21]  C. Chiao,et al.  The price adjustment and lead-lag relations between stock returns: microstructure evidence from the Taiwan stock market , 2004 .

[22]  Ingmar Weber,et al.  Migration data using social media: a European perspective , 2018 .

[23]  Susanne Schmeidl,et al.  Exploring the causes of forced migration: a pooled time-series analysis 1971-1990. , 1997 .

[24]  Miltos D. Grammatikakis,et al.  State-of-the-Art and Challenges , 2018, Distributed Real-Time Architecture for Mixed-Criticality Systems.

[25]  Gary D. Thompson,et al.  Review and Appraisal , 2016 .

[26]  S. Vertovec Super-diversity and its implications , 2007, Celebrating 40 Years of Ethnic and Racial Studies.

[27]  Natalia Adler,et al.  Monitoring of the Venezuelan exodus through Facebook’s advertising platform , 2020, PloS one.

[28]  Estimation for the change point of the volatility in a stochastic differential equation , 2009, 0906.3108.

[29]  Jurij Paraszczak,et al.  A machine learning approach to scenario analysis and forecasting of mixed migration , 2020, IBM J. Res. Dev..

[30]  D. Toshkov The dynamic relationship between asylum applications and recognition rates in Europe (1987–2010) , 2014 .

[31]  Frank de Jong,et al.  High frequency analysis of lead-lag relationships between financial markets , 1997 .

[32]  P. Fearnhead,et al.  Optimal detection of changepoints with a linear computational cost , 2011, 1101.1438.

[33]  W. Jennings Bureaucratic Performance and Control in British Politics: Asylum Policy 1994–2007 , 2010 .

[34]  Joshua S. Goldstein A Conflict-Cooperation Scale for WEIS Events Data , 1992 .

[35]  T. Hatton,et al.  Applications for Asylum in the Developed World: Modelling Asylum Claims by Origin and Destination , 2015 .

[36]  Derek Groen,et al.  A generalized simulation development approach for predicting refugee destinations , 2017, Scientific Reports.

[37]  M. Hoffmann,et al.  Estimation of the lead-lag parameter from non-synchronous data , 2013, 1303.4871.

[38]  Nakahiro Yoshida,et al.  Simulation and Inference for Stochastic Processes with YUIMA: A Comprehensive R Framework for SDEs and Other Stochastic Processes , 2018 .

[39]  Hein de Haas,et al.  The Internal Dynamics of Migration Processes: A Theoretical Inquiry , 2010 .

[40]  Brandon M Stewart,et al.  Predicting Risk Factors Associated with Forced Migration: An Early Warning Model of Haitian Flight , 2007 .

[41]  T. Hatton The Rise and Fall of Asylum: What Happened and Why? , 2008 .

[42]  Atsushi Nakazawa,et al.  Measurement, Analysis and Modeling , 2014 .

[43]  Stephen M. Shellman,et al.  Refugee or Internally Displaced Person? To Where Should One Flee? , 2005 .

[44]  F. Comte,et al.  Long memory continuous time models , 1996 .