You are where you e-mail: using e-mail data to estimate international migration rates

International migration is one of the major determinants of demographic change. Although efforts to produce comparable statistics are underway, estimates of demographic flows are inexistent, outdated, or largely inconsistent, for most countries. We estimate age and gender-specific migration rates using data extracted from a large sample of Yahoo! e-mail messages. Self-reported age and gender of anonymized e-mail users were linked to the geographic locations (mapped from IP addresses) from where users sent e-mail messages over time (2009-2011). The users' country of residence over time was inferred as the one from where most e-mail messages were sent. Our estimates of age profiles of migration are qualitatively consistent with existing administrative data sources. Selection bias generates uncertainty for estimates at one point in time, especially for developing countries. However, our approach allows us to compare in a reliable way migration trends of females and males. We document the recent increase in human mobility and we observe that female mobility has been increasing at a faster pace. Our findings suggest that e-mail data may complement existing migration data, resolve inconsistencies arising from different definitions of migration, and provide new and rich information on mobility patterns and social networks of migrants. The use of digital records for demographic research has the potential to become particularly important for developing countries, where the diffusion of Internet will be faster than the development of mature demographic registration systems.

[1]  Guy J. Abel,et al.  Estimation of international migration flow tables in Europe , 2010 .

[2]  Cong Yu,et al.  Constructing travel itineraries from tagged geo-temporal breadcrumbs , 2010, WWW '10.

[3]  Cong Yu,et al.  Automatic construction of travel itineraries using social breadcrumbs , 2010, HT '10.

[4]  Ingmar Weber,et al.  The demographics of web search , 2010, SIGIR.

[5]  Martin Raubal,et al.  A case for space: physical and virtual location requirements in the CouchSurfing social network , 2009, LBSN '09.

[6]  Andrei Rogers,et al.  Model migration schedules , 1981 .

[7]  R. Yarwood,et al.  Beyond Six Billion: Forecasting the World's Population , 2003 .

[8]  Murat Ali Bayir,et al.  Discovering spatiotemporal mobility profiles of cellphone users , 2009, 2009 IEEE International Symposium on a World of Wireless, Mobile and Multimedia Networks & Workshops.

[9]  Laura Ferrari,et al.  Discovering daily routines from Google Latitude with topic models , 2011, 2011 IEEE International Conference on Pervasive Computing and Communications Workshops (PERCOM Workshops).

[10]  Fabian J. Theis,et al.  Money Circulation, Trackable Items, and the Emergence of Universal Human Mobility Patterns , 2008, IEEE Pervasive Computing.

[11]  Joel E. Cohen,et al.  International migration beyond gravity: A statistical model for use in population projections , 2008, Proceedings of the National Academy of Sciences.

[12]  Cecilia Mascolo,et al.  An Empirical Study of Geographic User Activity Patterns in Foursquare , 2011, ICWSM.

[13]  Peter Boden,et al.  Monitoring who moves where: information systems for internal and international migration , 2011 .

[14]  Ronald Lee,et al.  The Outlook for Population Growth , 2011, Science.

[15]  Andrei Rogers,et al.  Applying Model Migration Schedules to Represent Age‐Specific Migration Flows , 2008 .

[16]  Mark Chignell,et al.  Proceedings of the 21st ACM conference on Hypertext and hypermedia , 2010, Hypertext 2010.

[17]  Jon Crowcroft,et al.  Planet-scale human mobility measurement , 2009, HotPlanet '10.

[18]  Franco Zambonelli,et al.  Extracting urban patterns from location-based social networks , 2011, LBSN '11.

[19]  Andrei Rogers,et al.  Origin dependence, secondary migration, and the indirect estimation of migration flows from population stocks , 2005 .

[20]  J. de Beer,et al.  Overcoming the Problems of Inconsistent International Migration data: A New Method Applied to Flows in Europe , 2010, European journal of population = Revue europeenne de demographie.

[21]  Xing Xie,et al.  Proceedings of the 2009 International Workshop on Location Based Social Networks, LBSN 2009, November 3, 2009, Seattle, Washington, USA, Proceedings , 2009, GIS-LBSN.