Mining top-up transactions and online classified ads to predict urban neighborhoods socioeconomic status

Quantifying income inequalities in developing countries faces challenges regarding data publicly available. Census data, collected every five or ten years, is the only source for socioeconomic indicators. Thus, local authorities need ways of producing more frequently updated indicators. Studies conducted for developed countries (Europe and USA) use Call Detail Records (CDRs) for such a purpose. In our study we propose to exploit patterns observed in developing countries, specifically in Latin America, where mobile phone usage is pervasive even among the poorest and the dominant modality for purchasing mobile airtime is the prepaid scheme (top-ups). We analyze more than 1M top-up transactions together with more than 5K online classified ads for housing sales to predict the socioeconomic status measured at an intra-urban level for 89 neighborhoods. Using a Linear Regression with Regularization L1 (Lasso), we can explain the economic status with a prediction rate up to 71% for urban neighborhoods in Guayaquil, Ecuador. Consequently, we show evidence that top-up transactions provide effective signals to characterize urban neighborhoods socioeconomic status.

[1]  Daniele Quercia,et al.  Taking Brazil's pulse: tracking growing urban economies from online attention , 2014, WWW '14 Companion.

[2]  R. Reed The contribution of social area analysis: Modelling house price variations at the neighbourhood level in Australia , 2013 .

[3]  L. Capra,et al.  Ubiquitous Sensing for Mapping Poverty in Developing Countries , 2013 .

[4]  S. Chand,et al.  House prices and market fundamentals in urban China , 2013 .

[5]  Gabriel Cadamuro,et al.  Predicting poverty and wealth from mobile phone metadata , 2015, Science.

[6]  Jeroen Warner,et al.  The socioeconomic vulnerability index: A pragmatic approach for assessing climate change led risks–A case study in the south-western coastal Bangladesh , 2014 .

[7]  Patrick E. McSharry,et al.  Constructing spatiotemporal poverty indices from big data , 2017 .

[8]  Peter Yamakawa,et al.  The diffusion of mobile telephones: An empirical analysis for Peru , 2013 .

[9]  N. Eagle,et al.  Network Diversity and Economic Development , 2010, Science.

[10]  Qingquan Li,et al.  Understanding aggregate human mobility patterns using passive mobile phone location data: a home-based approach , 2015, Transportation.

[11]  J. Blumenstock,et al.  A Method for Estimating the Relationship Between Phone Use and Wealth , 2010 .

[12]  Virgílio A. F. Almeida,et al.  We know where you live: privacy characterization of foursquare behavior , 2012, UbiComp.

[13]  Margaret Martonosi,et al.  Human mobility modeling at metropolitan scales , 2012, MobiSys '12.

[14]  Jianfeng Ma,et al.  Hidden community identification in location-based social network via probabilistic venue sequences , 2018, Inf. Sci..

[15]  Xavier Andrade,et al.  RiSC: Quantifying change after natural disasters to estimate infrastructure damage with mobile phone data , 2018, 2018 IEEE International Conference on Big Data (Big Data).

[16]  Carlo Ratti,et al.  Human mobility and socioeconomic status: Analysis of Singapore and Boston , 2018, Comput. Environ. Urban Syst..

[17]  P. Boelhouwer,et al.  Equilibrium between interest payments and income in the housing market , 2009 .

[18]  G. Galster,et al.  Making our neighborhoods, making our selves , 2019, Urban Geography.

[19]  Zbigniew Smoreda,et al.  Using big data to study the link between human mobility and socio-economic development , 2015, 2015 IEEE International Conference on Big Data (Big Data).

[20]  Víctor Soto,et al.  Prediction of socioeconomic levels using cell phone records , 2011, UMAP'11.

[21]  Ming-Chi Chen,et al.  House prices and household income: Do they move apart? Evidence from Taiwan , 2007 .

[22]  Bjørn-Atle Reme,et al.  Deep Learning Applied to Mobile Phone Data for Individual Income Classification , 2016 .

[23]  Francesc Prior,et al.  Banking the Unbanked Using Prepaid Platforms and Mobile Telephones in the United States , 2010 .

[24]  M. Austin,et al.  The Status of Low-Income Neighborhoods in the Post-Welfare Reform Environment , 2005, Journal of health & social policy.

[25]  Sudeept Mohan,et al.  STATE , 2016, Intell. Serv. Robotics.