Understanding the Representativeness of Mobile Phone Location Data in Characterizing Human Mobility Indicators

The advent of big data has aided understanding of the driving forces of human mobility, which is beneficial for many fields, such as mobility prediction, urban planning, and traffic management. However, the data sources used in many studies, such as mobile phone location and geo-tagged social media data, are sparsely sampled in the temporal scale. An individual’s records can be distributed over a few hours a day, or a week, or over just a few hours a month. Thus, the representativeness of sparse mobile phone location data in characterizing human mobility requires analysis before using data to derive human mobility patterns. This paper investigates this important issue through an approach that uses subscriber mobile phone location data collected by a major carrier in Shenzhen, China. A dataset of over 5 million mobile phone subscribers that covers 24 h a day is used as a benchmark to test the representativeness of mobile phone location data on human mobility indicators, such as total travel distance, movement entropy, and radius of gyration. This study divides this dataset by hour, using 2- to 23-h segments to evaluate the representativeness due to the availability of mobile phone location data. The results show that different numbers of hourly segments affect estimations of human mobility indicators and can cause overestimations or underestimations from the individual perspective. On average, the total travel distance and movement entropy tend to be underestimated. The underestimation coefficient results for estimation of total travel distance are approximately linear, declining as the number of time segments increases, and the underestimation coefficient results for estimating movement entropy decline logarithmically as the time segments increase, whereas the radius of gyration tends to be more ambiguous due to the loss of isolated locations. This paper suggests that researchers should carefully interpret results derived from this type of sparse data in the era of big data.

[1]  Paolo Santi,et al.  Supersampling and Network Reconstruction of Urban Mobility , 2015, PloS one.

[2]  Martin Raubal,et al.  Correlating mobile phone usage and travel behavior - A case study of Harbin, China , 2012, Comput. Environ. Urban Syst..

[3]  Cecilia Mascolo,et al.  A Tale of Many Cities: Universal Patterns in Human Urban Mobility , 2011, PloS one.

[4]  Tianbo Lu,et al.  Next Big Thing in Big Data: The Security of the ICT Supply Chain , 2013, 2013 International Conference on Social Computing.

[5]  Tao Zhang,et al.  Understanding Spatiotemporal Patterns of Human Convergence and Divergence Using Mobile Phone Location Data , 2016, ISPRS Int. J. Geo Inf..

[6]  Caroline O. Buckee,et al.  Heterogeneous Mobile Phone Ownership and Usage Patterns in Kenya , 2012, PloS one.

[7]  Ling Yin,et al.  Re-Identification Risk versus Data Utility for Aggregated Mobility Research Using Mobile Phone Location Data , 2015, PloS one.

[8]  Wei-Ying Ma,et al.  Understanding mobility based on GPS data , 2008, UbiComp.

[9]  Carolien Beckx,et al.  Dynamic assessment of exposure to air pollution using mobile phone data , 2016, International Journal of Health Geographics.

[10]  Ling Yin,et al.  Estimating Potential Demand of Bicycle Trips from Mobile Phone Data - An Anchor-Point Based Approach , 2016, ISPRS Int. J. Geo Inf..

[11]  Luis Miguel Romero Pérez,et al.  Traffic Flow Estimation Models Using Cellular Phone Data , 2012, IEEE Transactions on Intelligent Transportation Systems.

[12]  Caroline O. Buckee,et al.  The impact of biases in mobile phone ownership on estimates of human mobility , 2013, Journal of The Royal Society Interface.

[13]  David L. Smith,et al.  Quantifying the Impact of Human Mobility on Malaria , 2012, Science.

[14]  M. Haklay How Good is Volunteered Geographical Information? A Comparative Study of OpenStreetMap and Ordnance Survey Datasets , 2010 .

[15]  Sune Lehmann,et al.  Understanding the Demographics of Twitter Users , 2011, ICWSM.

[16]  Michael F. Goodchild,et al.  The quality of big (geo)data , 2013 .

[17]  Marta C. González,et al.  A universal model for mobility and migration patterns , 2011, Nature.

[18]  Morton E. O'Kelly,et al.  Spatial Interaction Models:Formulations and Applications , 1988 .

[19]  Ryosuke Shibasaki,et al.  Comparative Perspective of Human Behavior Patterns to Uncover Ownership Bias among Mobile Phone Users , 2016, ISPRS Int. J. Geo Inf..

[20]  Chaoming Song,et al.  Modelling the scaling properties of human mobility , 2010, 1010.0436.

[21]  G. Jacquez A research agenda: does geocoding positional error matter in health GIS studies? , 2012, Spatial and spatio-temporal epidemiology.

[22]  Carlo Ratti,et al.  Exploring Universal Patterns in Human Home-Work Commuting from Mobile Phone Data , 2013, PloS one.

[23]  B. Chen,et al.  Most reliable path-finding algorithm for maximizing on-time arrival probability , 2017 .

[24]  Mirko Degli Esposti,et al.  Entropic measures of individual mobility patterns , 2013 .

[25]  Margaret Martonosi,et al.  ON CELLULAR , 2022 .

[26]  Adam Jacobs,et al.  The pathologies of big data , 2009, Commun. ACM.

[27]  Marc Barthelemy,et al.  A stochastic model of randomly accelerated walkers for human mobility , 2015, Nature Communications.

[28]  Hui Zang,et al.  Are call detail records biased for sampling human mobility? , 2012, MOCO.

[29]  Marc Barthelemy,et al.  Corrigendum: Influence of sociodemographic characteristics on human mobility , 2015, Scientific reports.

[30]  Matthew Smith,et al.  Big data privacy issues in public social media , 2012, 2012 6th IEEE International Conference on Digital Ecosystems and Technologies (DEST).

[31]  Sune Lehmann,et al.  Understanding predictability and exploration in human mobility , 2016, EPJ Data Science.

[32]  Albert-László Barabási,et al.  Understanding individual human mobility patterns , 2008, Nature.

[33]  M. Goodchild,et al.  Uncertainty in geographical information , 2002 .

[34]  T. Geisel,et al.  The scaling laws of human travel , 2006, Nature.

[35]  Albert-László Barabási,et al.  Limits of Predictability in Human Mobility , 2010, Science.

[36]  Yong Gao,et al.  Uncovering Patterns of Inter-Urban Trip and Spatial Interaction from Social Media Check-In Data , 2013, PloS one.

[37]  Michael F. Goodchild,et al.  Assuring the quality of volunteered geographic information , 2012 .

[38]  O. Järv,et al.  Understanding monthly variability in human activity spaces: A twelve-month study using mobile phone call detail records , 2014 .

[39]  M. Batty,et al.  Variability in Regularity: Mining Temporal Mobility Patterns in London, Singapore and Beijing Using Smart-Card Data , 2016, PloS one.

[40]  Carlo Ratti,et al.  Estimating Origin-Destination flows using opportunistically collected mobile phone location data from one million users in Boston Metropolitan Area , 2011 .

[41]  Qingquan Li,et al.  Another Tale of Two Cities: Understanding Human Activity Space Using Actively Tracked Cellphone Location Data , 2016, Geographies of Mobility.

[42]  Brent J. Hecht,et al.  A Tale of Cities: Urban Biases in Volunteered Geographic Information , 2014, ICWSM.

[43]  Yi Zhu,et al.  Inferring individual daily activities from mobile phone traces: A Boston example , 2016 .

[44]  Carlo Ratti,et al.  Mobile Landscapes: Using Location Data from Cell Phones for Urban Analysis , 2006 .

[45]  Fasheng Liu,et al.  Estimating freeway traffic measures from mobile phone location data , 2013, Eur. J. Oper. Res..

[46]  Song Gao,et al.  Discovering Spatial Interaction Communities from Mobile Phone Data , 2013 .

[47]  K. Fu,et al.  Reality Check for the Chinese Microblog Space: A Random Sampling Approach , 2013, PloS one.

[48]  Qingquan Li,et al.  Spatiotemporal analysis of critical transportation links based on time geographic concepts: a case study of critical bridges in Wuhan, China , 2012 .

[49]  Carlo Ratti,et al.  Understanding individual mobility patterns from urban sensing data: A mobile phone trace example , 2013 .

[50]  Qingquan Li,et al.  Understanding aggregate human mobility patterns using passive mobile phone location data: a home-based approach , 2015, Transportation.

[51]  Robert Brewer,et al.  Evaluation of Cell Phone Traffic Data in Minnesota , 2008 .

[52]  Omer Tene Jules Polonetsky,et al.  Privacy in the Age of Big Data: A Time for Big Decisions , 2012 .

[53]  D. Hosmer,et al.  A review of goodness of fit statistics for use in the development of logistic regression models. , 1982, American journal of epidemiology.

[54]  Chenghu Zhou,et al.  A new insight into land use classification based on aggregated mobile phone data , 2013, Int. J. Geogr. Inf. Sci..

[55]  Ling Yin,et al.  Understanding the bias of call detail records in human mobility research , 2016, Int. J. Geogr. Inf. Sci..

[56]  Liang Liu,et al.  Estimating Origin-Destination Flows Using Mobile Phone Location Data , 2011, IEEE Pervasive Computing.

[57]  Piotr Sapiezynski,et al.  Inferring Stop-Locations from WiFi , 2016, PloS one.

[58]  Vyron Antoniou,et al.  How Many Volunteers Does it Take to Map an Area Well? The Validity of Linus’ Law to Volunteered Geographic Information , 2010 .