Understanding the bias of call detail records in human mobility research

ABSTRACT In recent years, call detail records (CDRs) have been widely used in human mobility research. Although CDRs are originally collected for billing purposes, the vast amount of digital footprints generated by calling and texting activities provide useful insights into population movement. However, can we fully trust CDRs given the uneven distribution of people’s phone communication activities in space and time? In this article, we investigate this issue using a mobile phone location dataset collected from over one million subscribers in Shanghai, China. It includes CDRs (~27%) plus other cellphone-related logs (e.g., tower pings, cellular handovers) generated in a workday. We extract all CDRs into a separate dataset in order to compare human mobility patterns derived from CDRs vs. from the complete dataset. From an individual perspective, the effectiveness of CDRs in estimating three frequently used mobility indicators is evaluated. We find that CDRs tend to underestimate the total travel distance and the movement entropy, while they can provide a good estimate to the radius of gyration. In addition, we observe that the level of deviation is related to the ratio of CDRs in an individual’s trajectory. From a collective perspective, we compare the outcomes of these two datasets in terms of the distance decay effect and urban community detection. The major differences are closely related to the habit of mobile phone usage in space and time. We believe that the event-triggered nature of CDRs does introduce a certain degree of bias in human mobility research and we suggest that researchers use caution to interpret results derived from CDR data.

[1]  Chenghu Zhou,et al.  A new insight into land use classification based on aggregated mobile phone data , 2013, Int. J. Geogr. Inf. Sci..

[2]  Michael Blakemore,et al.  The accuracy of spatial databases , 1991 .

[3]  Song Gao,et al.  Discovering Spatial Interaction Communities from Mobile Phone Data , 2013 .

[4]  David L Buckeridge,et al.  Residential address errors in public health surveillance data: a description and analysis of the impact on geocoding. , 2010, Spatial and spatio-temporal epidemiology.

[5]  O. Järv,et al.  Using Mobile Positioning Data to Model Locations Meaningful to Users of Mobile Phones , 2010 .

[6]  Y. Kestens,et al.  Conceptualization and measurement of environmental exposure in epidemiology: accounting for activity space related to daily mobility. , 2013, Health & place.

[7]  Martin Raubal,et al.  Correlating mobile phone usage and travel behavior - A case study of Harbin, China , 2012, Comput. Environ. Urban Syst..

[8]  Yu Liu,et al.  Towards Estimating Urban Population Distributions from Mobile Call Data , 2012 .

[9]  Carlo Ratti,et al.  Quantifying urban attractiveness from the distribution and density of digital footprints , 2009, Int. J. Spatial Data Infrastructures Res..

[10]  A. Pozdnoukhov,et al.  Spatial structure and dynamics of urban communities , 2011 .

[11]  Albert-László Barabási,et al.  Understanding individual human mobility patterns , 2008, Nature.

[12]  T. Geisel,et al.  The scaling laws of human travel , 2006, Nature.

[13]  Michael Batty The Pulse of the City , 2010 .

[14]  Peter A. Vanrolleghem,et al.  Uncertainty in the environmental modelling process - A framework and guidance , 2007, Environ. Model. Softw..

[15]  Brent J. Hecht,et al.  A Tale of Cities: Urban Biases in Volunteered Geographic Information , 2014, ICWSM.

[16]  Albert-László Barabási,et al.  Limits of Predictability in Human Mobility , 2010, Science.

[17]  W. Tobler A Computer Movie Simulating Urban Growth in the Detroit Region , 1970 .

[18]  W. Gesler,et al.  International Journal of Health Geographics a Suite of Methods for Representing Activity Space in a Healthcare Accessibility Study , 2022 .

[19]  Zbigniew Smoreda,et al.  Moving and Calling: Mobile Phone Data Quality Measurements and Spatiotemporal Uncertainty in Human Mobility Studies , 2013, AGILE Conf..

[20]  Daqiang Zhang,et al.  Predicting Mobile Phone User Locations by Exploiting Collective Behavioral Patterns , 2012, 2012 9th International Conference on Ubiquitous Intelligence and Computing and 9th International Conference on Autonomic and Trusted Computing.

[21]  Margaret Martonosi,et al.  ON CELLULAR , 2022 .

[22]  Daniel A. Griffith,et al.  Impacts of Positional Error on Spatial Regression Analysis: A Case Study of Address Locations in Syracuse, New York , 2007, Trans. GIS.

[23]  Yong Wang,et al.  Mapping the Internet using GIS: The death of distance hypothesis revisited , 2003, J. Geogr. Syst..

[24]  Alexei Pozdnoukhov,et al.  Spatial structure and dynamics of urbancommunities , 2011 .

[25]  Carlo Ratti,et al.  The Geography of Taste: Analyzing Cell-Phone Mobility and Social Events , 2010, Pervasive.

[26]  JoEllen Wilbur,et al.  Activity space environment and dietary and physical activity behaviors: a pilot study. , 2011, Health & place.

[27]  Alex Pang,et al.  Visualizing Uncertainty in Geo-spatial Data , 2001 .

[28]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[29]  Yihong Yuan,et al.  Analyzing and geo-visualizing individual human mobility patterns using mobile call records , 2010, 2010 18th International Conference on Geoinformatics.

[30]  Piet Rietveld,et al.  Transport in regional science: The “death of distance” is premature , 2003 .

[31]  Carlo Ratti,et al.  Eigenplaces: Segmenting Space through Digital Signatures , 2010, IEEE Pervasive Computing.

[32]  Eric M. Delmelle,et al.  Visualizing the impact of space-time uncertainties on dengue fever patterns , 2014, Int. J. Geogr. Inf. Sci..

[33]  Mark Gahegan,et al.  Visualizing Geospatial Information Uncertainty: What We Know and What We Need to Know , 2005 .

[34]  G. Jacquez A research agenda: does geocoding positional error matter in health GIS studies? , 2012, Spatial and spatio-temporal epidemiology.

[35]  L. A. Brown,et al.  The Intra-Urban Migration Process: a Perspective , 1970 .

[36]  Zbigniew Smoreda,et al.  Urban Mobility: Velocity and Uncertainty in Mobile Phone Data , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[37]  F. E. Horton,et al.  Effects of Urban Spatial Structure on Individual Behavior , 1971 .

[38]  R. Golledge,et al.  Spatial Behavior: A Geographic Perspective , 1996 .

[39]  Martin Raubal,et al.  Extracting Dynamic Urban Mobility Patterns from Mobile Phone Data , 2012, GIScience.

[40]  Helen Couclelis,et al.  The Certainty of Uncertainty: GIS and the Limits of Geographic Knowledge , 2003 .

[41]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[42]  A. Hernández-Vásquez,et al.  [Geographic information systems]. , 2016, Revista peruana de medicina experimental y salud publica.

[43]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[44]  Chaoming Song,et al.  Modelling the scaling properties of human mobility , 2010, 1010.0436.

[45]  L. Bengtsson,et al.  Improved Response to Disasters and Outbreaks by Tracking Population Movements with Mobile Phone Network Data: A Post-Earthquake Geospatial Study in Haiti , 2011, PLoS medicine.

[46]  Yuan Tian,et al.  Understanding intra-urban trip patterns from taxi trajectory data , 2012, J. Geogr. Syst..

[47]  Torsten Hägerstraand WHAT ABOUT PEOPLE IN REGIONAL SCIENCE , 1970 .

[48]  Sune Lehmann,et al.  Understanding the Demographics of Twitter Users , 2011, ICWSM.

[49]  Holger R. Maier,et al.  Future research challenges for incorporation of uncertainty in environmental and ecological decision-making , 2008 .

[50]  Carlo Ratti,et al.  Cellular Census: Explorations in Urban Data Collection , 2007, IEEE Pervasive Computing.

[51]  Hillel Bar-Gera,et al.  Evaluation of a Cellular Phone-Based System for Measurements of Traffic Speeds and Travel Times: A Case Study from Israel , 2007 .

[52]  Qingquan Li,et al.  Understanding aggregate human mobility patterns using passive mobile phone location data: a home-based approach , 2015, Transportation.

[53]  Robert Brewer,et al.  Evaluation of Cell Phone Traffic Data in Minnesota , 2008 .

[54]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[55]  Injong Rhee,et al.  On the levy-walk nature of human mobility , 2011, TNET.