Identifying Important Places in People's Lives from Cellular Network Data

People spend most of their time at a few key locations, such as home and work. Being able to identify how the movements of people cluster around these "important places" is crucial for a range of technology and policy decisions in areas such as telecommunications and transportation infrastructure deployment. In this paper, we propose new techniques based on clustering and regression for analyzing anonymized cellular network data to identify generally important locations, and to discern semantically meaningful locations such as home and work. Starting with temporally sparse and spatially coarse location information, we propose a new algorithm to identify important locations. We test this algorithm on arbitrary cellphone users, including those with low call rates, and find that we are within 3 miles of ground truth for 88% of volunteer users. Further, after locating home and work, we achieve commute distance estimates that are within 1 mile of equivalent estimates derived from government census data. Finally, we perform carbon footprint analyses on hundreds of thousands of anonymous users as an example of how our data and algorithms can form an accurate and efficient underpinning for policy and infrastructure studies.
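As a rough illustration of the kind of pipeline described above, the following minimal sketch groups nearby cell towers into candidate places and measures the distance between two of them. It is not the paper's algorithm, whose details the abstract does not give. The greedy leader clustering is a generic stand-in, and the names `haversine_miles` and `cluster_towers`, the 1-mile radius, and the use of "days seen" as a weight are all assumptions made for this example.

```python
import math

EARTH_RADIUS_MILES = 3958.8  # mean Earth radius


def haversine_miles(a, b):
    """Great-circle distance in miles between two (lat, lon) points in degrees."""
    lat1, lon1 = map(math.radians, a)
    lat2, lon2 = map(math.radians, b)
    dlat = lat2 - lat1
    dlon = lon2 - lon1
    h = (math.sin(dlat / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin(dlon / 2) ** 2)
    return 2 * EARTH_RADIUS_MILES * math.asin(math.sqrt(h))


def cluster_towers(towers, radius_miles=1.0):
    """Greedy leader clustering (a hypothetical stand-in, not the paper's method).

    towers: list of ((lat, lon), weight), where weight might be the number of
    days on which the user contacted that tower (an assumption).
    Towers are visited in decreasing weight. Each one joins the first cluster
    whose centroid lies within radius_miles; otherwise it starts a new cluster.
    Returns a list of dicts with 'centroid' and 'weight'.
    """
    clusters = []
    for (lat, lon), w in sorted(towers, key=lambda t: -t[1]):
        for c in clusters:
            if haversine_miles((lat, lon), c["centroid"]) <= radius_miles:
                total = c["weight"] + w
                clat, clon = c["centroid"]
                # Update the weighted centroid of the cluster.
                c["centroid"] = ((clat * c["weight"] + lat * w) / total,
                                 (clon * c["weight"] + lon * w) / total)
                c["weight"] = total
                break
        else:
            clusters.append({"centroid": (lat, lon), "weight": w})
    return clusters


# Made-up tower data for one user: two towers near one place, one tower far away.
towers = [((40.0, -74.0), 10), ((40.001, -74.001), 5), ((40.5, -74.5), 8)]
places = cluster_towers(towers)
# If the two heaviest clusters had been labeled home and work (labeling is not
# shown here), a commute distance estimate would be the distance between them.
commute = haversine_miles(places[0]["centroid"], places[1]["centroid"])
```

The sketch uses straight-line distance between cluster centroids. Deciding which cluster is home and which is work would need temporal features such as the hours at which each tower is used, and that step is left out here.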
