Breadcrumbs: A Rich Mobility Dataset with Point-of-Interest Annotations

Rich human mobility datasets are fundamental for evaluating algorithms pertaining to geographic information systems. Unfortunately, existing mobility datasets--that are available to the research community--are restricted to location data captured through a single sensor (typically GPS) and have a low spatiotemporal granularity. They also lack ground-truth data regarding points of interest and the associated semantic labels (e.g., "home", "work", etc.). In this paper, we present Breadcrumbs, a rich mobility dataset collected from multiple sensors (incl. GPS, GSM, WiFi, Bluetooth) on the smartphones of 81 individuals. In addition to sensor data, Breadcrumbs contains ground-truth data regarding people points of interest (incl. semantic labels) as well as demographic attributes, contact records, calendar events, lifestyle information, and social relationship labels between the participants of the study. We describe the data collection methodology and present a preliminary quantitative analysis of the dataset. A sanitized version of the dataset as well as the source code will be made available to the research community.

[1]  Michael Devetsikiotis,et al.  CRAWDAD dataset unm/blebeacon (v.2019-03-12) , 2019 .

[2]  Xing Xie,et al.  GeoLife: A Collaborative Social Networking Service among User, Location and Trajectory , 2010, IEEE Data Eng. Bull..

[3]  Daqing Zhang,et al.  Fine-grained preference-aware location search leveraging crowdsourced digital footprints from LBSNs , 2013, UbiComp.

[4]  Ben Mokhtar Sonia,et al.  The Long Road to Computational Location Privacy: A Survey , 2018, IEEE Communications Surveys & Tutorials.

[5]  Alex Pentl,et al.  Reality Mining of Mobile Communications: Toward A New Deal On Data , 2009 .

[6]  Shashi Shekhar,et al.  Discovering personal gazetteers: an interactive clustering approach , 2004, GIS '04.

[7]  Sébastien Gambs,et al.  Show me how you move and I will tell you who you are , 2010, SPRINGL '10.

[8]  Christophe Diot,et al.  CRAWDAD dataset thlab/sigcomm2009 (v.2012-07-15) , 2012 .

[9]  Alex Pentland,et al.  Reality mining: sensing complex social systems , 2006, Personal and Ubiquitous Computing.

[10]  Daqing Zhang,et al.  Modeling User Activity Preference by Leveraging User Spatial Temporal Characteristics in LBSNs , 2015, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[11]  Benoît Garbinato,et al.  Extracting Hotspots without A-priori by Enabling Signal Processing over Geospatial Data , 2017, SIGSPATIAL/GIS.

[12]  Alex Pentland,et al.  Once Upon a Crime: Towards Crime Prediction from Demographics and Mobile Data , 2014, ICMI.

[13]  Hervé Rivano,et al.  PRIVA'MOV: Analysing Human Mobility Through Multi-Sensor Datasets , 2017 .

[14]  Roberto Trasarti,et al.  Discovering and Understanding City Events with Big Data: The Case of Rome , 2017, Inf..

[15]  Li Tu,et al.  Density-based clustering for real-time stream data , 2007, KDD '07.

[16]  Jignesh M. Patel,et al.  Indexing Large Trajectory Data Sets With SETI , 2003, CIDR.

[17]  Jiawei Han,et al.  Mining periodic behaviors for moving objects , 2010, KDD.

[18]  D. Gática-Pérez,et al.  Towards rich mobile phone datasets: Lausanne data collection campaign , 2010 .

[19]  Tao Zhou,et al.  Diversity of individual mobility patterns and emergence of aggregated scaling laws , 2012, Scientific Reports.

[20]  Stefano Chessa,et al.  Mobile crowd sensing management with the ParticipAct living lab , 2017, Pervasive Mob. Comput..

[21]  W. Bruce Croft,et al.  Predicting query performance , 2002, SIGIR '02.

[22]  James Irvine,et al.  Nodobo: Mobile Phone as a Software Sensor for Social Network Research , 2011, 2011 IEEE 73rd Vehicular Technology Conference (VTC Spring).

[23]  Ciprian Dobre,et al.  CRAWDAD dataset upb/hyccups (v.2016-10-17) , 2016 .

[24]  Albert-László Barabási,et al.  The origin of bursts and heavy tails in human dynamics , 2005, Nature.

[25]  Lionel Brunie,et al.  The Long Road to Computational Location Privacy: A Survey , 2019, IEEE Communications Surveys & Tutorials.

[26]  Hugh F. Durrant-Whyte,et al.  Simultaneous Localization, Mapping and Moving Object Tracking , 2007, Int. J. Robotics Res..

[27]  Jure Leskovec,et al.  Friendship and mobility: user movement in location-based social networks , 2011, KDD.