Unveiling the complexity of human mobility by querying and mining massive trajectory data

The technologies of mobile communications pervade our society and wireless networks sense the movement of people, generating large volumes of mobility data, such as mobile phone call records and Global Positioning System (GPS) tracks. In this work, we illustrate the striking analytical power of massive collections of trajectory data in unveiling the complexity of human mobility. We present the results of a large-scale experiment, based on the detailed trajectories of tens of thousands private cars with on-board GPS receivers, tracked during weeks of ordinary mobile activity. We illustrate the knowledge discovery process that, based on these data, addresses some fundamental questions of mobility analysts: what are the frequent patterns of people’s travels? How big attractors and extraordinary events influence mobility? How to predict areas of dense traffic in the near future? How to characterize traffic jams and congestions? We also describe M-Atlas, the querying and mining language and system that makes this analytical process possible, providing the mechanisms to master the complexity of transforming raw GPS tracks into mobility knowledge. M-Atlas is centered onto the concept of a trajectory, and the mobility knowledge discovery process can be specified by M-Atlas queries that realize data transformations, data-driven estimation of the parameters of the mining methods, the quality assessment of the obtained results, the quantitative and visual exploration of the discovered behavioral patterns and models, the composition of mined patterns, models and data with further analyses and mining, and the incremental mining strategies to address scalability.

[1]  Dino Pedreschi,et al.  Interactive visual clustering of large collections of trajectories , 2009, 2009 IEEE Symposium on Visual Analytics Science and Technology.

[2]  Anna Monreale,et al.  WhereNext: a location predictor on trajectory pattern mining , 2009, KDD.

[3]  Miriam Baglioni,et al.  Towards Semantic Interpretation of Movement Behavior , 2009, AGILE Conf..

[4]  Nikos Pelekis,et al.  Hermes - A Framework for Location-Based Data Management , 2006, EDBT.

[5]  Joachim Gudmundsson,et al.  Computing longest duration flocks in trajectory data , 2006, GIS '06.

[6]  Panos Kalnis,et al.  On Discovering Moving Clusters in Spatio-temporal Data , 2005, SSTD.

[7]  Laks V. S. Lakshmanan,et al.  The 3W Model and Algebra for Unified Data Mining , 2000, VLDB.

[8]  Chaoming Song,et al.  Modelling the scaling properties of human mobility , 2010, 1010.0436.

[9]  Joachim Gudmundsson,et al.  Reporting flock patterns , 2006, Comput. Geom..

[10]  Hans-Peter Kriegel,et al.  OPTICS: ordering points to identify the clustering structure , 1999, SIGMOD '99.

[11]  Heikki Mannila,et al.  A database perspective on knowledge discovery , 1996, CACM.

[12]  Heng Tao Shen,et al.  Searching trajectories by locations: an efficiency study , 2010, SIGMOD Conference.

[13]  Xing Xie,et al.  Mining interesting locations and travel sequences from GPS trajectories , 2009, WWW '09.

[14]  Dino Pedreschi,et al.  A Query Language for Mobility Data Mining , 2011, Int. J. Data Warehous. Min..

[15]  S. Heinze,et al.  Spontaneous atomic-scale magnetic skyrmion lattice in two dimensions , 2011 .

[16]  Francesco Bonchi,et al.  Anonymization of moving objects databases by clustering and perturbation , 2010, Inf. Syst..

[17]  NanniMirco,et al.  Unveiling the complexity of human mobility by querying and mining massive trajectory data , 2011, VLDB 2011.

[18]  Anna Monreale,et al.  Movement data anonymity through generalization , 2009, SPRINGL '09.

[19]  Francesco Bonchi,et al.  Never Walk Alone: Uncertainty for Anonymity in Moving Objects Databases , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[20]  Dino Pedreschi,et al.  Trajectory pattern mining , 2007, KDD '07.

[21]  Markus Schneider,et al.  A foundation for representing and querying moving objects , 2000, TODS.

[22]  Albert-László Barabási,et al.  Understanding the Spreading Patterns of Mobile Phone Viruses , 2009, Science.

[23]  Carlo Zaniolo,et al.  ATLAS: A Small but Complete SQL Extension for Data Mining and Data Streams , 2003, VLDB.

[24]  Roberto Trasarti Mastering the Spatio-temporal Knowledge Discovery Process , 2010 .

[25]  Dino Pedreschi,et al.  Time-focused clustering of trajectories of moving objects , 2006, Journal of Intelligent Information Systems.

[26]  Padhraic Smyth,et al.  Trajectory clustering with mixtures of regression models , 1999, KDD '99.

[27]  Nikos Pelekis,et al.  The DAEDALUS framework: progressive querying and mining of movement data , 2008, GIS '08.

[28]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[29]  Albert-László Barabási,et al.  Understanding individual human mobility patterns , 2008, Nature.

[30]  T. Geisel,et al.  The scaling laws of human travel , 2006, Nature.

[31]  Stefan Wrobel,et al.  Visual analytics tools for analysis of movement data , 2007, SKDD.

[32]  Albert-László Barabási,et al.  Limits of Predictability in Human Mobility , 2010, Science.

[33]  Dino Pedreschi,et al.  Mobility, Data Mining and Privacy - Geographic Knowledge Discovery , 2008, Mobility, Data Mining and Privacy.

[34]  Jae-Gil Lee,et al.  MoveMine: mining moving object databases , 2010, SIGMOD Conference.

[35]  Fosca Giannotti,et al.  Mining mobility user profiles for car pooling , 2011, KDD.

[36]  Luc De Raedt,et al.  A Theory of Inductive Query Answering , 2010, Inductive Databases and Constraint-Based Data Mining.

[37]  Dino Pedreschi,et al.  Human mobility, social ties, and link prediction , 2011, KDD.

[38]  Samuel Madden,et al.  TrajStore: An adaptive storage system for very large trajectory data sets , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[39]  Xing Xie,et al.  Mining user similarity based on location history , 2008, GIS '08.

[40]  Dino Pedreschi,et al.  Advanced knowledge discovery on movement data with the GeoPKDD system , 2010, EDBT '10.

[41]  T. Imielinski,et al.  A database perspective on knowledge discovery : A database perspective on knowledge discovery , 1996 .

[42]  Jae-Gil Lee,et al.  Trajectory clustering: a partition-and-group framework , 2007, SIGMOD '07.