Trajectory Data Mining

The advances in location-acquisition and mobile computing techniques have generated massive spatial trajectory data, which represent the mobility of a diversity of moving objects, such as people, vehicles, and animals. Many techniques have been proposed for processing, managing, and mining trajectory data in the past decade, fostering a broad range of applications. In this article, we conduct a systematic survey on the major research into trajectory data mining, providing a panorama of the field as well as the scope of its research topics. Following a road map from the derivation of trajectory data, to trajectory data preprocessing, to trajectory data management, and to a variety of mining tasks (such as trajectory pattern mining, outlier detection, and trajectory classification), the survey explores the connections, correlations, and differences among these existing techniques. This survey also introduces the methods that transform trajectories into other data formats, such as graphs, matrices, and tensors, to which more data mining and machine learning techniques can be applied. Finally, some public trajectory datasets are presented. This survey can help shape the field of trajectory data mining, providing a quick understanding of this field to the community.

[1]  Eric Horvitz,et al.  LOCADIO: inferring motion and location from Wi-Fi signal strengths , 2004, The First Annual International Conference on Mobile and Ubiquitous Systems: Networking and Services, 2004. MOBIQUITOUS 2004..

[2]  Bettina Speckmann,et al.  Efficient detection of motion patterns in spatio-temporal data sets , 2004, GIS '04.

[3]  Xing Xie,et al.  Collaborative location and activity recommendations with GPS history data , 2010, WWW '10.

[4]  Nicholas Jing Yuan,et al.  On discovery of gathering patterns from trajectories , 2013, 2013 IEEE 29th International Conference on Data Engineering (ICDE).

[5]  Heng Tao Shen,et al.  Searching trajectories by locations: an efficiency study , 2010, SIGMOD Conference.

[6]  Xing Xie,et al.  Mining interesting locations and travel sequences from GPS trajectories , 2009, WWW '09.

[7]  Christos Faloutsos,et al.  Efficient Similarity Search In Sequence Databases , 1993, FODO.

[8]  Eamonn J. Keogh,et al.  An online algorithm for segmenting time series , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[9]  Xing Xie,et al.  Social itinerary recommendation from user-generated digital trails , 2012, Personal and Ubiquitous Computing.

[10]  Xing Xie,et al.  Mining correlation between locations using human location history , 2009, GIS.

[11]  Ahmed Eldawy,et al.  A demonstration of MNTG - A web-based road network traffic generator , 2014, 2014 IEEE 30th International Conference on Data Engineering.

[12]  Archan Misra,et al.  TODMIS: mining communities from trajectories , 2013, CIKM.

[13]  Padhraic Smyth,et al.  Trajectory clustering with mixtures of regression models , 1999, KDD '99.

[14]  ChenLei,et al.  Probabilistic nearest neighbor queries on uncertain moving object trajectories , 2013, VLDB 2013.

[15]  Xing Xie,et al.  Mining user similarity based on location history , 2008, GIS '08.

[16]  Alok N. Choudhary,et al.  Uncertain Range Queries for Necklaces , 2010, 2010 Eleventh International Conference on Mobile Data Management.

[17]  Ralf Hartmut Güting,et al.  BerlinMOD: a benchmark for moving object databases , 2009, The VLDB Journal.

[18]  Heng Tao Shen,et al.  Discovering popular routes from trajectories , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[19]  Zhi-Hua Zhou,et al.  iBAT: detecting anomalous taxi trajectories from GPS traces , 2011, UbiComp '11.

[20]  Xing Xie,et al.  Sensing the pulse of urban refueling behavior , 2013, UbiComp.

[21]  Qiming Chen,et al.  PrefixSpan,: mining sequential patterns efficiently by prefix-projected pattern growth , 2001, Proceedings 17th International Conference on Data Engineering.

[22]  Yong Yu,et al.  Inferring gas consumption and pollution emission of vehicles throughout a city , 2014, KDD.

[23]  Timos K. Sellis,et al.  Sampling Trajectory Streams with Spatiotemporal Criteria , 2006, 18th International Conference on Scientific and Statistical Database Management (SSDBM'06).

[24]  Xing Xie,et al.  Inferring Taxi Status Using GPS Trajectories , 2012, ArXiv.

[25]  Xing Xie,et al.  GeoLife2.0: A Location-Based Social Networking Service , 2009, 2009 Tenth International Conference on Mobile Data Management: Systems, Services and Middleware.

[26]  Chi-Yin Chow,et al.  Probabilistic Verifiers: Evaluating Constrained Nearest-Neighbor Queries over Uncertain Data , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[27]  J. Hershberger,et al.  Speeding Up the Douglas-Peucker Line-Simplification Algorithm , 1992 .

[28]  Kai Zheng,et al.  Calibrating trajectory data for similarity-based analysis , 2013, SIGMOD '13.

[29]  Yu Zheng,et al.  Location-Based Social Networks: Users , 2011, Computing with Spatial Trajectories.

[30]  Christian S. Jensen,et al.  Techniques for efficient road-network-based tracking of moving objects , 2005, IEEE Transactions on Knowledge and Data Engineering.

[31]  Yanchi Liu,et al.  Diagnosing New York city's noises with ubiquitous data , 2014, UbiComp.

[32]  Jae-Gil Lee,et al.  Trajectory Outlier Detection: A Partition-and-Detect Framework , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[33]  Yu Zheng,et al.  Constructing popular routes from uncertain trajectories , 2012, KDD.

[34]  Wei-Ying Ma,et al.  Recommending friends and locations based on individual location history , 2011, ACM Trans. Web.

[35]  Dieter Pfoser,et al.  Novel Approaches in Query Processing for Moving Object Trajectories , 2000, VLDB 2000.

[36]  Wei-Ying Ma,et al.  Understanding mobility based on GPS data , 2008, UbiComp.

[37]  Xing Xie,et al.  Smart Itinerary Recommendation Based on User-Generated GPS Trajectories , 2010, UIC.

[38]  Ouri Wolfson,et al.  Proceedings of the 2nd ACM SIGKDD International Workshop on Urban Computing , 2013, KDD 2013.

[39]  Heng Tao Shen,et al.  Convoy Queries in Spatio-Temporal Databases , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[40]  Hai Yang,et al.  ACM Transactions on Intelligent Systems and Technology - Special Section on Urban Computing , 2014 .

[41]  Xing Xie,et al.  Understanding transportation modes based on GPS data for web applications , 2010, TWEB.

[42]  Yücel Saygin,et al.  Towards trajectory anonymization: a generalization-based approach , 2008, SPRINGL '08.

[43]  Xing Xie,et al.  T-drive: driving directions based on taxi trajectories , 2010, GIS '10.

[44]  Jiawei Han,et al.  Mining event periodicity from incomplete observations , 2012, KDD.

[45]  Chengyang Zhang,et al.  Map-matching for low-sampling-rate GPS trajectories , 2009, GIS.

[46]  Thomas Brinkhoff,et al.  A Framework for Generating Network-Based Moving Objects , 2002, GeoInformatica.

[47]  Xing Xie,et al.  A Flexible Spatio-Temporal Indexing Scheme for Large-Scale GPS Track Retrieval , 2008, The Ninth International Conference on Mobile Data Management (mdm 2008).

[48]  Henry A. Kautz,et al.  Learning and inferring transportation routines , 2004, Artif. Intell..

[49]  Klaus H. Hinrichs,et al.  Managing uncertainty in moving objects databases , 2004, TODS.

[50]  Christos Faloutsos,et al.  Efficient retrieval of similar time sequences under time warping , 1998, Proceedings 14th International Conference on Data Engineering.

[51]  Nicholas Jing Yuan,et al.  T-Finder: A Recommender System for Finding Passengers and Vacant Taxis , 2013, IEEE Transactions on Knowledge and Data Engineering.

[52]  John Krumm Trajectory Analysis for Driving , 2011, Computing with Spatial Trajectories.

[53]  Xing Xie,et al.  Inferring social ties between users with human location history , 2014, J. Ambient Intell. Humaniz. Comput..

[54]  M. A. ShehnazBegum,et al.  T-Drive: Enhancing Driving Directions with Taxi Drivers' Intelligence , 2014 .

[55]  Lei Chen,et al.  Robust and fast similarity search for moving object trajectories , 2005, SIGMOD '05.

[56]  Xing Xie,et al.  Learning transportation mode from raw gps data for geographic applications on the web , 2008, WWW.

[57]  Xing Xie,et al.  Collaborative Filtering Meets Mobile Recommendation: A User-Centered Approach , 2010, AAAI.

[58]  Xing Xie,et al.  Learning travel recommendations from user-generated GPS traces , 2011, TIST.

[59]  Jie Bao,et al.  A Survey on Recommendations in Location-based Social Networks , 2013 .

[60]  Dino Pedreschi,et al.  Trajectory pattern mining , 2007, KDD '07.

[61]  Xing Xie,et al.  Finding similar users using category-based location history , 2010, GIS '10.

[62]  Yufei Tao,et al.  MV3R-Tree: A Spatio-Temporal Access Method for Timestamp and Interval Queries , 2001, VLDB.

[63]  Hui Xiong,et al.  Achieving Guaranteed Anonymity in GPS Traces via Uncertainty-Aware Path Cloaking , 2010, IEEE Transactions on Mobile Computing.

[64]  Sanjay Chawla,et al.  Inferring the Root Cause in Road Traffic Anomalies , 2012, 2012 IEEE 12th International Conference on Data Mining.

[65]  Oliver Pink,et al.  A statistical approach to map matching using road network geometry, topology and vehicular motion constraints , 2008, 2008 11th International IEEE Conference on Intelligent Transportation Systems.

[66]  Philip S. Yu,et al.  InfoMiner+: mining partial periodic patterns with gap penalties , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[67]  Jae-Gil Lee,et al.  Incremental Clustering for Trajectories , 2010, DASFAA.

[68]  Lei Chen,et al.  On The Marriage of Lp-norms and Edit Distance , 2004, VLDB.

[69]  Wang-Chien Lee,et al.  Trajectory Preprocessing , 2011, Computing with Spatial Trajectories.

[70]  Wei-Ying Ma,et al.  A Cloud-Based Knowledge Discovery System for Monitoring Fine-Grained Air Quality , 2014 .

[71]  Xing Xie,et al.  Where to find my next passenger , 2011, UbiComp '11.

[72]  Sanjay Chawla,et al.  On detection of emerging anomalous traffic patterns using GPS data , 2013, Data Knowl. Eng..

[73]  Mohamed F. Mokbel,et al.  Location-based and preference-aware recommendation using sparse geo-social networking data , 2012, SIGSPATIAL/GIS.

[74]  J. Greenfeld MATCHING GPS OBSERVATIONS TO LOCATIONS ON A DIGITAL MAP , 2002 .

[75]  Cyrus Shahabi,et al.  Crowd sensing of traffic anomalies based on human mobility and social media , 2013, SIGSPATIAL/GIS.

[76]  Qiang Yang,et al.  High-Level Goal Recognition in a Wireless LAN , 2004, AAAI.

[77]  Washington Y. Ochieng,et al.  MAP-MATCHING IN COMPLEX URBAN ROAD NETWORKS , 2009, Revista Brasileira de Cartografia.

[78]  Nicholas Jing Yuan,et al.  Online Discovery of Gathering Patterns over Trajectories , 2014, IEEE Transactions on Knowledge and Data Engineering.

[79]  Xing Xie,et al.  Reducing Uncertainty of Low-Sampling-Rate Trajectories , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[80]  Xiaofang Zhou,et al.  Trajectory Indexing and Retrieval , 2011, Computing with Spatial Trajectories.

[81]  Licia Capra,et al.  Urban Computing: Concepts, Methodologies, and Applications , 2014, TIST.

[82]  Padhraic Smyth,et al.  A general probabilistic framework for clustering individuals and objects , 2000, KDD '00.

[83]  Hui Xiong,et al.  Exploiting geographic dependencies for real estate appraisal: a mutual perspective of ranking and clustering , 2014, KDD.

[84]  John Krumm,et al.  Hidden Markov map matching through noise and sparseness , 2009, GIS.

[85]  Dieter Pfoser,et al.  On Map-Matching Vehicle Tracking Data , 2005, VLDB.

[86]  Robert B McMaster,et al.  A Statistical Analysis of Mathematical Measures for Linear Simplification , 1986 .

[87]  Yu Zheng,et al.  Tutorial on Location-Based Social Networks , 2012 .

[88]  Xing Xie,et al.  Mining Individual Life Pattern Based on Location History , 2009, 2009 Tenth International Conference on Mobile Data Management: Systems, Services and Middleware.

[89]  Dieter Pfoser,et al.  Capturing the Uncertainty of Moving-Object Representations , 1999, SSD.

[90]  Christian S. Jensen,et al.  Discovery of convoys in trajectory databases , 2008, Proc. VLDB Endow..

[91]  Beng Chin Ooi,et al.  Continuous Clustering of Moving Objects , 2007, IEEE Transactions on Knowledge and Data Engineering.

[92]  Nikos Mamoulis,et al.  Mining frequent spatio-temporal sequential patterns , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[93]  Kai-Florian Richter,et al.  Semantic trajectory compression: Representing urban movement in a nutshell , 2012, J. Spatial Inf. Sci..

[94]  Jiawei Han,et al.  Swarm: Mining Relaxed Temporal Moving Object Clusters , 2010, Proc. VLDB Endow..

[95]  Xing Xie,et al.  An Interactive-Voting Based Map Matching Algorithm , 2010, 2010 Eleventh International Conference on Mobile Data Management.

[96]  Xing Xie,et al.  Towards mobile intelligence: Learning from GPS history data for collaborative recommendation , 2012, Artif. Intell..

[97]  Torben Bach Pedersen,et al.  Privacy-Preserving Data Mining on Moving Object Trajectories , 2007, 2007 International Conference on Mobile Data Management.

[98]  Jae-Gil Lee,et al.  Trajectory clustering: a partition-and-group framework , 2007, SIGMOD '07.

[99]  Xifeng Yan,et al.  CloSpan: Mining Closed Sequential Patterns in Large Datasets , 2003, SDM.

[100]  William G. Griswold,et al.  Mobility Detection Using Everyday GSM Traces , 2006, UbiComp.

[101]  Sanjay Chawla,et al.  On Mining Anomalous Patterns in Road Traffic Streams , 2011, ADMA.

[102]  Frank Stajano,et al.  Location Privacy in Pervasive Computing , 2003, IEEE Pervasive Comput..

[103]  Jiawei Han,et al.  Mining periodic behaviors for moving objects , 2010, KDD.

[104]  Ouri Wolfson,et al.  A weight-based map matching method in moving objects databases , 2004, Proceedings. 16th International Conference on Scientific and Statistical Database Management, 2004..

[105]  Weiwei Sun,et al.  PRESS: A Novel Framework of Trajectory Compression in Road Networks , 2014, Proc. VLDB Endow..

[106]  Hans-Peter Kriegel,et al.  Querying Uncertain Spatio-Temporal Data , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[107]  Ge Yu,et al.  Interval reverse nearest neighbor queries on uncertain data with Markov correlations , 2013, 2013 IEEE 29th International Conference on Data Engineering (ICDE).

[108]  Jing Yuan,et al.  On Discovery of Traveling Companions from Streaming Trajectories , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[109]  Nikos Mamoulis,et al.  Discovery of Periodic Patterns in Spatiotemporal Sequences , 2007, IEEE Transactions on Knowledge and Data Engineering.

[110]  Francesco Bonchi,et al.  Never Walk Alone: Uncertainty for Anonymity in Moving Objects Databases , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[111]  Christian S. Jensen,et al.  Trajectory Pattern Mining , 2011, Computing with Spatial Trajectories.

[112]  Xing Xie,et al.  Discovering spatio-temporal causal interactions in traffic data streams , 2011, KDD.

[113]  Tetsuji Satoh,et al.  An anonymous communication technique using dummies for location-based services , 2005, ICPS '05. Proceedings. International Conference on Pervasive Services, 2005..

[114]  Yu Zheng,et al.  T-share: A large-scale dynamic taxi ridesharing service , 2013, 2013 IEEE 29th International Conference on Data Engineering (ICDE).

[115]  Hans-Peter Kriegel,et al.  Probabilistic Nearest Neighbor Queries on Uncertain Moving Object Trajectories , 2013, Proc. VLDB Endow..

[116]  Lei Chen,et al.  Finding time period-based most frequent path in big trajectory data , 2013, SIGMOD '13.

[117]  Xing Xie,et al.  Urban computing with taxicabs , 2011, UbiComp '11.

[118]  Philip S. Yu,et al.  Infominer: mining surprising periodic patterns , 2001, KDD '01.

[119]  Yufei Tao,et al.  Efficient historical R-trees , 2001, Proceedings Thirteenth International Conference on Scientific and Statistical Database Management. SSDBM 2001.

[120]  Ramayya Krishnan,et al.  Fraud Detection From Taxis' Driving Behaviors , 2014, IEEE Transactions on Vehicular Technology.

[121]  Yu Zheng,et al.  Computing with Spatial Trajectories , 2011, Computing with Spatial Trajectories.

[122]  Dieter Pfoser,et al.  Novel Approaches to the Indexing of Moving Object Trajectories , 2000, VLDB.

[123]  Robert B. Noland,et al.  A High Accuracy Fuzzy Logic Based Map Matching Algorithm for Road Transport , 2006, J. Intell. Transp. Syst..

[124]  Xing Xie,et al.  Retrieving k-Nearest Neighboring Trajectories by a Set of Point Locations , 2011, SSTD.

[125]  Guangzhong Sun,et al.  Driving with knowledge from the physical world , 2011, KDD.

[126]  Jing Yuan,et al.  A framework of traveling companion discovery on trajectory data streams , 2013, ACM Trans. Intell. Syst. Technol..

[127]  S.S. Chawathe,et al.  Segment-Based Map Matching , 2007, 2007 IEEE Intelligent Vehicles Symposium.

[128]  Yu Zheng,et al.  Travel time estimation of a path using sparse trajectories , 2014, KDD.

[129]  Hui Xiong,et al.  Sparse Real Estate Ranking with Online User Reviews and Offline Moving Behaviors , 2014, 2014 IEEE International Conference on Data Mining.

[130]  Sami Faïz,et al.  Clustering Algorithm for Network Constraint Trajectories , 2008, SDH.

[131]  Yufei Tao,et al.  Continuous Nearest Neighbor Search , 2002, VLDB.

[132]  Nirvana Meratnia,et al.  Spatiotemporal Compression Techniques for Moving Point Objects , 2004, EDBT.

[133]  Dino Pedreschi,et al.  Discovering the Geographical Borders of Human Mobility , 2012, KI - Künstliche Intelligenz.

[134]  Henry A. Kautz,et al.  Inferring High-Level Behavior from Low-Level Sensors , 2003, UbiComp.

[135]  Walid G. Aref,et al.  Casper*: Query processing for location services without compromising privacy , 2006, TODS.

[136]  Nicholas Jing Yuan,et al.  Segmentation of Urban Areas Using Road Networks , 2012 .

[137]  Philip S. Yu,et al.  A Framework for Clustering Evolving Data Streams , 2003, VLDB.

[138]  Umeshwar Dayal,et al.  PrefixSpan: Mining Sequential Patterns by Prefix-Projected Growth , 2001, ICDE 2001.

[139]  Yu Zheng,et al.  U-Air: when urban air quality inference meets big data , 2013, KDD.

[140]  David H. Douglas,et al.  ALGORITHMS FOR THE REDUCTION OF THE NUMBER OF POINTS REQUIRED TO REPRESENT A DIGITIZED LINE OR ITS CARICATURE , 1973 .

[141]  Philip S. Yu,et al.  Mining Asynchronous Periodic Patterns in Time Series Data , 2003, IEEE Trans. Knowl. Data Eng..

[142]  Nikos Pelekis,et al.  Trajectory Compression under Network Constraints , 2009, SSTD.

[143]  Nenghai Yu,et al.  Trajectory simplification method for location-based social networking services , 2009, LBSN '09.

[144]  Roberto Tamassia,et al.  Continuous probabilistic nearest-neighbor queries for uncertain trajectories , 2009, EDBT '09.

[145]  Günter Rote,et al.  Matching planar maps , 2003, SODA '03.

[146]  Xing Xie,et al.  Discovering regions of different functions in a city using human mobility and POIs , 2012, KDD.

[147]  Nikos Mamoulis,et al.  Privacy Preservation in the Publication of Trajectories , 2008, The Ninth International Conference on Mobile Data Management (mdm 2008).

[148]  Chi-Yin Chow,et al.  Privacy of Spatial Trajectories , 2011, Computing with Spatial Trajectories.

[149]  Anna Monreale,et al.  WhereNext: a location predictor on trajectory pattern mining , 2009, KDD.

[150]  Hui Xiong,et al.  Discovering Urban Functional Zones Using Latent Activity Trajectories , 2015, IEEE Transactions on Knowledge and Data Engineering.

[151]  Xing Xie,et al.  Destination prediction by sub-trajectory synthesis and privacy protection against such prediction , 2013, 2013 IEEE 29th International Conference on Data Engineering (ICDE).

[152]  Xing Xie,et al.  GeoLife: A Collaborative Social Networking Service among User, Location and Trajectory , 2010, IEEE Data Eng. Bull..

[153]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[154]  Joachim Gudmundsson,et al.  Computing longest duration flocks in trajectory data , 2006, GIS '06.

[155]  Huidong Jin,et al.  PutMode: prediction of uncertain trajectories in moving objects databases , 2010, Applied Intelligence.

[156]  Sunil Prabhakar,et al.  Querying imprecise data in moving object environments , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[157]  Lars Kulik,et al.  Privacy-Aware Traffic Monitoring , 2010, IEEE Transactions on Intelligent Transportation Systems.

[158]  Mohamed F. Mokbel,et al.  Recommendations in location-based social networks: a survey , 2015, GeoInformatica.

[159]  Yu Zheng,et al.  Real-Time City-Scale Taxi Ridesharing , 2015, IEEE Transactions on Knowledge and Data Engineering.