Extracting Stops from Noisy Trajectories: A Sequence Oriented Clustering Approach

Trajectories, representing the movements of objects in the real world, carry significant stop/move semantics. The detection of trajectory stops poses a critical problem in the study of moving objects and becomes even more challenging due to the inevitable noise recorded along with true data. To extract stops with a variety of shapes and sizes from single trajectories with noise, this paper presents a sequence oriented clustering approach, in which noise points within the sequence of a stop can be identified and classified as a part of the stop. In our method, two key concepts are first introduced: (1) a core sequence that defines sequence density based not only on proximity in space but also continuity in time as well as the duration over time; and (2) an Eps-reachability sequence that aggregates core sequences that overlap or meet over time. Then, three criteria are presented to merge Eps-reachability sequences interrupted by noise. Further, an algorithm, called SOC (Sequence Oriented Clustering), is developed to automatically extract stops from a single trajectory. In addition, a reachability graph is designed that visually illustrates the spatio-temporal clustering structure and levels of a trajectory. Finally, the proposed algorithm is evaluated against two baseline methods through extensive experiments based on real world trajectories, some with serious noise, and the results show that our approach is fairly effective in recognizing trajectory stops.

[1]  Hans-Peter Kriegel,et al.  OPTICS: ordering points to identify the clustering structure , 1999, SIGMOD '99.

[2]  Stefano Spaccapietra,et al.  Semantic trajectories modeling and analysis , 2013, CSUR.

[3]  Fabio Porto,et al.  A conceptual view on trajectories , 2008, Data Knowl. Eng..

[4]  Vania Bogorny,et al.  A model for enriching trajectories with semantic geographical information , 2007, GIS.

[5]  Ralf Hartmut Güting,et al.  A generic data model for moving objects , 2012, GeoInformatica.

[6]  Robert Weibel,et al.  Towards a taxonomy of movement patterns , 2008, Inf. Vis..

[7]  Kay W. Axhausen,et al.  Processing Raw Data from Global Positioning Systems without Additional Information , 2009 .

[8]  Samuel Madden,et al.  TrajStore: An adaptive storage system for very large trajectory data sets , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[9]  Zhixian Yan,et al.  Robust and Hierarchical Stop Discovery in Sparse and Diverse Trajectories , 2011 .

[10]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[11]  Stefano Spaccapietra,et al.  Semantic trajectories: Mobility data computation and annotation , 2013, TIST.

[12]  J. Wolf Applications of New Technologies in Travel Surveys , 2006 .

[13]  Xing Xie,et al.  Mining interesting locations and travel sequences from GPS trajectories , 2009, WWW '09.

[14]  Vania Bogorny,et al.  A clustering-based approach for discovering interesting places in trajectories , 2008, SAC '08.

[15]  Maike Buchin,et al.  Segmenting trajectories: A framework and algorithms using spatiotemporal criteria , 2011, J. Spatial Inf. Sci..

[16]  Stan Matwin,et al.  GRASP-UTS: an algorithm for unsupervised trajectory segmentation , 2015, Int. J. Geogr. Inf. Sci..

[17]  George Kollios,et al.  Complex Spatio-Temporal Pattern Queries , 2005, VLDB.

[18]  Dino Pedreschi,et al.  Trajectory pattern mining , 2007, KDD '07.

[19]  Ralf Hartmut Güting,et al.  Spatiotemporal pattern queries , 2011, GeoInformatica.

[20]  Jae-Gil Lee,et al.  Trajectory clustering: a partition-and-group framework , 2007, SIGMOD '07.

[21]  Derya Birant,et al.  ST-DBSCAN: An algorithm for clustering spatial-temporal data , 2007, Data Knowl. Eng..

[22]  Christian S. Jensen,et al.  Mining significant semantic locations from GPS data , 2010, Proc. VLDB Endow..

[23]  Markus Schneider,et al.  A foundation for representing and querying moving objects , 2000, TODS.

[24]  Randall Guensler,et al.  Accuracy of Global Positioning System for Determining Driver Performance Parameters , 2002 .

[25]  Mahmoud A. Sakr,et al.  INFORMATIK BERICHTE 358 – 04 / 2011 Group Spatiotemporal Pattern Queries , 2011 .

[26]  M. Spiliopoulou,et al.  Finding Stops in Error-Prone Trajectories of Moving Objects with Time-Based Clustering , 2009, IMC.

[27]  Xing Xie,et al.  Mining user similarity based on location history , 2008, GIS '08.

[28]  Valéria Cesário Times,et al.  DB-SMoT: A direction-based spatio-temporal clustering method , 2010, 2010 5th IEEE International Conference Intelligent Systems.

[29]  Eric Horvitz,et al.  Predestination: Inferring Destinations from Partial Trajectories , 2006, UbiComp.