Online clustering for trajectory data stream of moving objects

Trajectory data streams contain huge amounts of data pertaining to the time and position of moving objects. It is crucial to extract useful information from this peculiar kind of data in many application scenarios, such as vehicle traffic management, large-scale tracking management and video surveillance. This paper proposes a density-based clustering algorithm for trajectory data stream called CTraStream. It contains two stages: trajectory line segment stream clustering and online trajectory cluster updating. CTraStream handles the trajectory data of moving objects as an incremental line segment stream. For line segment stream clustering, we present a distance measurement approach between line segments. Incremental line segments are processed quickly based on previous line clusters in order to achieve clustering line segment stream online, and line-segment-clusters in a time interval are obtained on the fly. For online trajectory cluster updating, TC-Tree, an index structure, which stores all closed trajectory clusters, is designed. According to the linesegment-cluster set, the current closed trajectory clusters are updated online based on TC-Tree by performing proposed update rules. The algorithm has exhibited many advantages, such as high scalability to process incremental trajectory data streams and the ability to discover trajectory clusters in data streams in real time. Our performance evaluation experiments conducted on a number of real and synthetic trajectory datasets illustrate the effectiveness, efficiency, and scalability of the algorithm.

[1]  Daniel A. Keim,et al.  An Efficient Approach to Clustering in Large Multimedia Databases with Noise , 1998, KDD.

[2]  Gian Luca Foresti,et al.  On-line trajectory clustering for anomalous events detection , 2006, Pattern Recognit. Lett..

[3]  Mark Last,et al.  Incremental Clustering of Mobile Objects , 2007, 2007 IEEE 23rd International Conference on Data Engineering Workshop.

[4]  Jing Yuan,et al.  On Discovery of Traveling Companions from Streaming Trajectories , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[5]  Xing Xie,et al.  T-drive: driving directions based on taxi trajectories , 2010, GIS '10.

[6]  Sang-Wook Kim,et al.  Trajectory Clustering in Road Network Environment , 2009 .

[7]  Jae-Gil Lee,et al.  Trajectory clustering: a partition-and-group framework , 2007, SIGMOD '07.

[8]  Ee-Peng Lim,et al.  Mining Mobile Group Patterns: A Trajectory-Based Approach , 2005, PAKDD.

[9]  Christian S. Jensen,et al.  Discovery of convoys in trajectory databases , 2008, Proc. VLDB Endow..

[10]  Beng Chin Ooi,et al.  Continuous Clustering of Moving Objects , 2007, IEEE Transactions on Knowledge and Data Engineering.

[11]  ShenHeng Tao,et al.  Discovery of convoys in trajectory databases , 2008, VLDB 2008.

[12]  Jae-Gil Lee,et al.  TraClass: trajectory classification using hierarchical region-based and trajectory-based clustering , 2008, Proc. VLDB Endow..

[13]  Dino Pedreschi,et al.  Time-focused clustering of trajectories of moving objects , 2006, Journal of Intelligent Information Systems.

[14]  Bettina Speckmann,et al.  Efficient detection of motion patterns in spatio-temporal data sets , 2004, GIS '04.

[15]  Yifan Li,et al.  Clustering moving objects , 2004, KDD.

[16]  Padhraic Smyth,et al.  Trajectory clustering with mixtures of regression models , 1999, KDD '99.

[17]  M. Nanni,et al.  Spatio-Temporal Clustering : a Survey Spatio-Temporal Clustering : a Survey , 2010 .

[18]  Jae-Gil Lee,et al.  Incremental Clustering for Trajectories , 2010, DASFAA.

[19]  Hans-Peter Kriegel,et al.  OPTICS: ordering points to identify the clustering structure , 1999, SIGMOD '99.

[20]  Xing Xie,et al.  Discovering spatio-temporal causal interactions in traffic data streams , 2011, KDD.

[21]  Padhraic Smyth,et al.  Probabilistic clustering of extratropical cyclones using regression mixture models , 2007 .

[22]  Xing Xie,et al.  GeoLife2.0: A Location-Based Social Networking Service , 2009, 2009 Tenth International Conference on Mobile Data Management: Systems, Services and Middleware.

[23]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[24]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[25]  Panos Kalnis,et al.  On Discovering Moving Clusters in Spatio-temporal Data , 2005, SSTD.