Mining Massive-Scale Spatiotemporal Trajectories in Parallel: A Survey

With the popularization of positioning devices such as GPS navigators and smart phones, large volumes of spatiotemporal trajectory data have been produced at unprecedented speed. For many trajectory mining problems, a number of computationally efficient approaches have been proposed. However, to more effectively tackle the challenge of big data, it is important to exploit various advanced parallel computing paradigms. In this paper, we present a comprehensive survey of the state-of-the-art techniques for mining massive-scale spatiotemporal trajectory data based on parallel computing platforms such as Graphics Processing Unit GPU, MapReduce and Field Programmable Gate Array FPGA. This survey covers essential topics including trajectory indexing and query, clustering, join, classification, pattern mining and applications. We also give an in-depth analysis of the related techniques and compare them according to their principles and performance.

[2]  Aoying Zhou,et al.  Query processing of massive trajectory data based on mapreduce , 2009, CloudDB@CIKM.

[3]  Ralf Hartmut Güting,et al.  Parallel SECONDO: A practical system for large-scale processing of moving objects , 2014, 2014 IEEE 30th International Conference on Data Engineering.

[4]  Yu Zheng,et al.  Computing with Spatial Trajectories , 2011, Computing with Spatial Trajectories.

[5]  Joachim Gudmundsson,et al.  A GPU Approach to Subtrajectory Clustering Using the Fréchet Distance , 2012, IEEE Transactions on Parallel and Distributed Systems.

[6]  Nacho Valladares,et al.  A parallel GPU-based approach for reporting flock patterns , 2014, Int. J. Geogr. Inf. Sci..

[7]  Yu Wang,et al.  Accelerating subsequence similarity search based on dynamic time warping distance with FPGA , 2013, FPGA '13.

[8]  Fu Kit Sheong,et al.  A fast parallel clustering algorithm for molecular simulation trajectories , 2013, J. Comput. Chem..

[9]  Marcos R. Vieira,et al.  Stream-Mode FPGA Acceleration of Complex Pattern Trajectory Querying , 2013, SSTD.

[10]  Changjie Tang,et al.  PartSpan: Parallel Sequence Mining of Trajectory Patterns , 2008, 2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery.

[11]  Henri Casanova,et al.  Technical Report: Parallel Distance Threshold Query Processing for Spatiotemporal Trajectory Databases on the GPU , 2014, ArXiv.

[12]  Shaojie Qiao,et al.  Parallel Sequential Pattern Mining of Massive Trajectory Data , 2010, Int. J. Comput. Intell. Syst..

[13]  Kazuhiro Seki,et al.  Parallel distributed trajectory pattern mining using MapReduce , 2012, 4th IEEE International Conference on Cloud Computing Technology and Science Proceedings.

[14]  Le Gruenwald,et al.  Parallel online spatial and temporal aggregations on multi-core CPUs and many-core GPUs , 2014, Inf. Syst..

[15]  Roberto J. Bayardo,et al.  Fast Algorithms for Finding Extremal Sets , 2011, SDM.

[16]  Le Gruenwald,et al.  High-performance online spatial and temporal aggregations on multi-core CPUs and many-core GPUs , 2012, DOLAP '12.

[17]  P. Deshpande,et al.  A MapReduce based hybrid genetic algorithm using island approach for solving time dependent vehicle routing problem , 2012, 2012 International Conference on Computer & Information Science (ICCIS).

[18]  Jarke J. van Wijk,et al.  Contour based visualization of vessel movement predictions , 2014, Int. J. Geogr. Inf. Sci..

[19]  Ralf Hartmut Güting,et al.  SECONDO: A Platform for Moving Objects Database Research and for Publishing and Integrating Research Implementations , 2010, IEEE Data Eng. Bull..

[20]  Jae-Gil Lee,et al.  TraClass: trajectory classification using hierarchical region-based and trajectory-based clustering , 2008, Proc. VLDB Endow..

[21]  Satyandra K. Gupta,et al.  GPU based generation of state transition models using simulations for unmanned surface vehicle trajectory planning , 2012, Robotics Auton. Syst..

[22]  Hsiao-Ping Tsai,et al.  Mining Uncertain Sequence Data on Hadoop Platform , 2014, PAKDD Workshops.

[23]  Reynold Cheng,et al.  Scalable Algorithms for Nearest-Neighbor Joins on Big Trajectory Data , 2016, IEEE Transactions on Knowledge and Data Engineering.

[24]  Hui Ding,et al.  Efficient Similarity Join of Large Sets of Moving Object Trajectories , 2008, 2008 15th International Symposium on Temporal Representation and Reasoning.

[25]  Eamonn J. Keogh,et al.  Accelerating Dynamic Time Warping Subsequence Search with GPUs and FPGAs , 2010, 2010 IEEE International Conference on Data Mining.

[26]  Christian S. Jensen,et al.  Discovery of convoys in trajectory databases , 2008, Proc. VLDB Endow..

[27]  Marcos R. Vieira,et al.  High performance FPGA and GPU complex pattern matching over spatio-temporal streams , 2014, GeoInformatica.

[28]  Bolin Ding,et al.  Attraction and Avoidance Detection from Movements , 2013, Proc. VLDB Endow..

[29]  Dino Pedreschi,et al.  Trajectory pattern mining , 2007, KDD '07.

[30]  Ignacio Valladares Cereceda GPU parallel algorithms for reporting movement behaviour patterns in spatiotemporal databases , 2013 .

[31]  Jiawei Han,et al.  Swarm: Mining Relaxed Temporal Moving Object Clusters , 2010, Proc. VLDB Endow..

[32]  Le Gruenwald,et al.  U2STRA: high-performance data management of ubiquitous urban sensing trajectories on GPGPUs , 2012, CDMW '12.

[33]  Joachim Gudmundsson,et al.  Detecting Commuting Patterns by Clustering Subtrajectories , 2011, Int. J. Comput. Geom. Appl..

[34]  Kazuhiro Seki,et al.  Parallel Distributed Trajectory Pattern Mining Using Hierarchical Grid with MapReduce , 2013, Int. J. Grid High Perform. Comput..

[35]  Jae-Gil Lee,et al.  Trajectory clustering: a partition-and-group framework , 2007, SIGMOD '07.

[36]  Fei Wu,et al.  Mining Following Relationships in Movement Data , 2013, 2013 IEEE 13th International Conference on Data Mining.

[37]  Le Gruenwald,et al.  Parallel spatial query processing on GPUs using R-trees , 2013, BigSpatial '13.

[38]  Joachim Gudmundsson,et al.  Computing longest duration flocks in trajectory data , 2006, GIS '06.

[39]  Zhenhui Li Spatiotemporal Pattern Mining: Algorithms and Applications , 2014, Frequent Pattern Mining.

[40]  Ralf Hartmut Güting,et al.  Parallel Secondo: Boosting Database Engines with Hadoop , 2012, 2012 IEEE 18th International Conference on Parallel and Distributed Systems.

[41]  蔡曉萍 Mining Movement Pattern from Uncertain Trajectory Data with Mapreduce , 2011 .

[42]  Petko Bakalov,et al.  On-line discovery of flock patterns in spatio-temporal data , 2009, GIS.

[43]  Monica Wachowicz,et al.  Exploring patterns of movement suspension in pedestrian mobility. , 2011, Geographical analysis.

[44]  Wenjun Wang,et al.  The Design and Application of Navigation and Location Services Data Index , 2013, 2013 International Conference on Computational and Information Sciences.

[45]  Jiawei Han,et al.  Mining periodic behaviors for moving objects , 2010, KDD.

[46]  Ahmed Eldawy,et al.  A Demonstration of SpatialHadoop: An Efficient MapReduce Framework for Spatial Data , 2013, Proc. VLDB Endow..