Mining Spatio-Temporal Reachable Regions over Massive Trajectory Data

Mining spatio-temporal reachable regions aims to find a set of road segments from massive trajectory data, that are reachable from a user-specified location and within a given temporal period. Accurately extracting such spatiotemporal reachable area is vital in many urban applications, e.g., (i) location-based recommendation, (ii) location-based advertising, and (iii) business coverage analysis. The traditional approach of answering such queries essentially performs a distance-based range query over the given road network, which have two main drawbacks: (i) it only works with the physical travel distances, where the users usually care more about dynamic traveling time, and (ii) it gives the same result regardless of the querying time, where the reachable area could vary significantly with different traffic conditions. Motivated by these observations, we propose a data-driven approach to formulate the problem as mining actual reachable region based on real historical trajectory dataset. The main challenge in our approach is the system efficiency, as verifying the reachability over the massive trajectories involves huge amount of disk I/Os. In this paper, we develop two indexing structures: 1) spatio-temporal index (ST-Index) and 2) connection index (Con-Index) to reduce redundant trajectory data access operations. We also propose a novel query processing algorithm with: 1) maximum bounding region search, which directly extracts a small searching region from the index structure and 2) trace back search, which refines the search results from the previous step to find the final query result. Moreover, our system can also efficiently answer the spatio-temporal reachability query with multiple query locations by skipping the overlapped area search. We evaluate our system extensively using a large-scale real taxi trajectory data in Shenzhen, China, where results demonstrate that the proposed algorithms can reduce 50%-90% running time over baseline algorithms.

[1]  Jinfeng Li,et al.  Reachability and time-based path queries in temporal graphs , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[2]  Kotagiri Ramamohanarao,et al.  Optimized algorithms for predictive range and KNN queries on moving objects , 2010, Inf. Syst..

[3]  Jimeng Sun,et al.  The TPR*-Tree: An Optimized Spatio-Temporal Access Method for Predictive Queries , 2003, VLDB.

[4]  Jignesh M. Patel,et al.  Indexing Large Trajectory Data Sets With SETI , 2003, CIDR.

[5]  Kian-Lee Tan,et al.  G-Tree: An Efficient and Scalable Index for Spatial Search on Road Networks , 2015, IEEE Transactions on Knowledge and Data Engineering.

[6]  Yu Zheng,et al.  Trajectory Data Mining , 2015, ACM Trans. Intell. Syst. Technol..

[7]  Gerhard Weikum,et al.  FERRARI: Flexible and efficient reachability range assignment for graph indexing , 2013, 2013 IEEE 29th International Conference on Data Engineering (ICDE).

[8]  Licia Capra,et al.  Urban Computing: Concepts, Methodologies, and Applications , 2014, TIST.

[9]  Sibo Wang,et al.  Reachability queries on large dynamic graphs: a total order approach , 2014, SIGMOD Conference.

[10]  Antonin Guttman,et al.  R-trees: a dynamic index structure for spatial searching , 1984, SIGMOD '84.

[11]  Mikkel Thorup,et al.  Approximate distance oracles , 2001, JACM.

[12]  Feifei Li,et al.  On Trip Planning Queries in Spatial Databases , 2005, SSTD.

[13]  Byron Choi,et al.  On incremental maintenance of 2-hop labeling of graphs , 2008, WWW.

[14]  Yufei Tao,et al.  Query Processing in Spatial Network Databases , 2003, VLDB.

[15]  Ralf Hartmut Güting,et al.  Indexing the Trajectories of Moving Objects in Networks* , 2004, GeoInformatica.

[16]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[17]  Lei Chen,et al.  Robust and fast similarity search for moving object trajectories , 2005, SIGMOD '05.

[18]  A. Goldberg,et al.  TRANSIT: Ultrafast Shortest-Path Queries with Linear-Time Preprocessing , 2006 .

[19]  Heng Tao Shen,et al.  Searching trajectories by locations: an efficiency study , 2010, SIGMOD Conference.

[20]  Hans-Peter Kriegel,et al.  The R*-tree: an efficient and robust access method for points and rectangles , 1990, SIGMOD '90.

[21]  Rui Li,et al.  Fast Range Query Processing with Strong Privacy Protection for Cloud Computing , 2014, Proc. VLDB Endow..

[22]  Ralf Hartmut Güting,et al.  Modeling and querying moving objects in networks , 2006, The VLDB Journal.

[23]  Christian S. Jensen,et al.  Path prediction and predictive range querying in road network databases , 2010, The VLDB Journal.

[24]  Tanzima Hashem,et al.  Group Trip Planning Queries in Spatial Databases , 2013, SSTD.

[25]  Dieter Pfoser,et al.  Novel Approaches to the Indexing of Moving Object Trajectories , 2000, VLDB.

[26]  Loïc Cerf,et al.  Reachability Queries in Very Large Graphs: A Fast Refined Online Search Approach , 2014, EDBT.

[27]  Abdeltawab M. Hendawi,et al.  Predictive tree: An efficient index for predictive queries on road networks , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[28]  Nikos Pelekis,et al.  Algorithms for Nearest Neighbor Search on Moving Object Trajectories , 2007, GeoInformatica.

[29]  Bernhard Seeger,et al.  An asymptotically optimal multiversion B-tree , 1996, The VLDB Journal.

[30]  Chi-Yin Chow,et al.  Efficient Evaluation of k-Range Nearest Neighbor Queries in Road Networks , 2010, 2010 Eleventh International Conference on Mobile Data Management.

[31]  Yangjun Chen,et al.  Decomposing DAGs into spanning trees: A new way to compress transitive closures , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[32]  Qiang Yang,et al.  Sampling Big Trajectory Data , 2015, CIKM.

[33]  Xing Xie,et al.  An Interactive-Voting Based Map Matching Algorithm , 2010, 2010 Eleventh International Conference on Mobile Data Management.

[34]  Jing Cai,et al.  Path-hop: efficiently indexing large graphs for reachability queries , 2010, CIKM.

[35]  Yangjun Chen,et al.  An Efficient Algorithm for Answering Graph Reachability Queries , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[36]  Oege de Moor,et al.  A memory efficient reachability data structure through bit vector compression , 2011, SIGMOD '11.