TrajMesa: A Distributed NoSQL Storage Engine for Big Trajectory Data

Trajectory data is very useful for many urban applications. However, due to its spatio-temporal and high-volume properties, it is challenging to manage trajectory data. Existing trajectory data management frameworks suffer from scalability problem, and only support limited trajectory queries. This paper proposes a holistic distributed NoSQL trajectory storage engine, TrajMesa, based on GeoMesa, an open-source indexing toolkit for spatio-temporal data. TrajMesa adopts a novel storage schema, which reduces the storage size tremendously. We also devise novel indexing key designs, and propose a bunch of pruning strategies. TrajMesa can support plentiful queries efficiently, including ID-Temporal query, spatial range query, similarity query, and k-NN query. Experimental results show the powerful query efficiency and scalability of TrajMesa.

[1]  Feifei Li,et al.  Distributed Trajectory Similarity Search , 2017, Proc. VLDB Endow..

[2]  Heng Tao Shen,et al.  Searching trajectories by locations: an efficiency study , 2010, SIGMOD Conference.

[3]  Chao Tian,et al.  Detecting Vehicle Illegal Parking Events using Sharing Bikes' Trajectories , 2018, KDD.

[4]  Zhifeng Bao,et al.  DITA: Distributed In-Memory Trajectory Analytics , 2018, SIGMOD Conference.

[5]  Yu Zheng,et al.  Trajectory Data Mining , 2015, ACM Trans. Intell. Syst. Technol..

[6]  Yu Zheng,et al.  CloudTP: A Cloud-Based Flexible Trajectory Preprocessing Framework , 2018, 2018 IEEE 34th International Conference on Data Engineering (ICDE).

[7]  J. Shane Culpepper,et al.  Torch: A Search Engine for Trajectory Data , 2018, SIGIR.

[8]  Yu Zheng,et al.  Managing massive trajectories on the cloud , 2016, SIGSPATIAL/GIS.

[9]  H. Sagan Space-filling curves , 1994 .

[10]  James M. Kang,et al.  Space-Filling Curves , 2017, Encyclopedia of GIS.

[11]  Jun Luo,et al.  Mining Spatio-Temporal Reachable Regions over Massive Trajectory Data , 2017, 2017 IEEE 33rd International Conference on Data Engineering (ICDE).

[12]  Helmut Alt,et al.  Computing the Fréchet distance between two polygonal curves , 1995, Int. J. Comput. Geom. Appl..

[13]  Christian Böhm,et al.  XZ-Ordering: A Space-Filling Curve for Objects with Spatial Extension , 1999, SSD.

[14]  Hanan Samet,et al.  An Incremental Hausdorff Distance Calculation Algorithm , 2011, Proc. VLDB Endow..

[15]  이상훈,et al.  트위터 트랜딩 토픽을 이용한 HBase 기반 자동 요약 시스템 , 2014 .