An Effective Spatio-Temporal Query Framework for Massive Trajectory Data in Urban Computing

With the development of IoT techniques, urban computing has become an emerging topic in academia and industry. The goal of urban computing is to address some issues of urban planning by using the big data generated in urban facilities. The massive trajectory data processing is viewed as an important issue in urban computing. To satisfy the storage and processing requirements of massive trajectory data, a distributed system is usually adopted. However, existing distributed systems face challenges of data locality aware partitioning and various trajectory queries. In this paper, we propose a distributed framework of massive trajectory data analysis based on HBase, to realize spatio-temporal query more effectively. We first design a temporal-based pre-partitioning strategy to improve the performance of data written. Then we develop a Multi-Level Index to speed up the process of spatio-temporal query. Extensive experiments on real trajectory datasets demonstrate that the proposed framework significantly improves efficiency and usability.

[1]  Amit P. Sheth,et al.  Semantic (Web) Technology In Action: Ontology Driven Information Systems for Search, Integration and Analysis , 2003, IEEE Data Eng. Bull..

[2]  Yu Zheng,et al.  Managing massive trajectories on the cloud , 2016, SIGSPATIAL/GIS.

[3]  Le Gruenwald,et al.  Spatial Join Query Processing in Cloud: Analyzing Design Choices and Performance Comparisons , 2015, 2015 44th International Conference on Parallel Processing Workshops.

[4]  Aoying Zhou,et al.  TrajSpark: A Scalable and Efficient In-Memory Management System for Big Trajectory Data , 2017, APWeb/WAIM.

[5]  Mohamed Sarwat,et al.  GeoSpark: a cluster computing framework for processing large-scale spatial data , 2015, SIGSPATIAL/GIS.

[6]  Lei Chen,et al.  Finding time period-based most frequent path in big trajectory data , 2013, SIGMOD '13.

[7]  Jörg Sander,et al.  PIST: An Efficient and Practical Indexing Technique for Historical Spatio-Temporal Point Data , 2008, GeoInformatica.

[8]  Weiwei Sun,et al.  COMPRESS , 2017, ACM Trans. Database Syst..

[9]  Samuel Madden,et al.  TrajStore: An adaptive storage system for very large trajectory data sets , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[10]  Kai Hwang,et al.  Intelligent Carpool Routing for Urban Ridesharing by Mining GPS Trajectories , 2014, IEEE Transactions on Intelligent Transportation Systems.

[11]  Yu Zheng,et al.  Travel time estimation of a path using sparse trajectories , 2014, KDD.

[12]  Yunjun Gao,et al.  UlTraMan: A Unified Platform for Big Trajectory Data Management and Analytics , 2018, Proc. VLDB Endow..

[13]  Walid G. Aref,et al.  Spatio-Temporal Access Methods , 2003, IEEE Data Eng. Bull..

[14]  Walid G. Aref,et al.  Spatio-Temporal Access Methods: Part 2 (2003 - 2010) , 2010, IEEE Data Eng. Bull..

[15]  Ahmed Eldawy,et al.  SpatialHadoop: A MapReduce framework for spatial data , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[16]  Ralf Hartmut Güting,et al.  BerlinMOD: a benchmark for moving object databases , 2009, The VLDB Journal.

[17]  Joel H. Saltz,et al.  Hadoop-GIS: A High Performance Spatial Data Warehousing System over MapReduce , 2013, Proc. VLDB Endow..

[18]  Licia Capra,et al.  Urban Computing: Concepts, Methodologies, and Applications , 2014, TIST.

[19]  Aoying Zhou,et al.  Query processing of massive trajectory data based on mapreduce , 2009, CloudDB@CIKM.

[20]  Shazia Wasim Sadiq,et al.  SharkDB: An In-Memory Column-Oriented Trajectory Storage , 2014, CIKM.

[21]  Barzan Mozafari,et al.  SnappyData: A Unified Cluster for Streaming, Transactions and Interactice Analytics , 2017, CIDR.

[22]  Ralf Hartmut Güting,et al.  Parallel Secondo: Boosting Database Engines with Hadoop , 2012, 2012 IEEE 18th International Conference on Parallel and Distributed Systems.

[23]  Minyi Guo,et al.  Simba: Efficient In-Memory Spatial Analytics , 2016, SIGMOD Conference.

[24]  Xiaoyong Du,et al.  Elite: an elastic infrastructure for big spatiotemporal trajectories , 2016, The VLDB Journal.

[25]  Yunjun Gao,et al.  VIPTRA: Visualization and Interactive Processing on Big Trajectory Data , 2018, 2018 19th IEEE International Conference on Mobile Data Management (MDM).

[26]  Qing Liu,et al.  DFTHR: A Distributed Framework for Trajectory Similarity Query Based on HBase and Redis , 2019, Inf..

[27]  M. N. Vora,et al.  Hadoop-HBase for large-scale data , 2011, Proceedings of 2011 International Conference on Computer Science and Network Technology.

[28]  Nikos Pelekis,et al.  Algorithms for Nearest Neighbor Search on Moving Object Trajectories , 2007, GeoInformatica.