GroupSeeker: An Applicable Framework for Travel Companion Discovery from Vast Trajectory Data

The popularity of mobile locate-enabled devices and Location Based Service (LBS) generates massive spatio-temporal data every day. Due to the close relationship between behavior patterns and movement trajectory, trajectory data mining has been applied in numerous fields to find the behavior pattern. Among them, discovering traveling companions is one of the most fundamental techniques in these areas. This paper proposes a flexible framework named GroupSeeker for discovering traveling companions in vast real-world trajectory data. In the real-world data resource, it is significant to avoid the companion candidate omitting problem happening in the time-snapshot-slicing-based method. These methods do not work well with the sparse real-world data, which is caused by the equipment sampling failure or manual intervention. In this paper, a 5-stage framework including Data Preprocessing, Spatio-temporal Clustering, Candidate Voting, Pseudo-companion Filtering, and Group Merging is proposed to discover traveling companions. The framework even works well when there is a long time span during several days. The experiments result on two real-world data sources which offer massive amount of data subsets with different scale and different sampling frequencies show the effective and robustness of this framework. Besides, the proposed framework has a higher-efficiency performing when discovering satisfying companions over a long-term period.

[1]  Tian Qin,et al.  Spatio-Temporal Routine Mining on Mobile Phone Data , 2018, ACM Trans. Knowl. Discov. Data.

[2]  Yee Leung,et al.  Applying mobile phone data to travel behaviour research: A literature review , 2017 .

[3]  Leland McInnes,et al.  hdbscan: Hierarchical density based clustering , 2017, J. Open Source Softw..

[4]  Kamal Al-Sabahi,et al.  A Novel Integrated Approach for Companion Vehicle Discovery Based on Frequent Itemset Mining on Spark , 2019, Arabian Journal for Science and Engineering.

[5]  Jie Zhao,et al.  A review of moving object trajectory clustering algorithms , 2016, Artificial Intelligence Review.

[6]  Xiaohui Huang,et al.  A scalable and fast OPTICS for clustering trajectory big data , 2015, Cluster Computing.

[7]  Hao Yuan,et al.  Exploring Group Movement Pattern through Cellular Data: A Case Study of Tourists in Hainan , 2018, ISPRS Int. J. Geo Inf..

[8]  Ahmed Elragal,et al.  Analysis of Trajectory Data in Support of Traffic Management: A Data Mining Approach , 2014, ICDM.

[9]  Lei Shu,et al.  Reality mining: A prediction algorithm for disease dynamics based on mobile big data , 2017, Inf. Sci..

[10]  Zili Zhang,et al.  A MapReduce-Based Parallel Frequent Pattern Growth Algorithm for Spatiotemporal Association Analysis of Mobile Trajectory Big Data , 2018, Complex..

[11]  Daniel A. Keim,et al.  A General Approach to Clustering in Large Databases with Noise , 2003, Knowledge and Information Systems.

[12]  Jing Yuan,et al.  A framework of traveling companion discovery on trajectory data streams , 2013, ACM Trans. Intell. Syst. Technol..

[13]  Xing Xie,et al.  GeoLife: A Collaborative Social Networking Service among User, Location and Trajectory , 2010, IEEE Data Eng. Bull..

[14]  Jérôme Morio,et al.  Trajectory clustering of air traffic flows around airports , 2019, Aerospace Science and Technology.

[15]  丁庆海 Ding Qinghai,et al.  Trajectory outlier detection based on DBSCAN clustering algorithm , 2017 .

[16]  Yunjun Gao,et al.  Algorithms for constrained k-nearest neighbor queries over moving object trajectories , 2010, GeoInformatica.

[17]  Xudong Liu,et al.  Identification of Urban Functional Regions in Chengdu Based on Taxi Trajectory Time Series Data , 2020, ISPRS Int. J. Geo Inf..

[18]  Yu Zheng,et al.  Trajectory Data Mining , 2015, ACM Trans. Intell. Syst. Technol..

[19]  Christian S. Jensen,et al.  Discovery of convoys in trajectory databases , 2008, Proc. VLDB Endow..

[20]  Jiawei Han,et al.  Swarm: Mining Relaxed Temporal Moving Object Clusters , 2010, Proc. VLDB Endow..

[21]  Xujun Zhao,et al.  TAD: A trajectory clustering algorithm based on spatial-temporal density analysis , 2020, Expert Syst. Appl..

[22]  Gennady L. Andrienko,et al.  Clustering Trajectories by Relevant Parts for Air Traffic Analysis , 2018, IEEE Transactions on Visualization and Computer Graphics.