FastTopK: A Fast Top-K Trajectory Similarity Query Processing Algorithm for GPUs

With the increasing prevalence of location sensor devices like GPS, it has been possible to collect large datasets of a special type of spatio-temporal data called trajectory data. A trajectory is a discrete sequence of positions that a moving object occupies in space as time passes. Such large datasets enable researchers to study the behavior of the objects describing these movements by issuing spatial queries. Among the queries that can be issued are top-K trajectory similarity queries, which retrieve the K most similar trajectories to a given query trajectory. This query has applications in many areas, such as urban planning, ecology and social networking; however, this query is computationally expensive. In this work, we introduce a new parallel top-K trajectory similarity query technique for GPUs, FastTopK, to deal with these challenges. Our experiments on two large real-life datasets showed that FastTopK produces on average 107.96X smaller candidate result sets, and 3.36X faster query execution times than the existing state-of-the-art technique, TKSimGPU.

[1]  Hanan Samet,et al.  An Incremental Hausdorff Distance Calculation Algorithm , 2011, Proc. VLDB Endow..

[2]  Lei Chen,et al.  Robust and fast similarity search for moving object trajectories , 2005, SIGMOD '05.

[3]  Robert E. Kenward,et al.  Life path analysis: scaling indicates priming effects of social and habitat factors on dispersal distances , 2001 .

[4]  Hui Ding,et al.  Efficient Similarity Join of Large Sets of Moving Object Trajectories , 2008, 2008 15th International Symposium on Temporal Representation and Reasoning.

[5]  Pradeep Dubey,et al.  Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU , 2010, ISCA.

[6]  Stephen M. Krone,et al.  Analyzing animal movements using Brownian bridges. , 2007, Ecology.

[7]  Xing Xie,et al.  GeoLife: A Collaborative Social Networking Service among User, Location and Trajectory , 2010, IEEE Data Eng. Bull..

[8]  Le Gruenwald,et al.  TKSimGPU : A Parallel Top-K Trajectory Similarity Query Processing Algorithm for GPGPUs , 2015 .

[9]  Lei Chen,et al.  On The Marriage of Lp-norms and Edit Distance , 2004, VLDB.

[10]  Sriram Raghavan,et al.  Indexing and matching trajectories under inconsistent sampling rates , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[11]  Gang Chen,et al.  KSQ: Top-k Similarity Query on Uncertain Trajectories , 2013, IEEE Transactions on Knowledge and Data Engineering.

[12]  Le Gruenwald,et al.  U2STRA: high-performance data management of ubiquitous urban sensing trajectories on GPGPUs , 2012, CDMW '12.

[13]  Le Gruenwald,et al.  Towards an Efficient Top-K Trajectory Similarity Query Processing Algorithm for Big Trajectory Data on GPGPUs , 2016, 2016 IEEE International Congress on Big Data (BigData Congress).