Coupling or decoupling for KNN search on road networks?: a hybrid framework on user query patterns

We explore in this paper a new KNN algorithm, called the SQUARE algorithm, for searching spatial objects on road networks. Recent works in the literature discussed the necessity to support object updates for promising location-based services. Among them, the decoupling spatial search algorithms, which separate the handle of the network traversal and the object lookup, has been recognized as the most effective approach to cut the maintenance overhead from updates. However, the queue-based network traversal needs to be performed from scratch for each KNN query until the KNN objects are exactly identified, indicating that the query complexity is in proportion to the number of visited network nodes. The query efficiency is concerned for online LBS applications since they only allow lightweight operations for minimizing the query latency. To improve the query scalability while supporting data updates, SQUARE constructs the network index similar to the way used in decoupling models, and meanwhile exploit the coupling idea to maintain the KNN information relative to hot regions in the network index. The hot region denotes the area with frequent queries discovered in the query history. Inspired from the prevalently observed 80-20 rule, SQUARE can maximize the query throughput by returning KNN results in the quasi-constant time for 80% queries that are roughly issued within 20% area (hot regions). As validated in our experimental results, SQUARE outperforms previous works and achieves the significant performance improvement without sacrifice on the maintenance overhead for object updates.

[1]  Christos Faloutsos,et al.  Density biased sampling: an improved method for data mining and clustering , 2000, SIGMOD '00.

[2]  Hanan Samet,et al.  Scalable network distance browsing in spatial databases , 2008, SIGMOD Conference.

[3]  Yufei Tao,et al.  Location-based spatial queries , 2003, SIGMOD '03.

[4]  Ken C. K. Lee,et al.  Fast object search on road networks , 2009, EDBT '09.

[5]  George Kingsley Zipf,et al.  Human behavior and the principle of least effort , 1949 .

[6]  Edsger W. Dijkstra,et al.  A note on two problems in connexion with graphs , 1959, Numerische Mathematik.

[7]  Christian S. Jensen,et al.  The Islands Approach to Nearest Neighbor Querying in Spatial Networks , 2005, SSTD.

[8]  Christian S. Jensen,et al.  Multiple k Nearest Neighbor Query Processing in Spatial Network Databases , 2006, ADBIS.

[9]  Cyrus Shahabi,et al.  Voronoi-Based K Nearest Neighbor Search for Spatial Network Databases , 2004, VLDB.

[10]  Jianliang Xu,et al.  Fast Nearest Neighbor Search on Road Networks , 2006, EDBT.

[11]  Chin-Wan Chung,et al.  An Efficient and Scalable Approach to CNN Queries in a Road Network , 2005, VLDB.

[12]  Feifei Li,et al.  On Trip Planning Queries in Spatial Databases , 2005, SSTD.

[13]  Christos Faloutsos,et al.  Next Generation Data Mining Tools: Power Laws and Self-similarity for Graphs, Streams and Traditional Data , 2003, PKDD.

[14]  Yufei Tao,et al.  Query Processing in Spatial Network Databases , 2003, VLDB.

[15]  Richard Koch,et al.  The 80/20 Principle: The Secret of Achieving More With Less , 1998 .

[16]  Yuan Tian,et al.  ROAD: A New Spatial Object Search Framework for Road Networks , 2012, IEEE Transactions on Knowledge and Data Engineering.

[17]  Vipin Kumar,et al.  A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs , 1998, SIAM J. Sci. Comput..

[18]  Victor C. S. Lee,et al.  Distance indexing on road networks , 2006, VLDB.

[19]  Brian W. Kernighan,et al.  An efficient heuristic procedure for partitioning graphs , 1970, Bell Syst. Tech. J..

[20]  Nick Roussopoulos,et al.  K-Nearest Neighbor Search for Moving Query Point , 2001, SSTD.