Indexing land surface for efficient kNN query

The class of k Nearest Neighbor (kNN) queries is frequently used in geospatial applications. Many studies focus on processing kNN queries in Euclidean and road network spaces. Meanwhile, with recent advances in remote sensing devices that can acquire detailed elevation data, new geospatial applications operate heavily on the land surface, which adds elevation as a constrained third dimension. Database systems should therefore be able to process spatial queries efficiently on the land surface. However, online processing of surface k Nearest Neighbor (skNN) queries is challenging because land surface models are so large that any accurate distance computation on the surface is extremely slow. In this paper, we propose, for the first time, an index structure on the land surface that enables exact and fast responses to skNN queries. Two complementary indexing schemes, namely Tight Surface Index (TSI) and Loose Surface Index (LSI), are constructed and stored together in a single novel data structure called the Surface Index R-tree (SIR-tree). With these indexes, we can process skNN queries efficiently by localizing the search and minimizing invocations of the costly surface distance computation, which keeps both I/O and computation costs low. Our algorithm does not need to know the value of k a priori: it incrementally expands the search region using the SIR-tree and reports the query results progressively. It also reports the exact shortest surface paths to the query results. Experiments with real-world data sets show that our algorithm outperforms competing approaches in both efficiency and accuracy.
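
The idea of reporting neighbors progressively while calling the costly surface distance as rarely as possible can be illustrated with a minimal sketch. This is not the SIR-tree, TSI, or LSI, whose construction is not described in the abstract; it is a generic incremental nearest-neighbor search that assumes only that the straight-line Euclidean distance never exceeds the surface distance. The names `incremental_knn` and `toy_surface_dist` are hypothetical, and `toy_surface_dist` is a stand-in for a real shortest-surface-path routine.

```python
import heapq
import math
from typing import Callable, Iterator, Sequence, Tuple

Point = Tuple[float, float, float]  # (x, y, elevation)


def toy_surface_dist(a: Point, b: Point) -> float:
    """Hypothetical stand-in for an expensive exact surface distance.

    It adds an elevation penalty to the Euclidean distance, so it is
    always >= the Euclidean lower bound, as a true surface distance is.
    """
    return math.dist(a, b) + abs(a[2] - b[2])


def incremental_knn(
    query: Point,
    objects: Sequence[Point],
    exact_dist: Callable[[Point, Point], float],
) -> Iterator[Tuple[int, float]]:
    """Yield (object index, exact distance) in ascending exact distance.

    The caller does not pass k; it simply stops iterating when it has
    enough neighbors. The exact distance is computed only for objects
    whose Euclidean lower bound reaches the front of the queue.
    """
    # Heap entries are (key, is_exact, index); key starts as a lower bound.
    heap = [(math.dist(query, p), 0, i) for i, p in enumerate(objects)]
    heapq.heapify(heap)
    while heap:
        key, is_exact, i = heapq.heappop(heap)
        if is_exact:
            # Every remaining key is an exact distance or a lower bound
            # on one, so no other object can be closer than this one.
            yield i, key
        else:
            # Refine: replace the lower bound with the exact distance.
            heapq.heappush(heap, (exact_dist(query, objects[i]), 1, i))
```

A caller that wants the two nearest objects can take two items from the generator and stop; objects whose lower bound is already larger than the second neighbor's exact distance are never passed to `exact_dist`.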
