Supporting Range Queries on Web Data Using k-Nearest Neighbor Search

A large volume of geospatial data is available on the web through various applications. However, access to these data is often limited to certain types of queries because of restrictive web interfaces. A typical scenario is the many business web sites that expose the addresses of their branch locations only through a limited "nearest location" web interface. For example, the web site of a restaurant chain such as McDonald's can be queried to find a few of the branches closest to the user's home address. However, even though the site holds the location data of all its restaurants in, for example, the state of California, the provided web interface makes it very difficult to retrieve this complete data set. We formulate this as the more general problem of answering spatial range queries using only k-Nearest Neighbor (k-NN) queries. We then propose two algorithms that cover a rectangular spatial range query with as few k-NN queries as possible. Finally, we evaluate the efficiency of our algorithms through empirical experiments.
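To make the problem statement concrete, the following is a minimal sketch of one naive way to answer a rectangular range query using only k-NN calls. It is not either of the two algorithms proposed in the paper, which the abstract does not describe. It is a simple quadtree-style baseline under stated assumptions: points are distinct, k is at least 2, and the hypothetical `make_knn_service` function stands in for a real "nearest location" web interface. The idea is that a k-NN query at the center of a rectangle reveals every point strictly closer than the k-th neighbor, so if that distance exceeds the half-diagonal of the rectangle, the rectangle is fully covered; otherwise it is split into four quadrants and each is queried in turn.

```python
import math
import random


def make_knn_service(points, k):
    """Simulate a restrictive 'nearest location' interface (hypothetical stand-in).

    Returns a query function that yields only the k points closest to (qx, qy),
    plus a counter of how many queries have been issued.
    """
    stats = {"queries": 0}

    def knn(qx, qy):
        stats["queries"] += 1
        return sorted(points, key=lambda p: math.hypot(p[0] - qx, p[1] - qy))[:k]

    return knn, stats


def range_query_via_knn(knn, k, rect, max_depth=32):
    """Collect all points inside rect = (x1, y1, x2, y2) using only k-NN calls.

    Assumes distinct points (given as tuples) and k >= 2.
    """
    if k < 2:
        raise ValueError("this sketch needs k >= 2 to guarantee termination")
    x1, y1, x2, y2 = rect
    found = set()

    def cover(ax, ay, bx, by, depth):
        cx, cy = (ax + bx) / 2, (ay + by) / 2
        result = knn(cx, cy)
        found.update(result)
        if len(result) < k:
            return  # the service holds fewer than k points, so all are known
        # Every point strictly closer than the k-th neighbor is in `result`.
        radius = max(math.hypot(px - cx, py - cy) for px, py in result)
        half_diagonal = math.hypot(bx - ax, by - ay) / 2
        if radius > half_diagonal:
            return  # the known disk contains the whole rectangle
        if depth >= max_depth:
            raise RuntimeError("depth limit reached; are there coincident points?")
        # Otherwise split into four quadrants and cover each one separately.
        cover(ax, ay, cx, cy, depth + 1)
        cover(cx, ay, bx, cy, depth + 1)
        cover(ax, cy, cx, by, depth + 1)
        cover(cx, cy, bx, by, depth + 1)

    cover(x1, y1, x2, y2, 0)
    # k-NN results may include points outside the rectangle; drop them.
    return {p for p in found if x1 <= p[0] <= x2 and y1 <= p[1] <= y2}
```

The `stats["queries"]` counter returned by `make_knn_service` gives the number of k-NN calls this baseline issues, which is the cost that the abstract aims to minimize.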
