User-Contributed Relevance and Nearest Neighbor Queries

Novel Web technologies and resulting applications have lead to a participatory data ecosystem that when utilized properly will lead to more rewarding services. In this work, we investigate the case of Location-based Services and specifically of how to improve the typical location-based Point-Of-Interest POI request processed as a k-Nearest-Neighbor query. This work introduces Links-of-interest LOI between POIs as a means to increase the relevance and overall result quality of such queries. By analyzing user-contributed content in the form of travel blogs, we establish the overall popularity of a LOI, i.e., how frequently the respective POI pair is mentioned in the same context. Our contribution is a query processing method for so-called k-Relevant Nearest Neighbor k-RNN queries that considers spatial proximity in combination with LOI information to retrieve close-by and relevant as judged by the crowd POIs. Our method is based on intelligently combining indices for spatial data a spatial grid and for relevance data a graph during query processing. An experimental evaluation using real and synthetic data establishes that our approach efficiently solves the k-RNN problem when compared to existing methods.

[1]  Mark Sanderson,et al.  Spatio-textual Indexing for Geographical Search on the Web , 2005, SSTD.

[2]  Naphtali Rishe,et al.  Keyword Search on Spatial Databases , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[3]  João B. Rocha-Junior,et al.  Efficient Processing of Top-k Spatial Keyword Queries , 2011, SSTD.

[4]  Panos Kalnis,et al.  Efficient OLAP Operations in Spatial Data Warehouses , 2001, SSTD.

[5]  Xing Xie,et al.  Hybrid index structures for location-based web search , 2005, CIKM '05.

[6]  Chengyang Zhang,et al.  Advances in Spatial and Temporal Databases , 2015, Lecture Notes in Computer Science.

[7]  Yong Gao,et al.  Analyzing Relatedness by Toponym Co‐Occurrences on Web Pages , 2014, Trans. GIS.

[8]  Hanan Samet,et al.  Distance browsing in spatial databases , 1999, TODS.

[9]  Christian S. Jensen,et al.  Efficient Retrieval of the Top-k Most Relevant Spatial Web Objects , 2009, Proc. VLDB Endow..

[10]  Torsten Suel,et al.  Efficient query processing in geographic web search engines , 2006, SIGMOD Conference.

[11]  Christian S. Jensen,et al.  Retrieving top-k prestige-based relevant spatial web objects , 2010, Proc. VLDB Endow..

[12]  Hans-Peter Kriegel,et al.  The R*-tree: an efficient and robust access method for points and rectangles , 1990, SIGMOD '90.

[13]  Moni Naor,et al.  Optimal aggregation algorithms for middleware , 2001, PODS.

[14]  Vipin Kumar,et al.  A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs , 1998, SIAM J. Sci. Comput..

[15]  Dieter Pfoser,et al.  Geospatial route extraction from texts , 2010, DMG '10.

[16]  Ken C. K. Lee,et al.  IR-Tree: An Efficient Index for Geographic Document Search , 2011, IEEE Trans. Knowl. Data Eng..

[17]  Mário J. Silva,et al.  Indexing and ranking in Geo-IR systems , 2005, GIR '05.

[18]  Ronald Fagin,et al.  Combining Fuzzy Information from Multiple Systems , 1999, J. Comput. Syst. Sci..