Direct neighbor search

In this paper we study a novel query type, called direct neighbor query. Two objects in a dataset are direct neighbors (DNs) if a window selection may exclusively retrieve these two objects. Given a source object, a DN search computes all of its direct neighbors in the dataset. The DNs define a new type of affinity that differs from existing formulations (e.g., nearest neighbors, nearest surrounders, reverse nearest neighbors, etc) and finds application in domains where user interests are expressed in the form of windows, i.e., multi-attribute range selections. Drawing on key properties of the DN relationship, we develop an I/O optimal processing algorithm for data indexed with a spatial access method. In addition to plain DN search, we also study its K-DN and all-DN variants. The former relaxes the DN condition ‐ two objects are K-DNs if a window query may retrieve them and only up to K 1 other objects ‐ whereas the all-DN variant computes the DNs of every object in the dataset. Using real, large-scale data, we demonstrate the efficiency and practicality of our approach, and show that it vastly outperforms a competitor constructed from previous work.

[1]  Nick Roussopoulos,et al.  Nearest neighbor queries , 1995, SIGMOD '95.

[2]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[3]  Bala R. Vatti A generic solution to polygon clipping , 1992, CACM.

[4]  Xiang Lian,et al.  Efficient Processing of Metric Skyline Queries , 2009, IEEE Transactions on Knowledge and Data Engineering.

[5]  Craig S. Fleisher,et al.  Business and Competitive Analysis : Effective Application of New and Classic Methods , 2012 .

[6]  Derick Wood,et al.  An Optimal Worst Case Algorithm for Reporting Intersections of Rectangles , 1980, IEEE Transactions on Computers.

[7]  Werner Kießling,et al.  Corroborating Information from Web Sources. , 2011 .

[8]  Yin Yang,et al.  Kernel-based skyline cardinality estimation , 2009, SIGMOD Conference.

[9]  Hans-Peter Kriegel,et al.  The R*-tree: an efficient and robust access method for points and rectangles , 1990, SIGMOD '90.

[10]  Werner Kießling,et al.  Foundations of Preferences in Database Systems , 2002, VLDB.

[11]  Werner Kießling,et al.  The Preference SQL System - An Overview , 2011, IEEE Data Eng. Bull..

[12]  Luca J. Senatore,et al.  From survey to representation , 2012, CompIMAGE.

[13]  Bernhard Seeger,et al.  Efficient Computation of Reverse Skyline Queries , 2007, VLDB.

[14]  Georgia Koutrika,et al.  A survey on representation, composition and application of preferences in database systems , 2011, TODS.

[15]  Jeffrey Scott Vitter,et al.  External-Memory Algorithms for Processing Line Segments in Geographic Information Systems , 1996 .

[16]  Christos Faloutsos,et al.  On packing R-trees , 1993, CIKM '93.

[17]  Georgia Koutrika,et al.  Personalizing queries based on networks of composite preferences , 2010, TODS.

[18]  Mario A. López,et al.  STR: a simple and efficient algorithm for R-tree packing , 1997, Proceedings 13th International Conference on Data Engineering.

[19]  Jonathan Goldstein,et al.  When Is ''Nearest Neighbor'' Meaningful? , 1999, ICDT.

[20]  Marios Hadjieleftheriou,et al.  R-Trees - A Dynamic Index Structure for Spatial Searching , 2008, ACM SIGSPATIAL International Workshop on Advances in Geographic Information Systems.

[21]  S. Muthukrishnan,et al.  Influence sets based on reverse nearest neighbor queries , 2000, SIGMOD '00.

[22]  Riccardo Torlone,et al.  Finding the Best when it's a Matter of Preference , 2002, SEBD.

[23]  Christos Doulkeridis,et al.  Reverse top-k queries , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[24]  M. Levandowsky,et al.  Distance between Sets , 1971, Nature.

[25]  Ken C. K. Lee,et al.  Nearest Surrounder Queries , 2006, IEEE Transactions on Knowledge and Data Engineering.

[26]  Jan Chomicki,et al.  Preference formulas in relational queries , 2003, TODS.

[27]  Yufei Tao,et al.  Reverse Nearest Neighbor Query , 2009, Encyclopedia of Database Systems.

[28]  Beng Chin Ooi,et al.  Efficient Progressive Skyline Computation , 2001, VLDB.

[29]  Roger Zimmermann,et al.  Edge indexing in a grid for highly dynamic virtual environments , 2006, MM '06.

[30]  Hanan Samet,et al.  Distance browsing in spatial databases , 1999, TODS.

[31]  Bernhard Seeger,et al.  Progressive skyline computation in database systems , 2005, TODS.

[32]  Cyrus Shahabi,et al.  Processing spatial skyline queries in both vector spaces and spatial network databases , 2009, TODS.

[33]  Heng Tao Shen,et al.  Multi-source Skyline Query Processing in Road Networks , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[34]  Donald Kossmann,et al.  The Skyline operator , 2001, Proceedings 17th International Conference on Data Engineering.