Comparative Analysis of Nearest Neighbor Query Processing Techniques

Abstract The goal of Nearest Neighbour (NN) search is to find the objects in a dataset A that are closest to a query point q. Existing algorithms presume that the dataset is indexed by an R-tree and searching a query point q in a large volume of a dataset, is a tedious task that effects the quality and usefulness of the NNQ processing algorithms which determined by the time as well as space complexity. The simplest solution to the NNS problem is to compute the distance from the query point to every other point in the database. However, due to these complexities issue, the various research techniques have been proposed. It is a technique which has applications in various fields such as pattern recognition, moving object recognition etc. In this paper, a comprehensive analysis on data structures, processing techniques and variety of algorithms in this field is done along with different way to categorize the NNS techniques is presented. This different category such as weighted, additive, reductional, continuous, reverse, principal axis, which are analyzed and compared in this paper. Complexity of different data structures used in different NNS algorithms is also discussed.

[1]  Donald Meagher,et al.  Geometric modeling using octree encoding , 1982, Computer Graphics and Image Processing.

[3]  Sahibsingh A. Dudani The Distance-Weighted k-Nearest-Neighbor Rule , 1976, IEEE Transactions on Systems, Man, and Cybernetics.

[4]  Clifford A. Shaffer,et al.  Optimal quadtree construction algorithms , 1987, Comput. Vis. Graph. Image Process..

[5]  Nick Roussopoulos,et al.  Nearest neighbor queries , 1995, SIGMOD '95.

[6]  Özgür Ulusoy,et al.  A Quadtree-Based Dynamic Attribute Indexing Method , 1998, Comput. J..

[7]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[8]  Peter E. Hart,et al.  The condensed nearest neighbor rule (Corresp.) , 1968, IEEE Trans. Inf. Theory.

[9]  Anil K. Jain,et al.  NOTE ON DISTANCE-WEIGHTED k-NEAREST NEIGHBOR RULES. , 1978 .

[10]  Jon Louis Bentley,et al.  Multidimensional binary search trees used for associative searching , 1975, CACM.

[11]  Ben Wang,et al.  Integration of Projected Clusters and Principal Axis Trees for High-Dimensional Data Indexing and Query , 2004, IDEAL.

[12]  Zheng-Zhi Wang,et al.  Center-based nearest neighbor classifier , 2007, Pattern Recognit..

[13]  Joachim Hertzberg,et al.  Cached k-d tree search for ICP algorithms , 2007, Sixth International Conference on 3-D Digital Imaging and Modeling (3DIM 2007).

[14]  Wenming Zheng,et al.  Locally nearest neighbor classifiers for pattern classification , 2004, Pattern Recognit..

[15]  S. Chandramathi,et al.  A Review of various k-Nearest Neighbor Query Processing Techniques , 2011 .

[16]  Andrew W. Moore,et al.  New Algorithms for Efficient High-Dimensional Nonparametric Classification , 2006, J. Mach. Learn. Res..

[17]  Donald Meagher,et al.  Geometric modeling using octree encoding , 1982, Comput. Graph. Image Process..

[18]  James McNames,et al.  A Fast Nearest-Neighbor Algorithm Based on a Principal Axis Search Tree , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Divyakant Agrawal,et al.  Reverse Nearest Neighbor Queries for Dynamic Databases , 2000, ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery.

[20]  Hanan Samet,et al.  The Quadtree and Related Hierarchical Data Structures , 1984, CSUR.

[21]  Elke Achtert,et al.  Efficient reverse k-nearest neighbor search in arbitrary metric spaces , 2006, SIGMOD Conference.

[22]  Christian S. Jensen,et al.  Nearest neighbor and reverse nearest neighbor queries for moving objects , 2002, Proceedings International Database Engineering and Applications Symposium.

[23]  Yaxin Bi,et al.  KNN Model-Based Approach in Classification , 2003, OTM.

[24]  Shixiong Xia,et al.  An Improved KNN Text Classification Algorithm Based on Clustering , 2009, J. Comput..

[25]  Changshui Zhang,et al.  Tunable Nearest Neighbor Classifier , 2004, DAGM-Symposium.

[26]  Oliver Günther,et al.  Multidimensional access methods , 1998, CSUR.

[27]  Yunjun Gao,et al.  On efficient mutual nearest neighbor query processing in spatial databases , 2009, Data Knowl. Eng..

[28]  Songbo Tan,et al.  Neighbor-weighted K-nearest neighbor for unbalanced text corpus , 2005, Expert Syst. Appl..

[29]  S. Muthukrishnan,et al.  Influence sets based on reverse nearest neighbor queries , 2000, SIGMOD '00.

[30]  Stan Z. Li,et al.  Face recognition using the nearest feature line method , 1999, IEEE Trans. Neural Networks.

[31]  Flip Korn,et al.  Influence sets based on reverse nearest neighbor queries , 2000, SIGMOD 2000.

[32]  Alexandr Andoni,et al.  Nearest neighbor search : the old, the new, and the impossible , 2009 .

[33]  Micah Adler,et al.  Search Space Reductions for Nearest-Neighbor Queries , 2008, TAMC.

[34]  Yufei Tao,et al.  Reverse Nearest Neighbor Search in Metric Spaces , 2006, IEEE Transactions on Knowledge and Data Engineering.

[35]  G. Gates,et al.  The reduced nearest neighbor rule (Corresp.) , 1972, IEEE Trans. Inf. Theory.

[36]  Tomás Skopal,et al.  Pivoting M-tree: A Metric Access Method for Efficient Similarity Search , 2004, DATESO.

[37]  Robert F. Sproull,et al.  Refinements to nearest-neighbor searching ink-dimensional trees , 1991, Algorithmica.

[38]  Stan Z. Li,et al.  Performance Evaluation of the Nearest Feature Line Method in Image Classification and Retrieval , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[39]  G. Gates The Reduced Nearest Neighbor Rule , 1998 .

[40]  Yi-Ching Liaw,et al.  Fast exact k nearest neighbors search using an orthogonal search tree , 2010, Pattern Recognit..

[41]  Pavel Zezula,et al.  M-tree: An Efficient Access Method for Similarity Search in Metric Spaces , 1997, VLDB.

[42]  C. G. Hilborn,et al.  The Condensed Nearest Neighbor Rule , 1967 .

[43]  Klaus Hechenbichler,et al.  Weighted k-Nearest-Neighbor Techniques and Ordinal Classification , 2004 .

[44]  Changshui Zhang,et al.  Extended Nearest Feature Line Classifier , 2004, PRICAI.

[45]  Yi-Ching Liaw,et al.  Fast k-nearest neighbors search using modified principal axis search tree , 2010, Digit. Signal Process..

[46]  Rina Panigrahy,et al.  An Improved Algorithm Finding Nearest Neighbor Using Kd-trees , 2008, LATIN.

[47]  Vandana,et al.  Survey of Nearest Neighbor Techniques , 2010, ArXiv.

[48]  Haibo Hu,et al.  Range Nearest-Neighbor Query , 2006, IEEE Trans. Knowl. Data Eng..

[49]  King-Ip Lin,et al.  An index structure for efficient reverse nearest neighbor queries , 2001, Proceedings 17th International Conference on Data Engineering.

[50]  Yufei Tao,et al.  Continuous Nearest Neighbor Search , 2002, VLDB.

[51]  Yaxin Bi,et al.  An kNN Model-Based Approach and Its Application in Text Categorization , 2004, CICLing.

[52]  Marios Hadjieleftheriou,et al.  R-Trees - A Dynamic Index Structure for Spatial Searching , 2008, ACM SIGSPATIAL International Workshop on Advances in Geographic Information Systems.

[53]  Sousa Lobo,et al.  SHIP NOISE CLASSIFICATION , 2002 .

[54]  Christian S. Jensen,et al.  Nearest and reverse nearest neighbor queries for moving objects , 2006, The VLDB Journal.