Shape indexing using approximate nearest-neighbour search in high-dimensional spaces

Shape indexing is a way of making rapid associations between features detected in an image and object models that could have produced them. When model databases are large, the use of high-dimensional features is critical, due to the improved level of discrimination they can provide. Unfortunately, finding the nearest neighbour to a query point rapidly becomes inefficient as the dimensionality of the feature space increases. Past indexing methods have used hash tables for hypothesis recovery, but only in low-dimensional situations. In this paper we show that a new variant of the k-d tree search algorithm makes indexing in higher-dimensional spaces practical. This Best Bin First, or BBF search is an approximate algorithm which finds the nearest neighbour for a large fraction of the queries, and a very close neighbour in the remaining cases. The technique has been integrated into a fully developed recognition system, which is able to detect complex objects in real, cluttered scenes in just a few seconds.

[1]  Jon Louis Bentley,et al.  An Algorithm for Finding Best Matches in Logarithmic Expected Time , 1977, TOMS.

[2]  Laurent Miclet,et al.  Approximative fast nearest-neighbour recognition , 1983, Pattern Recognit. Lett..

[3]  Song B. Park,et al.  A Fast k Nearest Neighbor Finding Algorithm Based on the Ordered Partition , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Rakesh Mohan,et al.  Book review: PERCEPTUAL ORGANIZATION AND VISUAL RECOGNITION by David G. Lowe (Kluwer Academic Publishers) , 1987, SGAR.

[5]  Heinrich Niemann,et al.  An efficient branch-and-bound nearest neighbour classifier , 1988, Pattern Recognition Letters.

[6]  Yehezkel Lamdan,et al.  Geometric Hashing: A General And Efficient Model-based Recognition Scheme , 1988, [1988 Proceedings] Second International Conference on Computer Vision.

[7]  Yehezkel Lamdan,et al.  Affine invariant model-based object recognition , 1990, IEEE Trans. Robotics Autom..

[8]  David A. Forsyth,et al.  Invariance-a new framework for vision , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[9]  W. Eric L. Grimson,et al.  On the sensitivity of geometric hashing , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[10]  David W. Jacobs,et al.  Space and Time Bounds on Indexing 3D Models from 2D Images , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  David G. Lowe,et al.  Fitting Parameterized Three-Dimensional Models to Images , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  David A. Forsyth,et al.  Efficient model library access by projectively invariant indexing functions , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[13]  Gérard G. Medioni,et al.  Structural Indexing: Efficient 3-D Object Recognition , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Rakesh Mohan,et al.  Multidimensional Indexing for Recognizing Visual Shapes , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  David G. Lowe,et al.  Learning indexing functions for 3-D model-based object recognition , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Katsushi Ikeuchi,et al.  Sensor Modeling, Probabilistic Hypothesis Generation, and Robust Localization for Object Recognition , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Shree K. Nayar,et al.  Closest point search in high dimensions , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[18]  S. Arya Nearest neighbor searching and applications , 1996 .

[19]  David G. Lowe,et al.  Perceptual Organization and Visual Recognition , 2012 .