Complexity of Indexing: Efficient and Learnable Large Database Indexing

Object recognition starts from a set of image measurements (including locations of points, lines, surfaces, color, and shading), which provides access into a database where representations of objects are stored. We describe a complexity theory of indexing, a meta-analysis which identifies the best set of measurements (up to algebraic transformations) such that: (1) the representation of objects are linear subspaces and thus easy to learn; (2) direct indexing is efficient since the linear subspaces are of minimal rank. The index complexity is determined via a simple process, equivalent to computing the rank of a matrix. We readily re-derive the index complexity of the few previously analyzed cases. We then compute the best index for new cases: 6 points in one perspective image, and 6 directions in one para-perspective image; the most efficient representation of a color is a plane in 3D space. For future applications with any vision problem where the relations between shape and image measurements can be written down in an algebraic form, we give an automatic process to construct the most efficient database that can be directly obtained by learning from examples.

[1]  David A. Forsyth,et al.  Extracting projective structure from single perspective views of 3D point sets , 1993, 1993 (4th) International Conference on Computer Vision.

[2]  John F. Canny,et al.  Efficient indexing techniques for model based sensing , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Daphna Weinshall Model-based invariants for 3D vision , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[4]  J.B. Burns,et al.  View Variation of Point-Set and Line-Segment Features , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  L. Maloney,et al.  Color constancy: a method for recovering surface spectral reflectance. , 1986, Journal of the Optical Society of America. A, Optics and image science.

[6]  Shimon Ullman,et al.  Limitations of Non Model-Based Recognition Schemes , 1992, ECCV.

[7]  Carlo Tomasi Pictures and trails: a new framework for the computation of shape and motion from perspective image sequences , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Michael Werman,et al.  Duality of Multi-Point and Multi-Frame Geometry: Fundamental Shape Matrices and Tensors , 1996, ECCV.

[9]  T. Poggio,et al.  A network that learns to recognize three-dimensional objects , 1990, Nature.

[10]  David W. Jacobs,et al.  Space and Time Bounds on Indexing 3D Models from 2D Images , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  K NayarShree,et al.  Visual learning and recognition of 3-D objects from appearance , 1995 .

[12]  Michael Werman,et al.  The study of 3D-from-2D using elimination , 1995, Proceedings of IEEE International Conference on Computer Vision.