Experiments on (Intelligent) Brute Force Methods for Appearance-Based Object Recognition

It has long been recognized that, in principle object recognition problems can be solved by simple, brute force methods. However, the approach has generally been held to be completely impractical. We argue that by combining a few more or less standard tricks with computational resources that are historically large, but completely feasible by recent standards , dramatic results can be achieved for a number of recognition problems. In particular, we describe a resource-intensive, appearance-based method that utilizes intermediate-level features to provide normalized keys into a large, memorized feature database, and Bayesian evidence combination coupled with a Hough-like indexing scheme to assemble object hypotheses from the memory. This system demonstrates robust recognition of a variety of 3-D shapes, ranging from sports cars and ghter planes to snakes and lizards over full spherical or hemispherical ranges (and pla-nar scale, translation and rotation). We report the results of various large-scale performance tests, involving , altogether, over 2000 separate test images. These include performance scaling with database size, robustness against clutter, and generic ability. The result of 97% forced choice accuracy with full ortho-graphic invariance for 24 complex curved 3-D objects over full viewing spheres or hemispheres is the best we are aware of for this type of problem.

[1]  Randal C. Nelson,et al.  Finding Line Segments by Stick Growing , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  David G. Lowe,et al.  Three-Dimensional Object Recognition from Single Two-Dimensional Images , 1987, Artif. Intell..

[3]  R. Bolles,et al.  Recognizing and Locating Partially Visible Objects: The Local-Feature-Focus Method , 1982 .

[4]  T. Poggio,et al.  A network that learns to recognize three-dimensional objects , 1990, Nature.

[5]  William Grimson,et al.  Object recognition by computer - the role of geometric constraints , 1991 .

[6]  Roberto Brunelli,et al.  Face Recognition: Features Versus Templates , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  W. Eric L. Grimson,et al.  On the Sensitivity of the Hough Transform for Object Recognition , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Olivier D. Faugeras,et al.  HYPER: A New Approach for the Recognition and Positioning of Two-Dimensional Objects , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Robert C. Bolles,et al.  Representation space: an approach to the integration of visual information , 1989, Proceedings CVPR '89: IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[10]  Ronen Basri,et al.  Recognition by Linear Combinations of Models , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Cordelia Schmid,et al.  Combining greyvalue invariants with local constraints for object recognition , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[12]  W. Eric L. Grimson,et al.  On the sensitivity of geometric hashing , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[13]  Hiroshi Murase,et al.  Learning and recognition of 3D objects from appearance , 1993, [1993] Proceedings IEEE Workshop on Qualitative Vision.

[14]  Yehezkel Lamdan,et al.  Geometric Hashing: A General And Efficient Model-based Recognition Scheme , 1988, [1988 Proceedings] Second International Conference on Computer Vision.

[15]  Ruzena Bajcsy,et al.  Recovery of Parametric Models from Range Images: The Case for Superquadrics with Global Deformations , 1990, IEEE Trans. Pattern Anal. Mach. Intell..