The Challenge of Generic Object Recognition

We discuss the issues and challenges in the development of generic object recognition systems. We argue that high-level, volumetric part-based, descriptions are essential if we want to recognize objects which are similar but not identical to pre-stored models, under wide viewing conditions, and to automatically learn new models and add them to our knowledge base.We discuss the representation scheme and its relationships to the description extraction, recognition and learning processes. We then describe the difficulties in obtaining such descriptions from images and outline steps for robust and efficient implementations. We also demonstrate the viability of the arguments by reporting on recent progress.

[1]  Rodney A. Brooks,et al.  Model-Based Three-Dimensional Interpretations of Two-Dimensional Images , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Gérard G. Medioni,et al.  Inferring global perceptual contours from local features , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[3]  M. B. Clowes,et al.  On Seeing Things , 1971, Artif. Intell..

[4]  Jean Ponce,et al.  Invariant Properties of Straight Homogeneous Generalized Cylinders and Their Contours , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Ramakant Nevatia,et al.  From an Intensity Image to 3-D Segmented Descriptions , 1996, Object Representation in Computer Vision.

[6]  T. Binford,et al.  Finding and recovering SHGC objects in an edge image , 1993 .

[7]  Thomas O. Binford,et al.  Bayesian inference in model-based machine vision , 1987, Int. J. Approx. Reason..

[8]  L. Stark,et al.  Dissertation Abstract , 1994, Journal of Cognitive Education and Psychology.

[9]  Ramakant Nevatia,et al.  Using Invariance and Quasi-Invariance for the Segmentation and Recovery of Curved Objects , 1993, Applications of Invariance in Computer Vision.

[10]  A. Macworth Interpreting pictures of polyhedral scenes , 1973 .

[11]  Andrew Zisserman,et al.  Geometric invariance in computer vision , 1992 .

[12]  Ramakant Nevatia,et al.  Recovering shape from contour for constant cross section generalized cylinders , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[13]  Azriel Rosenfeld,et al.  Recognition by Functional Parts , 1995, Comput. Vis. Image Underst..

[14]  Gérard G. Medioni,et al.  Hierarchical Decomposition and Axial Shape Description , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Alan K. Mackworth Interpreting Pictures of Polyhedral Scenes , 1973, IJCAI.

[16]  David G. Lowe,et al.  Perceptual Organization and Visual Recognition , 2012 .

[17]  R. Nevatia,et al.  Quasi-invariant properties and 3-D shape recovery of non-straight, non-constant generalized cylinders , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[18]  A. Pentland Recognition by Parts , 1987 .

[19]  Robert Bergevin,et al.  Generic Object Recognition: Building and Matching Coarse Descriptions from Line Drawings , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  William Grimson,et al.  Object recognition by computer - the role of geometric constraints , 1991 .

[21]  I. Biederman Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.

[22]  Ramakant Nevatia,et al.  Description and Recognition of Curved Objects , 1977, Artif. Intell..