Object Recognition

Object recognition is a subproblem of the more general problem of perception, and can be defined as follows. Given a scene consisting of one or more objects, can we identify and localize those objects that are sufficiently visible to the sensory system? It is generally assumed that a description of each object to be recognized is available to the computer and can be used to facilitate the task of identification and localization. These descriptions can either be model-based or appearance-based, or a combination of both. Model-based object representation is based on geometric features, whereas appearance-based representation uses a large set of images for training but does not require any insight on the geometric structure of the objects. Object recognition is a key component of many intelligent vision systems, such as those used in hand-eye coordination for bin picking, inspection, and mobile robotics.

[1]  Nils J. Nilsson,et al.  Artificial Intelligence , 1974, IFIP Congress.

[2]  Hans P. Moravec Towards Automatic Visual Obstacle Avoidance , 1977, IJCAI.

[3]  Paul Beaudet,et al.  Rotationally invariant image operators , 1978 .

[4]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[5]  Dana H. Ballard,et al.  Generalizing the Hough transform to detect arbitrary shapes , 1981, Pattern Recognit..

[6]  R. Weale Vision. A Computational Investigation Into the Human Representation and Processing of Visual Information. David Marr , 1983 .

[7]  Robert C. Bolles,et al.  3DPO: A Three- Dimensional Part Orientation System , 1986, IJCAI.

[8]  W. Grimson,et al.  Model-Based Recognition and Localization from Sparse Range or Tactile Data , 1984 .

[9]  Berthold K. P. Horn Extended Gaussian images , 1984, Proceedings of the IEEE.

[10]  E. Dubois,et al.  Digital picture processing , 1985, Proceedings of the IEEE.

[11]  Charles R. Dyer,et al.  Model-based recognition in robot vision , 1986, CSUR.

[12]  Olivier D. Faugeras,et al.  HYPER: A New Approach for the Recognition and Positioning of Two-Dimensional Objects , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Yoshiaki Shirai,et al.  Three-Dimensional Computer Vision , 1987, Symbolic Computation.

[14]  R. Brooks Model-Based Three-Dimensional Interpretations of Two-Dimensional Images , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  A. C. Kak,et al.  Edge Extraction and Labeling from Structured Light 3-D Vision Data , 1988 .

[16]  G. Medioni,et al.  Recognizing 3-D Objects Using Surface Descriptions , 1989, [1988 Proceedings] Second International Conference on Computer Vision.

[17]  Azriel Rosenfeld,et al.  Computer Vision , 1988, Adv. Comput..

[18]  Takeo Kanade,et al.  Automatic generation of object recognition programs , 1988, Proc. IEEE.

[19]  Mark W. Spong,et al.  Robot dynamics and control , 1989 .

[20]  Radu Horaud,et al.  Stereo Correspondence Through Feature Grouping and Maximal Cliques , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Avinash C. Kak,et al.  A robot vision system for recognizing 3D objects in low-order polynomial time , 1989, IEEE Trans. Syst. Man Cybern..

[22]  Lawrence Sirovich,et al.  Application of the Karhunen-Loeve Procedure for the Characterization of Human Faces , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  W. Eric L. Grimson,et al.  On the Sensitivity of the Hough Transform for Object Recognition , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  Avinash C. Kak,et al.  A robot vision system for recognition of generic shaped objects , 1991, CVGIP Image Underst..

[25]  Edward H. Adelson,et al.  The Design and Use of Steerable Filters , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[26]  W. Eric L. Grimson,et al.  On the Verification of Hypothesized Matches in Model-Based Recognition , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[28]  Anil K. Jain,et al.  BONSAI: 3D Object Recognition Using Constrained Search , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[29]  Avinash C. Kak,et al.  3-D Object Recognition Using Bipartite Matching Embedded in Discrete Relaxation , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  Max A. Viergever,et al.  Scale and the differential structure of images , 1992, Image Vis. Comput..

[31]  Pascal Fua,et al.  Computational strategies for object recognition , 1992, CSUR.

[32]  Larry S. Davis,et al.  Model-Based Object Pose in 25 Lines of Code , 1992, ECCV.

[33]  W. Eric L. Grimson,et al.  Fast and robust 3D recognition by alignment , 1993, 1993 (4th) International Conference on Computer Vision.

[34]  Jake K. Aggarwal,et al.  Model-based object recognition in dense-range images—a review , 1993, CSUR.

[35]  Carlo Tomasi,et al.  Good features to track , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[36]  Michael Lindenbaum Bounds on Shape Recognition Performance , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[37]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[38]  S. P. Mudur,et al.  Three-dimensional computer vision: a geometric viewpoint , 1993 .

[39]  Avinash C. Kak,et al.  Interactive Learning of a Multiple-Attribute Hash Table Classifier for Fast Object Recognition , 1995, Comput. Vis. Image Underst..

[40]  Akio Kosaka,et al.  Vision-based bin-picking: recognition and localization of multiple complex objects using simple visual cues , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.

[41]  Hiroshi Murase,et al.  Subspace methods for robot vision , 1996, IEEE Trans. Robotics Autom..

[42]  Larry S. Davis,et al.  Iterative Pose Estimation Using Coplanar Feature Points , 1996, Comput. Vis. Image Underst..

[43]  Jeff L. Edwards An active, appearance-based approach to the pose estimation of complex objects , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.

[44]  David W. Jacobs,et al.  Robust and Efficient Detection of Salient Convex Groups , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[45]  Luc Van Gool,et al.  Affine/ Photometric Invariants for Planar Intensity Patterns , 1996, ECCV.

[46]  Hiroshi Murase,et al.  Real-time 100 object recognition system , 1996, Proceedings of IEEE International Conference on Robotics and Automation.

[47]  Richard Lepage,et al.  Knowledge-Based Image Understanding Systems: A Survey , 1997, Comput. Vis. Image Underst..

[48]  Elias S. Manolakos,et al.  Unsupervised statistical neural networks for model-based object recognition , 1997, IEEE Trans. Signal Process..

[49]  Guillermo Sapiro,et al.  Minimal Surfaces Based Object Segmentation , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[50]  Tony Lindeberg,et al.  Shape-adapted smoothing in estimation of 3-D shape cues from affine deformations of local 2-D brightness structure , 1997, Image Vis. Comput..

[51]  Massimiliano Pontil,et al.  Support Vector Machines for 3D Object Recognition , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[52]  Andrew M. Wallace,et al.  Measuring range using a triangulation sensor with variable geometry , 1998, IEEE Trans. Robotics Autom..

[53]  Takeo Kanade,et al.  Neural Network-Based Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[54]  J. Mundy Object recognition based on geometry: progress over three decades , 1998, Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[55]  Tomaso A. Poggio,et al.  A general framework for object detection , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[56]  Hiroshi Murase,et al.  Coarse-to-fine adaptive masks for appearance matching of occluded scenes , 1998, Machine Vision and Applications.

[57]  A. Martinez,et al.  Face image retrieval using HMMs , 1999, Proceedings IEEE Workshop on Content-Based Access of Image and Video Libraries (CBAIVL'99).

[58]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[59]  Adam Baumberg,et al.  Reliable feature matching across widely separated views , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[60]  Luc Van Gool,et al.  Wide Baseline Stereo Matching based on Local, Affinely Invariant Regions , 2000, BMVC.

[61]  Tomaso A. Poggio,et al.  Example-Based Object Detection in Images by Components , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[62]  Krystian Mikolajczyk,et al.  Detection of local features invariant to affines transformations , 2002 .

[63]  Cordelia Schmid,et al.  An Affine Invariant Interest Point Detector , 2002, ECCV.

[64]  Rainer Lienhart,et al.  Empirical Analysis of Detection Cascades of Boosted Classifiers for Rapid Object Detection , 2003, DAGM-Symposium.

[65]  Bart M. ter Haar Romeny,et al.  Front-End Vision and Multi-Scale Image Analysis , 2003, Computational Imaging and Vision.

[66]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[67]  Shimon Ullman,et al.  Recognizing solid objects by alignment with an image , 1990, International Journal of Computer Vision.

[68]  Demetri Terzopoulos,et al.  Snakes: Active contour models , 2004, International Journal of Computer Vision.

[69]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[70]  Tony Lindeberg,et al.  Feature Detection with Automatic Scale Selection , 1998, International Journal of Computer Vision.

[71]  Linda G. Shapiro,et al.  Accumulator-based inexact matching using relational summaries , 2005, Machine Vision and Applications.

[72]  Paul A. Viola,et al.  Detecting Pedestrians Using Patterns of Motion and Appearance , 2005, International Journal of Computer Vision.

[73]  Cordelia Schmid,et al.  A performance evaluation of local descriptors , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[74]  Larry S. Davis,et al.  Model-based object pose in 25 lines of code , 1992, International Journal of Computer Vision.

[75]  Hiroshi Murase,et al.  Visual learning and recognition of 3-d objects from appearance , 2005, International Journal of Computer Vision.