Statistical object recognition

To be practical, recognition systems must deal with uncertainty. Positions of image features in scenes vary. Features sometimes fail to appear because of unfavorable illumination. In this work, methods of statistical inference are combined with empirical models of uncertainty in order to evaluate and refine hypotheses about the occurrence of a known object in a scene. Probabilistic models are used to characterize image features and their correspondences. A statistical approach is taken for the acquisition of object models from observations in images: Mean Edge Images are used to capture object features that are reasonably stable with respect to variations in illumination. The Alignment approach to recognition, that has been described by Huttenlocher and Ullman, is used. The mechanisms that are employed to generate initial hypotheses are distinct from those that are used to verify (and refine) them. In this work, posterior probability and Maximum Likelihood are the criteria for evaluating and refining hypotheses. The recognition strategy advocated in this work may be summarized as Align Refine Verify, whereby local search in pose space is utilized to refine hypotheses from the alignment stage before verification is carried out. Two formulations of model-based object recognition are described. MAP Model Matching evaluates joint hypotheses of match and pose, while Posterior Marginal Pose Estimation evaluates the pose only. Local search in pose space is carried out with the Expectation-Maximization (EM) algorithm. Recognition experiments are described where the EM algorithm is used to refine and evaluate pose hypotheses in 2D and 3D. Initial hypotheses for the 2D experiments were generated by a simple indexing method: Angle Pair Indexing. The Linear Combination of Views method of Ullman and Basri is employed as the projection model in the 3D experiments. (Copies available exclusively from MIT Libraries, Rm. 14-0551, Cambridge MA 02139-4307. Ph. 617-253-5668; Fax 617-253-1690.)

[1]  A. Maradudin,et al.  Elements of solid state theory , 1959 .

[2]  Robert C. Bolles,et al.  Parametric Correspondence and Chamfer Matching: Two New Techniques for Image Matching , 1977, IJCAI.

[3]  D Marr,et al.  Theory of edge detection , 1979, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[4]  Rodney A. Brooks,et al.  Model-Based Three-Dimensional Interpretations of Two-Dimensional Images , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Robert M. Haralick,et al.  Digital Step Edges from Zero Crossing of Second Directional Derivatives , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  W. Eric L. Grimson,et al.  Computational Experiments with a Feature Based Stereo Algorithm , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  José L. Marroquín,et al.  Probabilistic solution of inverse problems , 1985 .

[9]  Ramesh C. Jain,et al.  Three-dimensional object recognition , 1985, CSUR.

[10]  Jeffrey H. Shapiro,et al.  Performance Analyses For Peak-Detecting Laser Radars , 1986, Other Conferences.

[11]  Olivier D. Faugeras,et al.  HYPER: A New Approach for the Recognition and Positioning of Two-Dimensional Objects , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Berthold K. P. Horn Robot vision , 1986, MIT electrical engineering and computer science series.

[13]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Andrew Blake,et al.  Visual Reconstruction , 1987, Deep Learning for EEG-Based Brain–Computer Interfaces.

[15]  W. Eric L. Grimson,et al.  Localizing Overlapping Parts by Searching the Interpretation Tree , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  S. Goldman EFFICIENT METHODS FOR CALCULATING MAXIMUM ENTROPY DISTRIBUTIONS , 1987 .

[17]  Tomaso Poggio,et al.  Probabilistic Solution of Ill-Posed Problems in Computational Vision , 1987 .

[18]  Stephen T. Barnard,et al.  Stereo Matching by Hierarchical, Microcanonical Annealing , 1987, IJCAI.

[19]  Terrence J. Sejnowski,et al.  Parallel Networks that Learn to Pronounce English Text , 1987, Complex Syst..

[20]  T.A. Cass A robust parallel implementation of 2D model-based recognition , 1988, Proceedings CVPR '88: The Computer Society Conference on Computer Vision and Pattern Recognition.

[21]  Josef Kittler,et al.  A survey of the hough transform , 1988, Comput. Vis. Graph. Image Process..

[22]  Gunilla Borgefors,et al.  Hierarchical Chamfer Matching: A Parametric Edge Matching Algorithm , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Shimon Ullman,et al.  Structural Saliency: The Detection Of Globally Salient Structures using A Locally Connected Network , 1988, [1988 Proceedings] Second International Conference on Computer Vision.

[24]  Pascal Fua,et al.  Objective functions for feature discrimination: applications to semiautomated and automated feature extraction , 1989 .

[25]  A. R. Hanson,et al.  Robust estimation of camera location and orientation from noisy data having outliers , 1989, [1989] Proceedings. Workshop on Interpretation of 3D Scenes.

[26]  J. Ross Beveridge,et al.  Optimization of 2-dimensional model matching , 1989 .

[27]  Pascal Fua,et al.  Objective functions for feature discrimination: theory , 1989 .

[28]  Micha Sharir,et al.  The upper envelope of voronoi surfaces and its applications , 1991, SCG '91.

[29]  A. Shashua Correspondence and Affine Shape from Two Orthographic Views: Motion and Recognition , 1991 .

[30]  W. Well MAP model matching , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[31]  David W. Jacobs,et al.  Model group indexing for recognition , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[32]  Richard S. Stephens,et al.  Probabilistic approach to the Hough transform , 1991, Image Vis. Comput..

[33]  Murali M. Menon Massively parallel image restoration , 1991, Defense, Security, and Sensing.

[34]  David W. Jacobs,et al.  Space and Time Bounds on Indexing 3D Models from 2D Images , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[35]  Ronen Basri,et al.  Recognition by Linear Combinations of Models , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[36]  William M. Wells Statistical approach to model matching , 1991, Other Conferences.

[37]  Todd A. Cass,et al.  Polynomial-Time Object Recognition in the Presence of Clutter, Occlusion, and Uncertainty , 1992, ECCV.

[38]  Edward M. Riseman,et al.  Matching complex images to multiple 3D objects using view description networks , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[39]  Jeffrey H. Shapiro,et al.  Maximum-likelihood laser radar range profiling with the expectation-maximization algorithm , 1992 .

[40]  Shimon Edelman,et al.  Bringing the Grandmother back into the Picture: A Memory-Based View of Object Recognition , 1990, Int. J. Pattern Recognit. Artif. Intell..

[41]  Richard A. Robb,et al.  New approach to 3-D registration of multimodality medical images by surface matching , 1992, Other Conferences.

[42]  Micha Sharir,et al.  The upper envelope of voronoi surfaces and its applications , 1993, Discret. Comput. Geom..