A Two-Stage Probabilistic Approach for Object Recognition

Assume that some objects are present in an image but can be seen only partially and are overlapping each other. To recognize the objects, we have to firstly separate the objects from one another, and then match them against the modeled objects using partial observation. This paper presents a probabilistic approach for solving this problem. Firstly, the task is formulated as a two-stage optimal estimation process. The first stage, matching, separates different objects and finds feature correspondences between the scene and each potential model object. The second stage, recognition, resolves inconsistencies among the results of matching to different objects and identifies object categories. Both the matching and recognition are formulated in terms of the maximum a posteriori (MAP) principle. Secondly, contextual constraints, which play an important role in solving the problem, are incorporated in the probabilistic formulation. Specifically, between-object constraints are encoded in the prior distribution modeled as a Markov random field, and within-object constraints are encoded in the likelihood distribution modeled as a Gaussian. They are combined into the posterior distribution which defines the MAP solution. Experimental results are presented for matching and recognizing jigsaw objects under partial occlusion, rotation, translation and scaling.

[1]  Donald Geman,et al.  Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images , 1984 .

[2]  S. Ullman High-Level Vision: Object Recognition and Visual Cognition , 1996 .

[3]  Jun Zhang,et al.  A Markov Random Field Model-Based Approach to Image Interpretation , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[5]  Steven W. Zucker,et al.  On the Foundations of Relaxation Labeling Processes , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Ramesh C. Jain,et al.  Three-dimensional object recognition , 1985, CSUR.

[7]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Stan Z. Li,et al.  Markov Random Field Modeling in Computer Vision , 1995, Computer Science Workbench.

[9]  J. Hornegger,et al.  Statistical learning, localization, and identification of objects , 1995, Proceedings of IEEE International Conference on Computer Vision.

[10]  Kanti V. Mardia,et al.  Statistics and Images 1:Overview , 1993 .

[11]  Olivier D. Faugeras,et al.  HYPER: A New Approach for the Recognition and Positioning of Two-Dimensional Objects , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Carsten Peterson,et al.  A New Method for Mapping Optimization Problems Onto Neural Networks , 1989, Int. J. Neural Syst..

[13]  J. Besag On the Statistical Analysis of Dirty Pictures , 1986 .

[14]  O. Faugeras Three-dimensional computer vision: a geometric viewpoint , 1993 .

[15]  William Grimson,et al.  Object recognition by computer - the role of geometric constraints , 1991 .

[16]  Paul R. Cooper,et al.  Parallel structure recognition with uncertainty: coupled segmentation and matching , 1990, [1990] Proceedings Third International Conference on Computer Vision.