A generalized method for 3D object location from single 2D images

Abstract A method is presented to solve for the three-dimensional (3D) object position and orientation from single two-dimensional (2D) images based on a systematic hierarchy of features: primitive features, generalized features, and compound structures. A virtual viewpoint analysis shows that a feature-centered coordinate system allows us to drastically reduce the mathematical complexity of the inverse projection problem for higher order curves. At the heart of the matchings are the matching functions for primitive features, which are: point-to-point correspondence, line-to-line correspondence, and ellipse-to-ellipse correspondence. Parametric matching by generalized features is used to solve for viewpoints at any stage of matching. The generalized features can be viewed as the generalization of features used by previous researchers. The matching method goes “top-down” and then “bottom- up”, if necessary. Models of the objects are analyzed first. In the top-down process, search starts from the most complex features to the least complex features. In the bottom-up process, the least complex generalized features are used to solve for initial viewpoints. The viewpoints are further refined by matching the compound structures which are obtained by merging the consistent generalized features. The contributions include: (1) solving the problem of ellipse-to-ellipse correspondence, thus extending the class of recognizable objects; (2) the generalized features can assume various configurations, thus enabling the system to detect a multitude of features; (3) parametric matching instead of point-point matching makes the algorithm robust; (4) the top-down-bottom-up strategy combines efficiency with robustness.

[1]  Robert C. Bolles,et al.  Robust Feature Matching Through Maximal Cliques , 1979, Other Conferences.

[2]  Radu Horaud,et al.  New Methods for Matching 3-D Objects with Single Perspective Views , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  David G. Lowe,et al.  Three-Dimensional Object Recognition from Single Two-Dimensional Images , 1987, Artif. Intell..

[4]  Rodney A. Brooks,et al.  Symbolic Reasoning Among 3-D Models and 2-D Images , 1981, Artif. Intell..

[5]  C. Bron,et al.  Algorithm 457: finding all cliques of an undirected graph , 1973 .

[6]  Robert M. Haralick,et al.  Computer vision theory: The lack thereof , 1986, Comput. Vis. Graph. Image Process..

[7]  Lawrence G. Roberts,et al.  Machine Perception of Three-Dimensional Solids , 1963, Outstanding Dissertations in the Computer Sciences.

[8]  R. Brooks Model-Based Three-Dimensional Interpretations of Two-Dimensional Images , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Takeo Kanade,et al.  Recovery of the Three-Dimensional Shape of an Object from a Single View , 1981, Artif. Intell..

[10]  Coenraad Bron,et al.  Finding all cliques of an undirected graph , 1973 .

[11]  D. W. Thompson,et al.  Three-dimensional model matching from an unconstrained viewpoint , 1987, Proceedings. 1987 IEEE International Conference on Robotics and Automation.

[12]  Takeo Kanade,et al.  Mapping Image Properties into Shape Constraints: Skewed Symmetry, Affine-Transformable Patterns, and the Shape-from-Texture Paradigm , 1983 .

[13]  Larry S. Davis,et al.  Pose Determination of a Three-Dimensional Object Using Triangle Pairs , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  David G. Lowe,et al.  Perceptual Organization and Visual Recognition , 2012 .

[15]  Chris Goad,et al.  Special purpose automatic programming for 3D model-based vision , 1987 .

[16]  Rodney A. Brooks,et al.  Model-Based Three-Dimensional Interpretations of Two-Dimensional Images , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Antonio Albano,et al.  Representation of Digitized Contours in Terms of Conic Arcs and Straight-Line Segments , 1974, Comput. Graph. Image Process..