3-D to 2-D Pose Determination with Regions

This paper presents a novel approach to parts-based object recognition in the presence of occlusion. We focus on the problem of determining the pose of a 3-D object from a single 2-D image when convex parts of the object have been matched to corresponding regions in the image. We consider three types of occlusions: self-occlusion, occlusions whose locus is identified in the image, and completely arbitrary occlusions. We show that in the first two cases this is a convex optimization problem, derive efficient algorithms, and characterize their performance. For the last case, we prove that the problem of finding valid poses is computationally hard, but provide an efficient, approximate algorithm. This work generalizes our previous work on region-based object recognition, which focused on the case of planar models.

[1]  Hooshang Hemami,et al.  Identification of Three-Dimensional Objects Using Fourier Descriptors of the Boundary Curve , 1974, IEEE Trans. Syst. Man Cybern..

[2]  Radu Horaud,et al.  New Methods for Matching 3-D Objects with Single Perspective Views , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  David J. Kriegman,et al.  Invariant-based recognition of complex curved 3D objects from image contours , 1995, Proceedings of IEEE International Conference on Computer Vision.

[4]  D. Whittaker,et al.  A Course in Functional Analysis , 1991, The Mathematical Gazette.

[5]  Rodney A. Brooks,et al.  Symbolic Reasoning Among 3-D Models and 2-D Images , 1981, Artif. Intell..

[6]  Marco Pellegrini,et al.  Stabbing and ray shooting in 3 dimensional space , 1990, SCG '90.

[7]  Robert Bergevin,et al.  Generic Object Recognition: Building and Matching Coarse Descriptions from Line Drawings , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Shree K. Nayar,et al.  Reflectance based object recognition , 1996, International Journal of Computer Vision.

[9]  Dimitris N. Metaxas,et al.  Dynamic 3D Models with Local and Global Deformations: Deformable Superquadrics , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  W. Eric L. Grimson,et al.  Object recognition by alignment using invariant projections of planar surfaces , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[11]  V. Rich Personal communication , 1989, Nature.

[12]  Ramakant Nevatia,et al.  Using Invariance and Quasi-Invariance for the Segmentation and Recovery of Curved Objects , 1993, Applications of Invariance in Computer Vision.

[13]  Terrance E. Boult,et al.  Recovery of generalized cylinders from a single intensity image , 1990 .

[14]  Yehezkel Lamdan,et al.  Geometric Hashing: A General And Efficient Model-based Recognition Scheme , 1988, [1988 Proceedings] Second International Conference on Computer Vision.

[15]  King-Sun Fu,et al.  Shape Discrimination Using Fourier Descriptors , 1977, IEEE Trans. Syst. Man Cybern..

[16]  Dimitris N. Metaxas,et al.  Dynamic 3D models with local and global deformations: deformable superquadrics , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[17]  I. Biederman Human image understanding: Recent research and a theory , 1985, Computer Vision Graphics and Image Processing.

[18]  William Rucklidge,et al.  Efficiently Locating Objects Using the Hausdorff Distance , 1997, International Journal of Computer Vision.

[19]  David W. Jacobs Matching 3-D Models to 2-D Images , 2004, International Journal of Computer Vision.

[20]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[21]  David G. Lowe,et al.  Perceptual Organization and Visual Recognition , 2012 .

[22]  Daniel P. Huttenlocher,et al.  Comparing Images Using the Hausdorff Distance , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  J J Koenderink,et al.  Affine structure from motion. , 1991, Journal of the Optical Society of America. A, Optics and image science.

[24]  King-Sun Fu,et al.  Shape Discrimination Using Fourier Descriptors , 1977, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Takeo Kanade,et al.  A Paraperspective Factorization Method for Shape and Motion Recovery , 1994, ECCV.

[26]  D. W. Thompson,et al.  Three-dimensional model matching from an unconstrained viewpoint , 1987, Proceedings. 1987 IEEE International Conference on Robotics and Automation.

[27]  David A. Forsyth,et al.  Recognising rotationally symmetric surfaces from their outlines , 1992, ECCV.

[28]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  T. Kanade,et al.  The theory of straight homogeneous generalized cylinders and A taxonomy of generalized cylinders , 1983 .

[30]  Takeo Kanade,et al.  Shape and motion from image streams under orthography: a factorization method , 1992, International Journal of Computer Vision.

[31]  Shimon Ullman,et al.  Recognizing solid objects by alignment with an image , 1990, International Journal of Computer Vision.

[32]  David W. Jacobs,et al.  Uncertainty Propagation in Model-Based Recognition , 1998, International Journal of Computer Vision.

[33]  Ronen Basri,et al.  Recognition Using Region Correspondences , 1997, International Journal of Computer Vision.

[34]  David T. Clemens Region-Based Feature Interpretation for Recognizing 3-D Models in 2-D Images , 1991 .

[35]  Nina Amenta,et al.  Bounded boxes, Hausdorff distance, and a new proof of an interesting Helly-type theorem , 1994, SCG '94.

[36]  Ernest L. Hall,et al.  Three-Dimensional Moment Invariants , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  David A. Forsyth,et al.  Recognizing algebraic surfaces from their outlines , 1993, Vision.

[38]  David A. Forsyth,et al.  Extracting projective structure from single perspective views of 3D point sets , 1993, 1993 (4th) International Conference on Computer Vision.

[39]  David W. Jacobs,et al.  Robust and Efficient Detection of Salient Convex Groups , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[40]  I. Rock The Logic of Perception , 1983 .

[41]  Ramakant Nevatia,et al.  Shape from Contour: Straight Homogeneous Generalized Cylinders and Constant Cross Section Generalized Cylinders , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[42]  David J. Kriegman,et al.  On Recognizing and Positioning Curved 3-D Objects from Image Contours , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[43]  Nina Amenta Finding a line transversal of axial objects in three dimensions , 1992, SODA '92.

[44]  Anthony P. Reeves,et al.  Three-Dimensional Shape Analysis Using Moments and Fourier Descriptors , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[45]  Jean Ponce,et al.  Invariant Properties of Straight Homogeneous Generalized Cylinders and Their Contours , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[46]  Ronen Basri,et al.  Recognition by Linear Combinations of Models , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[47]  Ming-Kuei Hu,et al.  Visual pattern recognition by moment invariants , 1962, IRE Trans. Inf. Theory.

[48]  Nimrod Megiddo Research Report FINDING A LINE OF SIGHT THRU BOXES IN d-SPACE IN LINEAR TIME , 1996 .

[49]  Ronen Basri,et al.  Matching Convex Polygons and Polyhedra, Allowing for Occlusion , 1996, WACG.

[50]  David W. Jacobs,et al.  Recognizing 3-D Objects Using 2-D Images , 1992 .

[51]  Raimund Seidel,et al.  Linear programming and convex hulls made easy , 1990, SCG '90.

[52]  Robert B. McGhee,et al.  Aircraft Identification by Moment Invariants , 1977, IEEE Transactions on Computers.

[53]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[54]  Ruzena Bajcsy,et al.  Recovery of Parametric Models from Range Images: The Case for Superquadrics with Global Deformations , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[55]  Azriel Rosenfeld,et al.  Recognition by Functional Parts , 1995, Comput. Vis. Image Underst..

[56]  Akihiro Sugimoto Object recognition by combining paraperspective images , 2004, International Journal of Computer Vision.

[57]  D. Marr,et al.  Representation and recognition of the spatial organization of three-dimensional shapes , 1978, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[58]  A. Pentland Recognition by Parts , 1987 .

[59]  Ronen Basri,et al.  Paraperspective ≡ affine , 1994, International Journal of Computer Vision.

[60]  Peter E. Hart,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[61]  Daniel P. Huttenlocher,et al.  Tracking non-rigid objects in complex scenes , 1993, 1993 (4th) International Conference on Computer Vision.