Three-Dimensional Model-Based Object Recognition and Segmentation in Cluttered Scenes

Viewpoint-independent recognition of free-form objects and their segmentation in the presence of clutter and occlusions is a challenging task. We present a novel 3D model-based algorithm which performs this task automatically and efficiently. A 3D model of an object is automatically constructed offline from multiple unordered range images (views) of the object. These views are converted into multidimensional table representations (which we refer to as tensors). Correspondences are automatically established between these views by simultaneously matching the tensors of a view with those of the remaining views using a hash table-based voting scheme. This results in a graph of relative transformations, which is used to register the views before they are integrated into a seamless 3D model. These models and their tensor representations constitute the model library. During online recognition, a tensor from the scene is simultaneously matched with those in the library by casting votes. Similarity measures are calculated for the model tensors which receive the most votes. The model with the highest similarity is transformed to the scene and, if it aligns accurately with an object in the scene, that object is declared recognized and is segmented. This process is repeated until the scene is completely segmented. Experiments were performed on real and synthetic data comprising 55 models and 610 scenes, and an overall recognition rate of 95 percent was achieved. Comparison with spin images revealed that our algorithm is superior in terms of both recognition rate and efficiency.
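The vote-then-verify matching step described above can be illustrated with a minimal Python sketch. It is not the authors' implementation and rests on several assumptions made only for illustration: each tensor is reduced to a set of occupied (i, j, k) bins, the hash table is keyed directly on those bin indices, and the similarity measure is a plain Jaccard overlap. The function names and the model names "chef" and "rhino" are hypothetical.

```python
from collections import Counter, defaultdict


def build_hash_table(library):
    """Index library tensors by occupied bin.

    library maps (model_id, tensor_id) -> set of occupied (i, j, k) bins.
    This sparse-set encoding of a tensor is an assumption of the sketch.
    """
    table = defaultdict(list)
    for key, bins in library.items():
        for b in bins:
            table[b].append(key)
    return table


def cast_votes(table, scene_bins):
    """Each occupied scene bin votes for every library tensor sharing it."""
    votes = Counter()
    for b in scene_bins:
        for key in table.get(b, ()):
            votes[key] += 1
    return votes


def jaccard(a, b):
    """Stand-in similarity measure (assumed; not taken from the paper)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def best_match(library, table, scene_bins, top_k=3):
    """Score only the top-voted tensors and return the most similar one."""
    votes = cast_votes(table, scene_bins)
    candidates = [key for key, _ in votes.most_common(top_k)]
    if not candidates:
        return None
    return max(candidates, key=lambda key: jaccard(library[key], scene_bins))


# Toy library with hypothetical model names.
library = {
    ("chef", 0): {(0, 0, 0), (1, 0, 0), (1, 1, 0), (2, 1, 1)},
    ("chef", 1): {(0, 0, 0), (0, 1, 0), (0, 2, 0)},
    ("rhino", 0): {(3, 3, 3), (3, 4, 3), (4, 4, 4)},
}
table = build_hash_table(library)
scene = {(0, 0, 0), (1, 0, 0), (1, 1, 0), (5, 5, 5)}
match = best_match(library, table, scene)
```

Voting keeps the cost of a query proportional to the number of occupied scene bins rather than to the size of the library, and the more expensive similarity measure is evaluated only for the few top-voted candidates. In the pipeline the abstract describes, the winning model would then be transformed to the scene and accepted only if it aligns accurately; that verification step is omitted here.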
