Object Detection from Large-Scale 3D Datasets Using Bottom-Up and Top-Down Descriptors

We propose an approach for detecting objects in large-scale range datasets that combines bottom-up and top-down processes. In the bottom-up stage, fast-to-compute local descriptors are used to detect potential target objects. The object hypotheses are verified after alignment in a top-down stage using global descriptors that capture larger scale structure information. We have found that the combination of spin images and Extended Gaussian Images, as local and global descriptors respectively, provides a good trade-off between efficiency and accuracy. We present results on real outdoors scenes containing millions of scanned points and hundreds of targets. Our results compare favorably to the state of the art by being applicable to much larger scenes captured under less controlled conditions, by being able to detect object classes and not specific instances, and by being able to align the query with the best matching model accurately, thus obtaining precise segmentation.

[1]  Thomas A. Funkhouser,et al.  Partial matching of 3D shapes with priority-driven search , 2006, SGP '06.

[2]  Martial Hebert,et al.  Rapid object indexing using locality sensitive hashing and joint 3D-signature space estimation , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Robert B. Fisher,et al.  Finding Surface Correspondance for Object Recognition and Registration Using Pairwise Geometric Histograms , 1998, ECCV.

[4]  Jiří Matas,et al.  Computer Vision - ECCV 2004 , 2004, Lecture Notes in Computer Science.

[5]  Bernd Neumann,et al.  Computer Vision — ECCV’98 , 1998, Lecture Notes in Computer Science.

[6]  Harry Shum,et al.  Directional histogram model for three-dimensional shape similarity , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[7]  Martial Hebert,et al.  Large data sets and confusing scenes in 3-D surface matching and recognition , 1999, Second International Conference on 3-D Digital Imaging and Modeling (Cat. No.PR00062).

[8]  Harpreet S. Sawhney,et al.  Shapeme histogram projection and matching for partial object recognition , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  D. Healy,et al.  Computing Fourier Transforms and Convolutions on the 2-Sphere , 1994 .

[10]  Andrew E. Johnson,et al.  Toward a General 3-D Matching Engine: Multiple Models, Complex Scenes, and Efficient Data Filtering , 1998 .

[11]  Linda G. Shapiro,et al.  Symbolic signatures for deformable shapes , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Christian Früh,et al.  Data Processing Algorithms for Generating Textured 3D Building Facade Meshes from Laser Scans and Camera Images , 2005, International Journal of Computer Vision.

[13]  Andrew E. Johnson,et al.  Using Spin Images for Efficient Object Recognition in Cluttered 3D Scenes , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Chitra Dorai,et al.  COSMOS - A Representation Scheme for 3D Free-Form Objects , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Bernard Chazelle,et al.  Shape distributions , 2002, TOGS.

[16]  Jitendra Malik,et al.  Recognizing Objects in Range Data Using Regional Point Descriptors , 2004, ECCV.

[17]  Paul J. Besl,et al.  A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Gérard G. Medioni,et al.  Structural hashing: efficient three dimensional object recognition , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[19]  Gérard G. Medioni,et al.  Structural Indexing: Efficient 3-D Object Recognition , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Szymon Rusinkiewicz,et al.  Rotation Invariant Spherical Harmonic Representation of 3D Shape Descriptors , 2003, Symposium on Geometry Processing.

[21]  Richard K. Beatson,et al.  Reconstruction and representation of 3D objects with radial basis functions , 2001, SIGGRAPH.

[22]  Berthold K. P. Horn Extended Gaussian images , 1984, Proceedings of the IEEE.

[23]  Katsushi Ikeuchi,et al.  The Complex EGI: A New Representation for 3-D Pose Determination , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  Martial Hebert,et al.  Parts-based 3D object classification , 2004, CVPR 2004.

[25]  Mi-Suen Lee,et al.  A Computational Framework for Segmentation and Grouping , 2000 .

[26]  Katsushi Ikeuchi,et al.  A Spherical Representation for Recognition of Free-Form Surfaces , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  Kostas Daniilidis,et al.  Fully Automatic Registration of 3D Point Clouds , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[28]  Mohammed Bennamoun,et al.  Three-Dimensional Model-Based Object Recognition and Segmentation in Cluttered Scenes , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Ruzena Bajcsy,et al.  Recovery of Parametric Models from Range Images: The Case for Superquadrics with Global Deformations , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  Sunil Arya,et al.  An optimal algorithm for approximate nearest neighbor searching fixed dimensions , 1998, JACM.