Textured/textureless object recognition and pose estimation using RGB-D image

In this paper, we propose a novel global object descriptor, so-called Viewpoint oriented Color-Shape Histogram (VCSH), which combines 3D object’s color and shape features. The descriptor is efficiently used in a real-time textured/textureless object recognition and 6D pose estimation system, while also applied for object localization in a coherent semantic map. We build the object model first by registering from multi-view color point clouds, and generate partial-view object color point clouds from different synthetic viewpoints. Thereafter, the extracted color and shape features are correlated as a VCSH to represent the corresponding object patch data. For object recognition, the object can be identified and its initial pose is estimated through matching within our built database. Afterwards the object pose can be optimized by utilizing an iterative closest point strategy. Therefore, all the objects in the observed area are finally recognized and their corresponding accurate poses are retrieved. We validate our approach through a large number of experiments, including daily complex scenarios and indoor semantic mapping. Our method is proven to be efficient by guaranteeing high object recognition rate, accurate pose estimation result as well as exhibiting the capability of dealing with environmental illumination changes.

[1]  Markus H. Gross,et al.  Multi‐scale Feature Extraction on Point‐Sampled Surfaces , 2003, Comput. Graph. Forum.

[2]  Aly A. Farag,et al.  CSIFT: A SIFT Descriptor with Color Invariant Characteristics , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[3]  G W Humphreys,et al.  Varieties of Object Constancy , 1989, The Quarterly journal of experimental psychology. A, Human experimental psychology.

[4]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[5]  Theo Gevers,et al.  Robust histogram construction from color invariants for object recognition , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Wei Wang,et al.  Fast object recognition and 6D pose estimation using viewpoint oriented color-shape histogram , 2013, 2013 IEEE International Conference on Multimedia and Expo (ICME).

[7]  I. Biederman,et al.  Recognizing depth-rotated objects: evidence and conditions for three-dimensional viewpoint invariance. , 1993, Journal of experimental psychology. Human perception and performance.

[8]  Gary R. Bradski,et al.  Fast 3D recognition and pose using the Viewpoint Feature Histogram , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[9]  Pieter Abbeel,et al.  A textured object recognition pipeline for color and depth image data , 2012, 2012 IEEE International Conference on Robotics and Automation.

[10]  Thomas A. Funkhouser,et al.  Selecting Distinctive 3D Shape Descriptors for Similarity Retrieval , 2006, IEEE International Conference on Shape Modeling and Applications 2006 (SMI'06).

[11]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[12]  S. Edelman,et al.  Orientation dependence in the recognition of familiar and novel views of three-dimensional objects , 1992, Vision Research.

[13]  Martin Rumpf,et al.  Robust feature detection and local classification for surfaces based on moment analysis , 2004, IEEE Transactions on Visualization and Computer Graphics.

[14]  Wei Wang,et al.  Articulated Object Modeling based on Visual and Haptic Observations , 2013, VISAPP.

[15]  Federico Tombari,et al.  Unique Signatures of Histograms for Local Surface Description , 2010, ECCV.

[16]  Yasuo Kuniyoshi,et al.  Partial matching of real textured 3D objects using color cubic higher-order local auto-correlation features , 2010, The Visual Computer.

[17]  Leonidas J. Guibas,et al.  Probabilistic fingerprints for shapes , 2006, SGP '06.

[18]  I. Biederman,et al.  Evidence for Complete Translational and Reflectional Invariance in Visual Object Priming , 1991, Perception.

[19]  Wei Wang,et al.  A coherent semantic mapping system based on parametric environment abstraction and 3D object localization , 2013, 2013 European Conference on Mobile Robots.

[20]  Leonidas J. Guibas,et al.  Robust global registration , 2005, SGP '05.

[21]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[22]  Bao-Liang Lu,et al.  Fast recognition of multi-view faces with feature selection , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[23]  Markus Vincze,et al.  Ensemble of shape functions for 3D object classification , 2011, 2011 IEEE International Conference on Robotics and Biomimetics.

[24]  Arnold W. M. Smeulders,et al.  Color-based object recognition , 1997, Pattern Recognit..

[25]  Nico Blodow,et al.  Fast Point Feature Histograms (FPFH) for 3D registration , 2009, 2009 IEEE International Conference on Robotics and Automation.

[26]  Jiebo Luo,et al.  Robust color object detection using spatial-color joint probability functions , 2004, CVPR 2004.

[27]  Vincent Lepetit,et al.  Model Based Training, Detection and Pose Estimation of Texture-Less 3D Objects in Heavily Cluttered Scenes , 2012, ACCV.

[28]  Bülent Sankur,et al.  Multivariate Density-Based 3D Shape Descriptors , 2007, IEEE International Conference on Shape Modeling and Applications 2007 (SMI '07).

[29]  K. S. Arun,et al.  Least-Squares Fitting of Two 3-D Point Sets , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Cordelia Schmid,et al.  Flexible Object Models for Category-Level 3D Object Recognition , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[31]  Binoy Pinto,et al.  Speeded Up Robust Features , 2011 .

[32]  Jiebo Luo,et al.  Color object detection using spatial-color joint probability functions , 2004, IEEE Transactions on Image Processing.

[33]  Yasuo Kuniyoshi,et al.  Voxelized Shape and Color Histograms for RGB-D , 2011, IROS 2011.

[34]  Luc Van Gool,et al.  Towards Multi-View Object Class Detection , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[35]  I Biederman,et al.  Size Invariance in Visual Object Priming of Gray-Scale Images , 1995, Perception.

[36]  Wolfram Burgard,et al.  Improved Techniques for Grid Mapping With Rao-Blackwellized Particle Filters , 2007, IEEE Transactions on Robotics.

[37]  Zhengyou Zhang,et al.  Iterative point matching for registration of free-form curves and surfaces , 1994, International Journal of Computer Vision.

[38]  Cohen-OrDaniel,et al.  Computing and Rendering Point Set Surfaces , 2003 .

[39]  Ilan Shimshoni,et al.  On edge detection on surfaces , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[40]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[41]  Marc Alexa,et al.  Computing and Rendering Point Set Surfaces , 2003, IEEE Trans. Vis. Comput. Graph..

[42]  Shamik Sural,et al.  A Histogramm with Perceptually Smooth Color Transition for Image Retrieval , 2002, JCIS.

[43]  Vincent Lepetit,et al.  Model Based Training, Detection and Pose Estimation of Texture-Less 3D Objects in Heavily Cluttered Scenes , 2012, ACCV.

[44]  Uwe D. Hanebeck,et al.  Density trees for efficient nonlinear state estimation , 2010, 2010 13th International Conference on Information Fusion.

[45]  Sandra Hirche,et al.  Real-time human body motion estimation based on multi-layer laser scans , 2011, 2011 8th International Conference on Ubiquitous Robots and Ambient Intelligence (URAI).

[46]  Federico Tombari,et al.  A combined texture-shape descriptor for enhanced 3D feature matching , 2011, 2011 18th IEEE International Conference on Image Processing.

[47]  Cordelia Schmid,et al.  Viewpoint-independent object class detection using 3D Feature Maps , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[48]  Henrik I. Christensen,et al.  3D pose estimation of daily objects using an RGB-D camera , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.