Voxelized Shape and Color Histograms for RGB-D

Real world environments typically include objects with different perceptual appearances. A household, for example, includes textured, textureless and even partially transparent objects. While perception methods exist that work well on one such class of objects at a time, the perception of various classes of objects in a scene is still a challenge. It is our view that the construction of a descriptor that takes both color and shape into account, thereby fostering high discriminating power, will help to solve this problem. In this paper we present an approach that is capable of efficiently capturing both the geometric and visual appearance of common objects of daily use into one feature. We showcase this feature’s applicability for the purpose of classifying objects in cluttered scenes with obstructions, and we evaluate two classification approaches. In our experiments we make use of Kinect, a new RGB-D device, and build a database of 63 objects. Preliminary results on novel views show recognition rates of 72.2%.

[1]  Barbara Caputo,et al.  How to Combine Color and Shape Information for 3D Object Recognition: Kernels do the Trick , 2002, NIPS.

[2]  Hong Qin,et al.  Surface completion for shape and appearance , 2006, The Visual Computer.

[3]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[4]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[5]  Shigeaki Watanabe,et al.  Subspace method to pattern recognition , 1973 .

[6]  Nico Blodow,et al.  Fast Point Feature Histograms (FPFH) for 3D registration , 2009, 2009 IEEE International Conference on Robotics and Automation.

[7]  Yasuo Kuniyoshi,et al.  Fast object detection for robots in a cluttered indoor environment using integral 3D feature table , 2011, 2011 IEEE International Conference on Robotics and Automation.

[8]  Gary R. Bradski,et al.  Fast 3D recognition and pose using the Viewpoint Feature Histogram , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[9]  Yasuo Kuniyoshi,et al.  High-speed 3D object recognition using additive features in a linear subspace , 2010, 2010 IEEE International Conference on Robotics and Automation.

[10]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[11]  Michael Beetz,et al.  Real-time perception-guided motion planning for a personal robot , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[12]  Nicola Orio,et al.  Retrieval of Colored 3D Models , 2006, Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT'06).

[13]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[14]  Andrew Zisserman,et al.  Representing shape with a spatial pyramid kernel , 2007, CIVR '07.

[15]  Nico Blodow,et al.  General 3D modelling of novel objects from a single view , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[16]  N. Otsu,et al.  Action and simultaneous multiple-person identification using cubic higher-order local auto-correlation , 2004, ICPR 2004.

[17]  Andrew E. Johnson,et al.  Using Spin Images for Efficient Object Recognition in Cluttered 3D Scenes , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Zoltan-Csaba Marton,et al.  Hierarchical object geometric categorization and appearance classification for mobile manipulation , 2010, 2010 10th IEEE-RAS International Conference on Humanoid Robots.

[19]  Takumi Kobayashi,et al.  Action and simultaneous multiple-person identification using cubic higher-order local auto-correlation , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[20]  Patrick Haffner,et al.  Support vector machines for histogram-based image classification , 1999, IEEE Trans. Neural Networks.

[21]  Jörg Stückler,et al.  Combining depth and color cues for scale- and viewpoint-invariant object segmentation and recognition using Random Forests , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[22]  Adrian Hilton,et al.  Shape-Colour Histograms for matching 3D video sequences , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.