Near Real-time Object Detection in RGBD Data

Most methods of object detection with RGBD cameras set hard constraints on their operational area. They only work with specific objects, in specific environments, or rely on time consuming computations. In the context of home robotics, such hard constraints cannot be made. Specifically, an autonomous home robot shall be equipped with an object detection pipeline that runs in near real-time and produces reliable results without restricting object type and environment. For this purpose, a baseline framework that works on RGB data only is extended by suitable depth features that are selected on the basis of a comparative evaluation. The additional depth data is further exploited to reduce the computational cost of the detection algorithm. A final evaluation of the enhanced framework shows significant improvements compared to its original version and state-of-the-art methods in terms of both, detection performance and real-time capability.

[1]  Wei Wang,et al.  Fast object recognition and 6D pose estimation using viewpoint oriented color-shape histogram , 2013, 2013 IEEE International Conference on Multimedia and Expo (ICME).

[2]  Gary R. Bradski,et al.  Fast 3D recognition and pose using the Viewpoint Feature Histogram , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[3]  Jonathan T. Barron,et al.  A category-level 3-D object dataset: Putting the Kinect to work , 2011, ICCV Workshops.

[4]  Dieter Fox,et al.  Unsupervised Feature Learning for RGB-D Based Object Recognition , 2012, ISER.

[5]  Federico Tombari,et al.  Object Recognition in 3D Scenes with Occlusions and Clutter by Hough Voting , 2010, 2010 Fourth Pacific-Rim Symposium on Image and Video Technology.

[6]  M. Vincze,et al.  BLORT-The Blocks World Robotic Vision Toolbox , 2010 .

[7]  Dieter Fox,et al.  A Scalable Tree-Based Approach for Joint Object and Pose Recognition , 2011, AAAI.

[8]  Camille Couprie,et al.  Learning Hierarchical Features for Scene Labeling , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  James M. Keller,et al.  Histogram of Oriented Normal Vectors for Object Recognition with a Depth Sensor , 2012, ACCV.

[10]  Vincent Lepetit,et al.  Model Based Training, Detection and Pose Estimation of Texture-Less 3D Objects in Heavily Cluttered Scenes , 2012, ACCV.

[11]  Bernt Schiele,et al.  An Implicit Shape Model for Combined Object Categorization and Segmentation , 2006, Toward Category-Level Object Recognition.

[12]  Vincent Lepetit,et al.  Dominant orientation templates for real-time detection of texture-less objects , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[13]  Nico Blodow,et al.  Fast Point Feature Histograms (FPFH) for 3D registration , 2009, 2009 IEEE International Conference on Robotics and Automation.

[14]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[15]  Dieter Fox,et al.  A large-scale hierarchical multi-view RGB-D object dataset , 2011, 2011 IEEE International Conference on Robotics and Automation.

[16]  Vincent Lepetit,et al.  Multimodal templates for real-time detection of texture-less objects in heavily cluttered scenes , 2011, 2011 International Conference on Computer Vision.

[17]  Luc Van Gool,et al.  Hough Transform and 3D SURF for Robust Three Dimensional Classification , 2010, ECCV.

[18]  Dana H. Ballard,et al.  Generalizing the Hough transform to detect arbitrary shapes , 1981, Pattern Recognit..

[19]  Yann LeCun,et al.  Indoor Semantic Segmentation using depth information , 2013, ICLR.

[20]  Tinne Tuytelaars,et al.  Discriminatively Trained Templates for 3D Object Detection: A Real Time Scalable Approach , 2013, 2013 IEEE International Conference on Computer Vision.

[21]  Jitendra Malik,et al.  Learning Rich Features from RGB-D Images for Object Detection and Segmentation , 2014, ECCV.

[22]  Alexander Verl,et al.  Evaluation of 3D feature descriptors for classification of surface geometries in point clouds , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[23]  Dieter Fox,et al.  Learning hierarchical sparse features for RGB-(D) object recognition , 2014, Int. J. Robotics Res..

[24]  Luc Van Gool,et al.  An Introduction to Random Forests for Multi-class Object Detection , 2011, Theoretical Foundations of Computer Vision.

[25]  Marc Levoy,et al.  Efficient variants of the ICP algorithm , 2001, Proceedings Third International Conference on 3-D Digital Imaging and Modeling.

[26]  Damien Vergnaud Efficient and Secure Generalized Pattern Matching via Fast Fourier Transform , 2011, AFRICACRYPT.

[27]  Sven Behnke,et al.  Depth-Enhanced Hough Forests for Object-Class Detection and Continuous Pose Estimation , 2013 .