Combining depth and gray images for fast 3D object recognition

Reliable and stable visual perception systems are needed for humanoid robotic assistants to perform complex grasping and manipulation tasks. The recognition of the object and its precise 6D pose are required. This paper addresses the challenge of detecting and positioning a textureless known object, by estimating its complete 6D pose in cluttered scenes. A 3D perception system is proposed in this paper, which can robustly recognize CAD models in cluttered scenes for the purpose of grasping with a mobile manipulator. Our approach uses a powerful combination of two different camera technologies, Time-Of-Flight (TOF) and RGB, to segment the scene and extract objects. Combining the depth image and gray image to recognize instances of a 3D object in the world and estimate their 3D poses. The full pose estimation process is based on depth images segmentation and an efficient shape-based matching. At first, the depth image is used to separate the supporting plane of objects from the cluttered background. Thus, cluttered backgrounds are circumvented and the search space is extremely reduced. And a hierarchical model based on the geometry information of a priori CAD model of the object is generated in the offline stage. Then using the hierarchical model we perform a shape-based matching in 2D gray images. Finally, we validate the proposed method in a number of experiments. The results show that utilizing depth and gray images together can reach the demand of a time-critical application and reduce the error rate of object recognition significantly.

[1]  Markus Ulrich,et al.  Combining Scale-Space and Similarity-Based Aspect Graphs for Fast 3D Object Recognition , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Linda G. Shapiro,et al.  3D Object Recognition and Pose with Relational Indexing , 2000, Comput. Vis. Image Underst..

[3]  Lucas Paletta,et al.  Appearance-based active object recognition , 2000, Image Vis. Comput..

[4]  Robert D. Schiffenbauer A Survey of Aspect Graphs , 2001 .

[5]  Nicholas Roy,et al.  Recognition and Pose Estimation of Rigid Transparent Objects with a Kinect Sensor , 2013 .

[6]  Markus Ulrich,et al.  CAD-based recognition of 3D objects in monocular images , 2009, 2009 IEEE International Conference on Robotics and Automation.

[7]  Robert O. Ambrose,et al.  Robonaut 2 - The first humanoid robot in space , 2011, 2011 IEEE International Conference on Robotics and Automation.

[8]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[9]  Galia V. Tzvetkova LIFE ROBONAUT 2 : MISSION , TECHNOLOGIES , 2014 .

[10]  Julia Badger,et al.  Robonaut 2 on the International Space Station: Status Update and Preparations for IVA Mobility , 2013 .

[11]  Nassir Navab,et al.  N3M: Natural 3D Markers for Real-Time Object Detection and Pose Estimation , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[12]  Kostas Daniilidis,et al.  Single image 3D object detection and pose estimation for grasping , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[13]  Benjamin B. Kimia,et al.  A Similarity-Based Aspect-Graph Approach to 3D Object Recognition , 2004, International Journal of Computer Vision.

[14]  Philip David,et al.  Object recognition in high clutter images using line features , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[15]  Dmitry B. Goldgof,et al.  The scale space aspect graph , 1992, CVPR.