Robust object localization based on error patterns learning for dexterous mobile manipulation

In this article we describe an approach for object detection and pose estimation from stereo RGB frames for robot manipulation in manufacturing scenarios. This solution was developed in the framework of the second challenge of the EuRoC project, and meets the need of a registration method invariant to the view perspective and robust to the structural symmetries and ambiguities of the target objects. Our contribution consists of automatic correction of sub-optimal results of registration algorithms. As most registration algorithms only converge on local optima, a tool for recognizing and correcting wrong alignments is highly desirable. Our insight is that, for a given target point cloud, it is important to study the alignment space offline and identify sub-optimal solutions before the registration. The convergence of the algorithm leads to the error pattern knowledge that can be used to discard the wrong solutions, and recover the correct alignment. Experiments on synthesized and real data show that exploiting the known information about the spatial properties of the objects, together with appropriate pre-processing and refining of the data, we can have a substantial improvement in discarding wrong hypothesis for geometrically ambiguous items.

[1]  Giorgio Grisetti,et al.  NICP: Dense normal based point cloud registration , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[2]  H. Hirschmüller Accurate and Efficient Stereo Processing by Semi-Global Matching and Mutual Information , 2005, CVPR.

[3]  Radu Bogdan Rusu,et al.  3D is here: Point Cloud Library (PCL) , 2011, 2011 IEEE International Conference on Robotics and Automation.

[4]  Dah-Jye Lee,et al.  Review of stereo vision algorithms and their suitability for resource-limited systems , 2013, Journal of Real-Time Image Processing.

[5]  Stephan Hussmann,et al.  A Performance Review of 3D TOF Vision Systems in Comparison to Stereo Vision Systems , 2008 .

[6]  Pavel Krsek,et al.  The Trimmed Iterative Closest Point algorithm , 2002, Object recognition supported by user interaction for service robots.

[7]  Paul J. Besl,et al.  Method for registration of 3-D shapes , 1992, Other Conferences.

[8]  Don Ray Murray,et al.  Using Real-Time Stereo Vision for Mobile Robot Navigation , 2000, Auton. Robots.

[9]  Bruno Siciliano,et al.  EuRoC - The Challenge Initiative for European Robotics , 2014, ISR 2014.

[10]  Andrew W. Fitzgibbon,et al.  KinectFusion: real-time 3D reconstruction and interaction using a moving depth camera , 2011, UIST.

[11]  Shi-Min Hu,et al.  3D indoor scene modeling from RGB-D data: a survey , 2015, Computational Visual Media.

[12]  Maarten Weyn,et al.  A Survey of Rigid 3D Pointcloud Registration Algorithms , 2014 .

[13]  Roland Siegwart,et al.  A Review of Point Cloud Registration Algorithms for Mobile Robotics , 2015, Found. Trends Robotics.

[14]  Marc Levoy,et al.  Efficient variants of the ICP algorithm , 2001, Proceedings Third International Conference on 3-D Digital Imaging and Modeling.

[15]  Wolfram Burgard,et al.  Real-time 3D visual SLAM with a hand-held camera , 2011 .

[16]  Roland Siegwart,et al.  Comparing ICP variants on real-world data sets , 2013, Auton. Robots.

[17]  Paul J. Besl,et al.  A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..