Interoperable vision component for object detection and 3D pose estimation for modularized robot control

Abstract Finding objects and tracking their poses are essential functions for service robots, in order to manipulate objects and interact with humans. We present novel algorithms for local feature matching for object detection, and 3D pose estimation. Our feature matching algorithm takes advantage of local geometric consistency for better performance, and the new 3D pose estimation algorithm solves the pose in a closed-form using homography, followed by a non-linear optimization step for stability. Advantages of our approach include better performance, minimal prior knowledge for the target pattern, and easy implementation and portability as a modularized software component. We have implemented our approach along with both CPU and GPU-based feature extraction, and built an interoperable component that can be used in any Robot Technology (RT)-based control system. Experiment shows that our approach produces very robust results for the estimated 3D pose, and maintain very low false positive rate. It is also fast enough to be used in on-line applications. We integrated our vision component in an autonomous robot system with a search-and-grasp task, and tested it with several objects that are found in ordinary domestic environment. We present the details of our approach, the design of our modular component design, and the results of the experiments in this paper.

[1]  Tamim Asfour,et al.  Accurate shape-based 6-DoF pose estimation of single-colored objects , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[2]  David G. Lowe,et al.  What and Where: 3D Object Recognition with Accurate Pose , 2006, Toward Category-Level Object Recognition.

[3]  Andrea Vedaldi,et al.  Vlfeat: an open and portable library of computer vision algorithms , 2010, ACM Multimedia.

[4]  Cordelia Schmid,et al.  3D object modeling and recognition using affine-invariant patches and multi-view spatial constraints , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[5]  Andrew Calway,et al.  Efficient visual odometry using a structure-driven temporal map , 2012, 2012 IEEE International Conference on Robotics and Automation.

[6]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[7]  Michael Isard,et al.  CONDENSATION—Conditional Density Propagation for Visual Tracking , 1998, International Journal of Computer Vision.

[8]  Sukhan Lee,et al.  Real-time 3D object pose estimation and tracking for natural landmark based visual servo , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[9]  Carlo Tomasi,et al.  Good features to track , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Herman Bruyninckx,et al.  Open robot control software: the OROCOS project , 2001, Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164).

[11]  Stepán Obdrzálek,et al.  Object Recognition using Local Affine Frames on Distinguished Regions , 2002, BMVC.

[12]  Niko Sünderhauf,et al.  COMPARING SEVERAL IMPLEMENTATIONS OF TWO RECENTLY PUBLISHED FEATURE DETECTORS , 2007 .

[13]  Tamim Asfour,et al.  Stereo-based 6D object localization for grasping with humanoid robot systems , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[14]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[15]  Carlos Sagüés,et al.  Fast pose estimation for visual navigation using homographies , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[16]  Nobuyuki Otera,et al.  ORiN: Open Robot Interface for the Network , 2002 .