Efficient shape and pose recovery of unknown objects from three camera views

Knowing the shape and pose of objects of interest is critical information when planning robotic grasping and manipulation maneuvers. The ability to recover this information from objects for which there is no a priori knowledge is a valuable behavior for an autonomous or semi-autonomous robot. In this paper, we present methods and algorithms for the shape and pose recovery of unknown objects using no a priori information. Three camera views are taken of the object from three mutually orthogonal directions. Using shape from silhouettes, a point surface reconstruction of the object is performed and a superquadric is fit to the resulting array of points. We show through simulation and experiment that this method yields reconstruction accuracy sufficient for planning and executing grasping maneuvers. In addition to novel contributions, we have made performance improvements to recently published algorithms.

[1]  J. Sijs,et al.  Vision-based control of the Manus using SIFT , 2007, 2007 IEEE 10th International Conference on Rehabilitation Robotics.

[2]  NoborioHiroshi,et al.  Construction of the Octree Approximating a Three-Dimensional Object by Using Multiple Views , 1988 .

[3]  Kimitoshi Yamazaki,et al.  A grasp planning for picking up an unknown object for a mobile manipulator , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[4]  Kimitoshi Yamazaki,et al.  3-D object modeling by a camera equipped on a mobile robot , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[5]  Franc Solina,et al.  Segmentation and Recovery of Superquadrics , 2000, Computational Imaging and Vision.

[6]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[7]  Eric Jones,et al.  SciPy: Open Source Scientific Tools for Python , 2001 .

[8]  Ashutosh Saxena,et al.  Robotic Grasping of Novel Objects using Vision , 2008, Int. J. Robotics Res..

[9]  Azriel Rosenfeld,et al.  Computer Vision , 1988, Adv. Comput..

[10]  Bradley N. Miller,et al.  The Python Programming Language , 2006 .

[11]  Dae-Jin Kim,et al.  Eye-in-hand stereo visual servoing of an assistive robot arm in unstructured environments , 2009, 2009 IEEE International Conference on Robotics and Automation.

[12]  C. Dyer Volumetric Scene Reconstruction from Multiple Views , 2001 .

[13]  Markus Vincze,et al.  Rethinking Robot Vision – Combining Shape and Appearance , 2007 .

[14]  Jorge Nocedal,et al.  Algorithm 778: L-BFGS-B: Fortran subroutines for large-scale bound-constrained optimization , 1997, TOMS.

[15]  Richard Szeliski,et al.  Rapid octree construction from image sequences , 1993 .

[16]  A. Laurentini,et al.  The Visual Hull Concept for Silhouette-Based Image Understanding , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Henrik I. Christensen,et al.  Automatic grasp planning using shape primitives , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[18]  David Suter,et al.  Robot manipulation grasping of recognized objects for assistive technology support using stereo vision. , 2008, ICRA 2008.

[19]  Arun K. Pujari,et al.  Volume intersection with optimal set of directions , 1991, Pattern Recognit. Lett..

[20]  H. Nguyen El-E: An Assistive Robot that Fetches Objects from Flat Surfaces , 2008 .

[21]  Hiroshi Noborio,et al.  Construction of the Octree Approximating a Three-Dimensional Object by Using Multiple Views , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Vincenzo Lippiello,et al.  Surface model reconstruction of 3D objects from multiple views , 2009, 2009 IEEE International Conference on Robotics and Automation.

[23]  Linda G. Shapiro,et al.  Computer Vision , 2001 .

[24]  Guido van Rossum,et al.  Python Programming Language , 2007, USENIX Annual Technical Conference.

[25]  Danica Kragic,et al.  Vision for robotic object manipulation in domestic settings , 2005, Robotics Auton. Syst..