Robotic Grasping of Novel Objects using Vision

We consider the problem of grasping novel objects, specifically objects that are being seen for the first time through vision. Grasping a previously unknown object, one for which a 3-d model is not available, is a challenging problem. Furthermore, even if given a model, one still has to decide where to grasp the object. We present a learning algorithm that neither requires nor tries to build a 3-d model of the object. Given two (or more) images of an object, our algorithm attempts to identify a few points in each image corresponding to good locations at which to grasp the object. This sparse set of points is then triangulated to obtain a 3-d location at which to attempt a grasp. This is in contrast to standard dense stereo, which tries to triangulate every single point in an image (and often fails to return a good 3-d model). Our algorithm for identifying grasp locations from an image is trained by means of supervised learning, using synthetic images for the training set. We demonstrate this approach on two robotic manipulation platforms. Our algorithm successfully grasps a wide variety of objects, such as plates, tape rolls, jugs, cellphones, keys, screwdrivers, staplers, a thick coil of wire, a strangely shaped power horn and others, none of which were seen in the training set. We also apply our method to the task of unloading items from dishwashers.
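The triangulation step described above can be illustrated with a minimal sketch. It assumes each camera's centre and the viewing ray through the predicted grasp point are already known in a common world frame, and it uses the midpoint method (the point halfway between the closest points on the two rays). This is one standard way to triangulate and is not necessarily the method used in the paper. The function name `triangulate_midpoint` and the example coordinates are hypothetical.

```python
def _dot(u, v):
    """Dot product of two 3-vectors given as sequences."""
    return sum(a * b for a, b in zip(u, v))


def triangulate_midpoint(c1, d1, c2, d2):
    """Estimate a 3-d point from two viewing rays (midpoint method).

    c1, c2: camera centres; d1, d2: ray directions through the image
    points predicted as grasp locations (all hypothetical inputs).
    Returns the midpoint of the shortest segment joining the two rays.
    """
    w = [a - b for a, b in zip(c1, c2)]
    a, b, c = _dot(d1, d1), _dot(d1, d2), _dot(d2, d2)
    d, e = _dot(d1, w), _dot(d2, w)
    denom = a * c - b * b
    if abs(denom) < 1e-12:
        raise ValueError("rays are (nearly) parallel; cannot triangulate")
    s = (b * e - c * d) / denom  # parameter along ray 1
    t = (a * e - b * d) / denom  # parameter along ray 2
    p1 = [ci + s * di for ci, di in zip(c1, d1)]
    p2 = [ci + t * di for ci, di in zip(c2, d2)]
    return [(x + y) / 2.0 for x, y in zip(p1, p2)]


# Two cameras one unit apart, both looking at the same grasp point.
target = [0.5, 0.2, 2.0]
cam1, cam2 = [0.0, 0.0, 0.0], [1.0, 0.0, 0.0]
ray1 = [p - c for p, c in zip(target, cam1)]
ray2 = [p - c for p, c in zip(target, cam2)]
grasp_point = triangulate_midpoint(cam1, ray1, cam2, ray2)
```

Because only a few candidate grasp points are triangulated, each one needs just a single pair of rays, whereas dense stereo must find a correspondence for every pixel.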