Learning to reach into the unknown: Selecting initial conditions when reaching in clutter

Often in highly-cluttered environments, a robot can observe the exterior of the environment with ease, but cannot directly view nor easily infer its detailed internal structure (e.g., dense foliage or a full refrigerator shelf). We present a data-driven approach that greatly improves a robot's success at reaching to a goal location in the unknown interior of an environment based on observable external properties, such as the category of the clutter and the locations of openings into the clutter (i.e., apertures). We focus on the problem of selecting a good initial configuration for a manipulator when reaching with a greedy controller. We use density estimation to model the probability of a successful reach given an initial condition and then perform constrained optimization to find an initial condition with the highest estimated probability of success. We evaluate our approach with two simulated robots reaching in clutter, and provide a demonstration with a real PR2 robot reaching to locations through random apertures. In our evaluations, our approach significantly outperformed two alternative approaches when making two consecutive reach attempts to goals in distinct categories of unknown clutter. Our approach only uses sparse readily-apparent features.

[1]  Siddhartha S. Srinivasa,et al.  A data-driven statistical framework for post-grasp manipulation , 2014, Int. J. Robotics Res..

[2]  Marc Toussaint,et al.  Trajectory prediction: learning to map situations to robot trajectories , 2009, ICML '09.

[3]  Jorge Nocedal,et al.  Algorithm 778: L-BFGS-B: Fortran subroutines for large-scale bound-constrained optimization , 1997, TOMS.

[4]  Siddhartha S. Srinivasa,et al.  A Framework for Push-Grasping in Clutter , 2011, Robotics: Science and Systems.

[5]  Hugh F. Durrant-Whyte,et al.  Gaussian Process modeling of large scale terrain , 2009, 2009 IEEE International Conference on Robotics and Automation.

[6]  Dmitry Berenson,et al.  A robot path planning framework that learns from experience , 2012, 2012 IEEE International Conference on Robotics and Automation.

[7]  Leslie Pack Kaelbling,et al.  Grasping POMDPs , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[8]  Pieter Abbeel,et al.  Apprenticeship learning via inverse reinforcement learning , 2004, ICML.

[9]  James M. Rehg,et al.  Rapid categorization of object properties from incidental contact with a tactile sensing robot arm , 2013, 2013 13th IEEE-RAS International Conference on Humanoid Robots (Humanoids).

[10]  Siddhartha S. Srinivasa,et al.  Learning from Experience in Manipulation Planning: Setting the Right Goals , 2011, ISRR.

[11]  Brett Browning,et al.  A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..

[12]  Siddhartha S. Srinivasa,et al.  Planning pre-grasp manipulation for transport tasks , 2010, 2010 IEEE International Conference on Robotics and Automation.

[13]  Satoshi Kagami,et al.  Planning and Executing Navigation Among Movable Obstacles , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[14]  Advait Jain,et al.  Manipulation in Clutter with Whole-Arm Tactile Sensing , 2013, ArXiv.

[15]  Leila Takayama,et al.  Strategies for human-in-the-loop robotic grasping , 2012, 2012 7th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[16]  Martial Hebert,et al.  Contextual Sequence Prediction with Application to Control Library Optimization , 2012, Robotics: Science and Systems.