Tactile Grasp Refinement using Deep Reinforcement Learning and Analytic Grasp Stability Metrics

Reward functions are at the heart of every reinforcement learning (RL) algorithm. In robotic grasping, rewards are often complex and manually engineered functions that do not rely on well-justified physical models from grasp analysis. This work demonstrates that analytic grasp stability metrics constitute powerful optimization objectives for RL algorithms that refine grasps on a three-fingered hand using only tactile and joint position information. We outperform a binary-reward baseline by 42.9% and find that a combination of geometric and force-agnostic grasp stability metrics yields the highest average success rates of 95.4% for cuboids, 93.1% for cylinders, and 62.3% for spheres across wrist position errors between 0 and 7 centimeters and rotational errors between 0 and 14 degrees. In a second experiment, we show that grasp refinement algorithms trained with contact feedback (contact positions, normals, and forces) perform up to 6.6% better than a baseline that receives no tactile information.

[1]  Máximo A. Roa,et al.  Grasp quality measures: review and performance , 2014, Autonomous Robots.

[2]  Gaurav S. Sukhatme,et al.  Self-supervised regrasping using spatio-temporal tactile features and reinforcement learning , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[3]  John F. Canny,et al.  Planning optimal grasps , 1992, Proceedings 1992 IEEE International Conference on Robotics and Automation.

[4]  Timo Korthals,et al.  Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks , 2021, Frontiers in Robotics and AI.

[5]  Hideki Hashimoto,et al.  Dextrous hand grasping force optimization , 1996, IEEE Trans. Robotics Autom..

[6]  Siddhartha S. Srinivasa,et al.  DART: Dynamic Animation and Robotics Toolkit , 2018, J. Open Source Softw..

[7]  Bohan Wu,et al.  MAT: Multi-Fingered Adaptive Tactile Grasping via Deep Reinforcement Learning , 2019, CoRL.

[8]  Andrew Howard,et al.  Design and use paradigms for Gazebo, an open-source multi-robot simulator , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[9]  S. Sastry,et al.  Task oriented optimal grasping by multifingered robot hands , 1987, Proceedings. 1987 IEEE International Conference on Robotics and Automation.

[10]  John F. Canny,et al.  Easily computable optimum grasps in 2-D and 3-D , 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.

[11]  Gerd Hirzinger,et al.  Grasp planning: how to choose a suitable task wrench space , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[12]  Sergey Levine,et al.  Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.

[13]  Ludovic Righetti,et al.  Leveraging Contact Forces for Learning to Grasp , 2018, 2019 International Conference on Robotics and Automation (ICRA).

[14]  Doina Precup,et al.  Reward is enough , 2021, Artif. Intell..

[15]  John Hsu,et al.  Analysis of grasping failures in multi-rigid body simulations , 2016, 2016 IEEE International Conference on Simulation, Modeling, and Programming for Autonomous Robots (SIMPAR).

[16]  Morgan Quigley,et al.  ROS: an open-source Robot Operating System , 2009, ICRA 2009.

[17]  Wenbin Hu,et al.  Reaching, Grasping and Re-grasping: Learning Multimode Grasping Skills. , 2020 .

[18]  Nancy S. Pollard,et al.  Synthesizing grasps from generalized prototypes , 1996, Proceedings of IEEE International Conference on Robotics and Automation.

[19]  Andrew Y. Ng,et al.  Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping , 1999, ICML.

[20]  John M. Hsu,et al.  Extending Open Dynamics Engine for the DARPA Virtual Robotics Challenge , 2014, SIMPAR.

[21]  Timo Korthals,et al.  Tactile Sensing and Deep Reinforcement Learning for In-Hand Manipulation Tasks , 2019 .