Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks

Deep Reinforcement Learning techniques demonstrate advances in the domain of robotics. One of the limiting factors is a large number of interaction samples usually required for training in simulated and real-world environments. In this work, we demonstrate for a set of simulated dexterous in-hand object manipulation tasks that tactile information can substantially increase sample efficiency for training (by up to more than threefold). We also observe an improvement in performance (up to 46%) after adding tactile information. To examine the role of tactile-sensor parameters in these improvements, we included experiments with varied sensor-measurement accuracy (ground truth continuous values, noisy continuous values, Boolean values), and varied spatial resolution of the tactile sensors (927 sensors, 92 sensors, and 16 pooled sensor areas in the hand). To facilitate further studies and comparisons, we make these touch-sensor extensions available as a part of the OpenAI Gym Shadow-Dexterous-Hand robotics environments.

[1]  Raia Hadsell,et al.  Deep Reinforcement Learning for Tactile Robotics: Learning to Type on a Braille Keyboard , 2020, IEEE Robotics and Automation Letters.

[2]  Jitendra Malik,et al.  More Than a Feeling: Learning to Grasp and Regrasp Using Vision and Touch , 2018, IEEE Robotics and Automation Letters.

[3]  Jürgen Leitner,et al.  Multisensory assisted in-hand manipulation of objects with a dexterous hand , 2019, ICRA 2019.

[4]  Jan Peters,et al.  Learning robot in-hand manipulation with tactile features , 2015, 2015 IEEE-RAS 15th International Conference on Humanoid Robots (Humanoids).

[5]  Helge J. Ritter,et al.  Modularization of End-to-End Learning: Case Study in Arcade Games , 2019, ArXiv.

[6]  Shigeki Sugano,et al.  Tactile object recognition using deep learning and dropout , 2014, 2014 IEEE-RAS International Conference on Humanoid Robots.

[7]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[8]  Joshua Newth,et al.  Minkowski Portal Refinement and Speculative Contacts in Box2D , 2013 .

[9]  R. S. Johansson,et al.  Roles of glabrous skin receptors and sensorimotor memory in automatic control of precision grip when lifting rougher or more slippery objects , 2004, Experimental Brain Research.

[10]  Helge J. Ritter,et al.  A highly sensitive 3D-shaped tactile sensor , 2013, 2013 IEEE/ASME International Conference on Advanced Intelligent Mechatronics.

[11]  Peter K. Allen,et al.  Blind grasping: Stable robotic grasping using tactile feedback and hand kinematics , 2011, 2011 IEEE International Conference on Robotics and Automation.

[12]  M. Schilling,et al.  An Approach to Hierarchical Deep Reinforcement Learning for a Decentralized Walking Control Architecture , 2018, Biologically Inspired Cognitive Architectures 2018.

[13]  Jan Peters,et al.  Stable reinforcement learning with autoencoders for tactile and visual data , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[14]  Ludovic Righetti,et al.  Leveraging Contact Forces for Learning to Grasp , 2018, 2019 International Conference on Robotics and Automation (ICRA).

[15]  Sergey Levine,et al.  Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations , 2017, Robotics: Science and Systems.

[16]  Oliver Kroemer,et al.  Learning robot tactile sensing for object manipulation , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[17]  Gary Snethen,et al.  XenoCollide: Complex Collision Made Simple , 2008 .

[18]  Sergey Levine,et al.  Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.

[19]  Sergey Levine,et al.  Manipulation by Feel: Touch-Based Control with Deep Predictive Models , 2019, 2019 International Conference on Robotics and Automation (ICRA).

[20]  Sergey Levine,et al.  Optimal control with learned local models: Application to dexterous manipulation , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[21]  Helge J. Ritter,et al.  Approaching Manual Intelligence , 2010, KI - Künstliche Intelligenz.

[22]  Timo Korthals,et al.  Jointly Trained Variational Autoencoder for Multi-Modal Sensor Fusion , 2019, 2019 22th International Conference on Information Fusion (FUSION).

[23]  Tom Schaul,et al.  Universal Value Function Approximators , 2015, ICML.

[24]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[25]  Yuval Tassa,et al.  MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[26]  Sergey M. Plis,et al.  Learning to Run challenge solutions: Adapting reinforcement learning methods for neuromusculoskeletal environments , 2018, ArXiv.

[27]  Norbert Elkmann,et al.  Tactile sensing: A key technology for safe physical human robot interaction , 2011, 2011 6th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[28]  Daniel P Ferris,et al.  EEG correlates of sensorimotor processing: independent components involved in sensory and motor processing , 2017, Scientific Reports.

[29]  Herke van Hoof,et al.  Addressing Function Approximation Error in Actor-Critic Methods , 2018, ICML.

[30]  Jakub W. Pachocki,et al.  Learning dexterous in-hand manipulation , 2018, Int. J. Robotics Res..

[31]  Marcin Andrychowicz,et al.  Hindsight Experience Replay , 2017, NIPS.

[32]  Timo Korthals,et al.  Learn to Move Through a Combination of Policy Gradient Algorithms: DDPG, D4PG, and TD3 , 2020, LOD.

[33]  Helge J. Ritter,et al.  Distinguishing sliding from slipping during object pushing , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[34]  Marcin Andrychowicz,et al.  Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research , 2018, ArXiv.

[35]  R. Klatzky,et al.  Hand movements: A window into haptic object recognition , 1987, Cognitive Psychology.

[36]  Sabine U. König,et al.  Embodied cognition , 2018, 2018 6th International Conference on Brain-Computer Interface (BCI).

[37]  Wojciech Zaremba,et al.  OpenAI Gym , 2016, ArXiv.

[38]  Helge J. Ritter,et al.  Flexible and stretchable fabric-based tactile sensor , 2015, Robotics Auton. Syst..

[39]  Helge J. Ritter,et al.  Robot self-protection by virtual actuator fatigue: Application to tendon-driven dexterous hands during grasping , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).