A sensorimotor reinforcement learning framework for physical Human-Robot Interaction

Modeling of physical human-robot collaborations is generally a challenging problem due to the unpredictive nature of human behavior. To address this issue, we present a data-efficient reinforcement learning framework which enables a robot to learn how to collaborate with a human partner. The robot learns the task from its own sensorimotor experiences in an unsupervised manner. The uncertainty in the interaction is modeled using Gaussian processes (GP) to implement a forward model and an action-value function. Optimal action selection given the uncertain GP model is ensured by Bayesian optimization. We apply the framework to a scenario in which a human and a PR2 robot jointly control the ball position on a plank based on vision and force/torque data. Our experimental results show the suitability of the proposed method in terms of fast and data-efficient model learning, optimal action selection under uncertainty and equal role sharing between the partners.

[1]  Danica Kragic,et al.  Online kinematics estimation for active human-robot manipulation of jointly held objects , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[2]  Paul Evrard,et al.  Homotopy switching model for dyad haptic interaction in physical collaborative tasks , 2009, World Haptics 2009 - Third Joint EuroHaptics conference and Symposium on Haptic Interfaces for Virtual Environment and Teleoperator Systems.

[3]  Philippe Fraisse,et al.  Experimental study on haptic communication of a human in a shared human-robot collaborative task , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[4]  Danica Kragic,et al.  Self-learning and adaptation in a sensorimotor framework , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[5]  Don Joven Agravante,et al.  Human-humanoid joint haptic table carrying task with height stabilization using vision , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[6]  Nando de Freitas,et al.  Taking the Human Out of the Loop: A Review of Bayesian Optimization , 2016, Proceedings of the IEEE.

[7]  Danica Kragic,et al.  Mapping human intentions to robot motions via physical interaction through a jointly-held object , 2014, The 23rd IEEE International Symposium on Robot and Human Interactive Communication.

[8]  Weihua Sheng,et al.  Using human motion estimation for human-robot cooperative manipulation , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[9]  Sergey Levine,et al.  Trust Region Policy Optimization , 2015, ICML.

[10]  J. Kevin O'Regan,et al.  What it is like to see: A sensorimotor theory of perceptual experience , 2001, Synthese.

[11]  Carl E. Rasmussen,et al.  Gaussian Processes for Machine Learning (GPML) Toolbox , 2010, J. Mach. Learn. Res..

[12]  Abderrahmane Kheddar,et al.  Motion learning and adaptive impedance for robot control during physical interaction with humans , 2011, 2011 IEEE International Conference on Robotics and Automation.

[13]  Don Joven Agravante,et al.  Collaborative human-humanoid carrying using vision and haptic sensing , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[14]  Jan Peters,et al.  Reinforcement learning in robotics: A survey , 2013, Int. J. Robotics Res..

[15]  Yusuke Maeda Takayuki Ham,et al.  Human-Robot Cooperative Manipulation with Motion Estimation , 2001 .

[16]  Hidenori Kimura,et al.  Human-robot collaboration in precise positioning of a three-dimensional object , 2009, Autom..

[17]  François Keith,et al.  Proactive behavior of a humanoid robot in a haptic transportation task with a human partner , 2012, 2012 IEEE RO-MAN: The 21st IEEE International Symposium on Robot and Human Interactive Communication.

[18]  Jan Peters,et al.  A Survey on Policy Search for Robotics , 2013, Found. Trends Robotics.

[19]  Atsuto Maki,et al.  A sensorimotor approach for self-learning of hand-eye coordination , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[20]  Carl E. Rasmussen,et al.  Gaussian Processes for Data-Efficient Learning in Robotics and Control , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Malte Kuß,et al.  Gaussian process models for robust regression, classification, and reinforcement learning , 2006 .

[22]  Gert Kootstra,et al.  Learning visual forward models to compensate for self-induced image motion , 2014, The 23rd IEEE International Symposium on Robot and Human Interactive Communication.

[23]  Andrew McHutchon Differentiating Gaussian Processes , 2013 .

[24]  Tamio Arai,et al.  Human-robot cooperative manipulation with motion estimation , 2001, Proceedings 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems. Expanding the Societal Role of Robotics in the the Next Millennium (Cat. No.01CH37180).