论文信息 - A sensorimotor reinforcement learning framework for physical Human-Robot Interaction

A sensorimotor reinforcement learning framework for physical Human-Robot Interaction

Modeling of physical human-robot collaborations is generally a challenging problem due to the unpredictive nature of human behavior. To address this issue, we present a data-efficient reinforcement learning framework which enables a robot to learn how to collaborate with a human partner. The robot learns the task from its own sensorimotor experiences in an unsupervised manner. The uncertainty in the interaction is modeled using Gaussian processes (GP) to implement a forward model and an action-value function. Optimal action selection given the uncertain GP model is ensured by Bayesian optimization. We apply the framework to a scenario in which a human and a PR2 robot jointly control the ball position on a plank based on vision and force/torque data. Our experimental results show the suitability of the proposed method in terms of fast and data-efficient model learning, optimal action selection under uncertainty and equal role sharing between the partners.

[1] Danica Kragic,et al. Online kinematics estimation for active human-robot manipulation of jointly held objects , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[2] Paul Evrard,et al. Homotopy switching model for dyad haptic interaction in physical collaborative tasks , 2009, World Haptics 2009 - Third Joint EuroHaptics conference and Symposium on Haptic Interfaces for Virtual Environment and Teleoperator Systems.

[3] Philippe Fraisse,et al. Experimental study on haptic communication of a human in a shared human-robot collaborative task , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[4] Danica Kragic,et al. Self-learning and adaptation in a sensorimotor framework , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[5] Don Joven Agravante,et al. Human-humanoid joint haptic table carrying task with height stabilization using vision , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[6] Nando de Freitas,et al. Taking the Human Out of the Loop: A Review of Bayesian Optimization , 2016, Proceedings of the IEEE.

[7] Danica Kragic,et al. Mapping human intentions to robot motions via physical interaction through a jointly-held object , 2014, The 23rd IEEE International Symposium on Robot and Human Interactive Communication.

[8] Weihua Sheng,et al. Using human motion estimation for human-robot cooperative manipulation , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[9] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.

[10] J. Kevin O'Regan,et al. What it is like to see: A sensorimotor theory of perceptual experience , 2001, Synthese.

[11] Carl E. Rasmussen,et al. Gaussian Processes for Machine Learning (GPML) Toolbox , 2010, J. Mach. Learn. Res..

[12] Abderrahmane Kheddar,et al. Motion learning and adaptive impedance for robot control during physical interaction with humans , 2011, 2011 IEEE International Conference on Robotics and Automation.

[13] Don Joven Agravante,et al. Collaborative human-humanoid carrying using vision and haptic sensing , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[14] Jan Peters,et al. Reinforcement learning in robotics: A survey , 2013, Int. J. Robotics Res..

[15] Yusuke Maeda Takayuki Ham,et al. Human-Robot Cooperative Manipulation with Motion Estimation , 2001 .

[16] Hidenori Kimura,et al. Human-robot collaboration in precise positioning of a three-dimensional object , 2009, Autom..

[17] François Keith,et al. Proactive behavior of a humanoid robot in a haptic transportation task with a human partner , 2012, 2012 IEEE RO-MAN: The 21st IEEE International Symposium on Robot and Human Interactive Communication.

[18] Jan Peters,et al. A Survey on Policy Search for Robotics , 2013, Found. Trends Robotics.

[19] Atsuto Maki,et al. A sensorimotor approach for self-learning of hand-eye coordination , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[20] Carl E. Rasmussen,et al. Gaussian Processes for Data-Efficient Learning in Robotics and Control , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21] Malte Kuß,et al. Gaussian process models for robust regression, classification, and reinforcement learning , 2006 .

[22] Gert Kootstra,et al. Learning visual forward models to compensate for self-induced image motion , 2014, The 23rd IEEE International Symposium on Robot and Human Interactive Communication.

[23] Andrew McHutchon. Differentiating Gaussian Processes , 2013 .

[24] Tamio Arai,et al. Human-robot cooperative manipulation with motion estimation , 2001, Proceedings 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems. Expanding the Societal Role of Robotics in the the Next Millennium (Cat. No.01CH37180).