Adaptive indirect control through communication in collaborative human-robot interaction

This paper addresses the problem of human-robot collaboration in scenarios where a robot assists a human by executing a complex motion involving the manipulation of an object. We focus on tasks in which success in the task depends on reaching a target pose that is controlled by the human. We contribute a reinforcement learning-based approach that allows the robot to reason about its own ability to successfully complete the task given the current target pose and indirectly adjust that pose by prompting the human user. Our approach allows the robot both to trade-off the benefits of adjusting the target position against the cost of bothering the human user while, at the same time, adapting to each user's responses. Our approach was tested in a real-world human-robot collaboration scenario involving the Baxter robot.

[1]  Benjamin Recht,et al.  Weighted Sums of Random Kitchen Sinks: Replacing minimization with randomization in learning , 2008, NIPS.

[2]  Hendrik Van Brussel,et al.  Human-inspired robot assistant for fast point-to-point movements , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[3]  Manuela M. Veloso,et al.  Personalized Assistance for Dressing Users , 2015, ICSR.

[4]  Richard S. Sutton,et al.  Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming , 1990, ML.

[5]  Tamio Arai,et al.  Human-robot cooperative manipulation with motion estimation , 2001, Proceedings 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems. Expanding the Societal Role of Robotics in the the Next Millennium (Cat. No.01CH37180).

[6]  Sarit Kraus,et al.  Ad Hoc Autonomous Agent Teams: Collaboration without Pre-Coordination , 2010, AAAI.

[7]  Manuela M. Veloso,et al.  Adaptive Symbiotic Collaboration for Targeted Complex Manipulation Tasks , 2016, ECAI.

[8]  Jan Peters,et al.  Learning interaction for collaborative tasks with probabilistic movement primitives , 2014, 2014 IEEE-RAS International Conference on Humanoid Robots.

[9]  Siddhartha S. Srinivasa,et al.  Generating Legible Motion , 2013, Robotics: Science and Systems.

[10]  Stephanie Rosenthal,et al.  An effective personal mobile robot agent through symbiotic human-robot interaction , 2010, AAMAS.

[11]  Benjamin Recht,et al.  Random Features for Large-Scale Kernel Machines , 2007, NIPS.

[12]  Yiannis Demiris,et al.  User modelling for personalised dressing assistance by humanoid robots , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[13]  Dmitry Berenson,et al.  Predicting human reaching motion in collaborative tasks using Inverse Optimal Control and iterative re-planning , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[14]  Dmitry Berenson,et al.  Human-robot collaborative manipulation planning using early prediction of human motion , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[15]  Chris Watkins,et al.  Learning from delayed rewards , 1989 .

[16]  Andrea Lockerd Thomaz,et al.  Generating human-like motion for robots , 2013, Int. J. Robotics Res..

[17]  Siddhartha S. Srinivasa,et al.  Effects of Robot Motion on Human-Robot Collaboration , 2015, 2015 10th ACM/IEEE International Conference on Human-Robot Interaction (HRI).