A policy-blending formalism for shared control

In shared control teleoperation, the robot assists the user in accomplishing the desired task, making teleoperation easier and more seamless. Rather than simply executing the user’s input, which is hindered by the inadequacies of the interface, the robot attempts to predict the user’s intent, and assists in accomplishing it. In this work, we are interested in the scientific underpinnings of assistance: we propose an intuitive formalism that captures assistance as policy blending, illustrate how some of the existing techniques for shared control instantiate it, and provide a principled analysis of its main components: prediction of user intent and its arbitration with the user input. We define the prediction problem, with foundations in inverse reinforcement learning, discuss simplifying assumptions that make it tractable, and test these on data from users teleoperating a robotic manipulator. We define the arbitration problem from a control-theoretic perspective, and turn our attention to what users consider good arbitration. We conduct a user study that analyzes the effect of different factors on the performance of assistance, indicating that arbitration should be contextual: it should depend on the robot’s confidence in itself and in the user, and even the particulars of the user. Based on the study, we discuss challenges and opportunities that a robot sharing the control with the user might face: adaptation to the context and the user, legibility of behavior, and the closed loop between prediction and user behavior.

[1]  Anind K. Dey,et al.  Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.

[2]  Marc Toussaint,et al.  Trajectory prediction: learning to map situations to robot trajectories , 2009, ICML '09.

[3]  Sterling J. Anderson,et al.  Semi-Autonomous Stability Control and Hazard Avoidance for Manned and Unmanned Ground Vehicles , 2010 .

[4]  John Platt,et al.  Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .

[5]  Allison M. Okamura,et al.  Recognition of operator motions for real-time assistance using virtual fixtures , 2003, 11th Symposium on Haptic Interfaces for Virtual Environment and Teleoperator Systems, 2003. HAPTICS 2003. Proceedings..

[6]  E. Todorov,et al.  A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems , 2005, Proceedings of the 2005, American Control Conference, 2005..

[7]  Chrystopher L. Nehaniv,et al.  Imitation as a Dual-Route Process Featuring Predictive and Learning Components: A Biologically Plausible Computational Model , 2002 .

[8]  Michael A. Goodrich,et al.  Characterizing efficiency of human robot interaction: a case study of shared-control teleoperation , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[9]  Danica Kragic,et al.  Adaptive Virtual Fixtures for Machine-Assisted Teleoperation Tasks , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[10]  D. Kortenkamp,et al.  Adjustable control autonomy for manned space flight , 2000, 2000 IEEE Aerospace Conference. Proceedings (Cat. No.00TH8484).

[11]  Annika Waern,et al.  Recognising Human Plans: Issues for Plan Recognition in Human - Computer Interaction , 1996 .

[12]  Zhao Wang,et al.  How Autonomy Impacts Performance and Satisfaction: Results From a Study With Spinal Cord Injured Subjects Using an Assistive Robot , 2012, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[13]  Kris K. Hauser,et al.  Recognition, prediction, and planning for assisted teleoperation of freeform tasks , 2012, Autonomous Robots.

[14]  Siddhartha S. Srinivasa,et al.  Planning-based prediction for pedestrians , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[15]  Alexander J. Smola,et al.  Advances in Large Margin Classifiers , 2000 .

[16]  Stefan Schaal,et al.  STOMP: Stochastic trajectory optimization for motion planning , 2011, 2011 IEEE International Conference on Robotics and Automation.

[17]  Ranjan Mukherjee,et al.  A shared-control approach to haptic interface design for minimally invasive telesurgical training , 2005, IEEE Transactions on Control Systems Technology.

[18]  Siddhartha S. Srinivasa,et al.  Manipulation planning with goal sets using constrained trajectory optimization , 2011, 2011 IEEE International Conference on Robotics and Automation.

[19]  Pieter Abbeel,et al.  Apprenticeship learning via inverse reinforcement learning , 2004, ICML.

[20]  Gregory D. Hager,et al.  Spatial motion constraints: theory and demonstrations for robot guidance using virtual fixtures , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[21]  Wayne Book,et al.  Blended Shared Control of Zermelo's navigation problem , 2010, Proceedings of the 2010 American Control Conference.

[22]  Louis B. Rosenberg,et al.  Virtual fixtures: Perceptual tools for telerobotic manipulation , 1993, Proceedings of IEEE Virtual Reality Annual International Symposium.

[23]  Christian Laugier,et al.  Intentional Motion Online Learning and Prediction , 2005, FSR.

[24]  Martial Hebert,et al.  Contextual Sequence Prediction with Application to Control Library Optimization , 2012, Robotics: Science and Systems.

[25]  Christian Laugier,et al.  The International Journal of Robotics Research (IJRR) - Special issue on ``Field and Service Robotics '' , 2009 .

[26]  Christian Smith,et al.  Teleoperation for a ball-catching task with significant dynamics , 2008, Neural Networks.

[27]  Christian Laugier,et al.  Intentional motion on-line learning and prediction , 2008, Machine Vision and Applications.

[28]  Charles F. Schmidt,et al.  A Model of the Common-Sense Theory of Intention and Personal Causation , 1973, IJCAI.

[29]  Redwan Alqasemi,et al.  Telemanipulation Assistance Based on Motion Intention Recognition , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[30]  Vijay Kumar,et al.  Mixed Initiative Control of Autonomous Vehicles , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[31]  Brett Browning,et al.  Sliding Autonomy for Peer-To-Peer Human-Robot Teams , 2008 .

[32]  Siddhartha S. Srinivasa,et al.  CHOMP: Gradient optimization techniques for efficient motion planning , 2009, 2009 IEEE International Conference on Robotics and Automation.

[33]  J. Christian Gerdes,et al.  A Unified Approach to Driver Assistance Systems Based on Artificial Potential Fields , 1999, Dynamic Systems and Control.

[34]  Robert D. Howe,et al.  Cooperative Human and Machine Perception in Teleoperated Assembly , 2000, ISER.

[35]  Brenan J. McCarragher,et al.  Human integration into robot control utilising potential fields , 1997, Proceedings of International Conference on Robotics and Automation.

[36]  Gregory D. Hager,et al.  Human-Machine Collaborative Systems for Microsurgical Applications , 2005, Int. J. Robotics Res..

[37]  Robert Platt,et al.  Extracting User Intent in Mixed Initiative Teleoperator Control , 2004 .

[38]  Siddhartha S. Srinivasa,et al.  Learning from Experience in Manipulation Planning: Setting the Right Goals , 2011, ISRR.

[39]  Sanjiv Singh,et al.  User Modelling for Principled Sliding Autonomy in Human-Robot Teams , 2005 .

[40]  Jonathan Kofman,et al.  Teleoperation of a robot manipulator using a vision-based human-robot interface , 2005, IEEE Transactions on Industrial Electronics.

[41]  Allison M. Okamura,et al.  Effect of virtual fixture compliance on human-machine cooperative manipulation , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[42]  Marc Toussaint,et al.  Robot trajectory optimization using approximate inference , 2009, ICML '09.

[43]  Leila Takayama,et al.  Strategies for human-in-the-loop robotic grasping , 2012, 2012 7th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[44]  J. Andrew Bagnell,et al.  Maximum margin planning , 2006, ICML.

[45]  Martin Buss,et al.  Position and force augmentation in a telepresence system and their effects on perceived realism , 2009, World Haptics 2009 - Third Joint EuroHaptics conference and Symposium on Haptic Interfaces for Virtual Environment and Teleoperator Systems.

[46]  Robert A. Henning,et al.  Can Teams Outperform Individuals in a Simulated Dynamic Control Task? , 2000 .

[47]  Holly A. Yanco,et al.  Blending human and robot inputs for sliding scale autonomy , 2005, ROMAN 2005. IEEE International Workshop on Robot and Human Interactive Communication, 2005..

[48]  Donald D. Dudenhoeffer,et al.  Dynamic-Autonomy for Urban Search and Rescue , 2002, AAAI Mobile Robot Competition.

[49]  Gillian M. Hayes,et al.  Imitation as a dual-route process featuring prediction and learning components: A biologically plaus , 2002 .

[50]  Jian Shen,et al.  A collaborative-shared control system with safe obstacle avoidance capability , 2004, IEEE Conference on Robotics, Automation and Mechatronics, 2004..

[51]  Kris K. Hauser,et al.  Assisted Teleoperation Strategies for Aggressively Controlling a Robot Arm with 2D Input , 2011, Robotics: Science and Systems.

[52]  K. Dautenhahn,et al.  Imitation in Animals and Artifacts , 2002 .

[53]  Rüdiger Dillmann,et al.  Understanding users intention: programming fine manipulation tasks by demonstration , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[54]  Siddhartha S. Srinivasa,et al.  Online customization of teleoperation interfaces , 2012, 2012 IEEE RO-MAN: The 21st IEEE International Symposium on Robot and Human Interactive Communication.