Learning Arbitration for Shared Autonomy by Hindsight Data Aggregation

In this paper we present a framework for the teleoperation of pick-and-place tasks. We define a shared control policy that allows to blend between direct user control and autonomous control based on user intent inference. One of the main challenges in shared autonomy systems is to define the arbitration function, which decides when to let the autonomous agent take over. In this work, we propose a model and training method to learn the arbitration function. Our model is based on a recurrent neural network that takes as input the state, intent prediction scores and user command to produce an arbitration between user and robot commands. This work extends our previous work on differentiable policies for shared autonomy. Differentiability of the policy is desirable to further train the shared autonomy system end-to-end. In this work we propose training of the arbitration function by using data from user performing the task with shared control. We present initial results by teleoperating a gripper in a virtual environment using pre-trained motion generation and intent prediction. We compare our data aggregation training procedure to a handcrafted arbitration function. Our preliminary results show the efficacy of the approach and shed light on limitations that we believe demonstrate the need for user adaptability in shared autonomy systems.

[1]  Brian D. Ziebart,et al.  Goal-predictive robotic teleoperation from noisy sensors , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[2]  Anind K. Dey,et al.  Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.

[3]  Dmitry Berenson,et al.  Toward a user-guided manipulation framework for high-DOF robots with limited communication , 2013, 2013 IEEE Conference on Technologies for Practical Robot Applications (TePRA).

[4]  Dmitry Berenson,et al.  Goal Set Inverse Optimal Control and Iterative Replanning for Predicting Human Reaching Motions in Shared Workspaces , 2016, IEEE Transactions on Robotics.

[5]  Dmitry Berenson,et al.  Toward a user-guided manipulation framework for high-DOF robots with limited communication , 2013, TePRA.

[6]  Anca D. Dragan,et al.  Shared Autonomy via Deep Reinforcement Learning , 2018, Robotics: Science and Systems.

[7]  Siddhartha S. Srinivasa,et al.  Human-Robot Mutual Adaptation in Shared Autonomy , 2017, 2017 12th ACM/IEEE International Conference on Human-Robot Interaction (HRI.

[8]  Dmitry Berenson,et al.  From Autonomy to Cooperative Traded Control of Humanoid Manipulation Tasks with Unreliable Communication , 2016, J. Intell. Robotic Syst..

[9]  Stefan Schaal,et al.  Real-Time Perception Meets Reactive Motion Generation , 2017, IEEE Robotics and Automation Letters.

[10]  Stefan Schaal,et al.  Warping the workspace geometry with electric potentials for motion optimization of manipulation tasks , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[11]  J. Andrew Bagnell,et al.  Efficient Reductions for Imitation Learning , 2010, AISTATS.

[12]  Thomas B. Sheridan,et al.  Telerobotics, Automation, and Human Supervisory Control , 2003 .

[13]  Marc Toussaint,et al.  A Differentiable Policy for Shared Autonomy , 2019 .

[14]  Siddhartha S. Srinivasa,et al.  Shared autonomy via hindsight optimization for teleoperation and teaming , 2017, Int. J. Robotics Res..

[15]  Andrew Y. Ng,et al.  Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.

[16]  Michael A. Goodrich,et al.  Teleoperation and Beyond for Assistive Humanoid Robots , 2013 .

[17]  B. Argall,et al.  Human-in-the-Loop Optimization of Shared Autonomy in Assistive Robotics , 2017, IEEE Robotics and Automation Letters.

[18]  Siddhartha S. Srinivasa,et al.  A policy-blending formalism for shared control , 2013, Int. J. Robotics Res..

[19]  Taskin Padir,et al.  A Blended Human-Robot Shared Control Framework to Handle Drift and Latency , 2018, 2019 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR).

[20]  Thomas B. Sheridan,et al.  Supervisory control of remote manipulation , 1967, IEEE Spectrum.