A Differentiable Policy for Shared Autonomy

In this paper we present a framework for the teleoperation of pick and place tasks. We define a shared control policy that allows to blend between direct user control and autonomous control based on user intent inference. One of the main challenges in shared autonomy systems is to define the arbitration function, which decides when to let the autonomous agent take over. In this work, we propose a differentiable policy model that integrates motion generation, user intent inference and arbitration. Full differentialbilty of the policy is desirable to further train the shared autonomy system using Reinforcement Learning (RL). We present initial results teleoperating a gripper in a virtual environment using pre-training and hand tuning of the arbitration function. Our results demonstrate the efficacy of the approach when the intent inference module is trained on a task similar to the one performed at test time. Our results also shed light on limitations that we believe demonstrate the need for a shared autonomy RL setup.

[1]  Dmitry Berenson,et al.  From Autonomy to Cooperative Traded Control of Humanoid Manipulation Tasks with Unreliable Communication , 2016, J. Intell. Robotic Syst..

[2]  Siddhartha S. Srinivasa,et al.  Shared autonomy via hindsight optimization for teleoperation and teaming , 2017, Int. J. Robotics Res..

[3]  Stefan Schaal,et al.  Warping the workspace geometry with electric potentials for motion optimization of manipulation tasks , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[4]  Anca D. Dragan,et al.  Shared Autonomy via Deep Reinforcement Learning , 2018, Robotics: Science and Systems.

[5]  Siddhartha S. Srinivasa,et al.  A policy-blending formalism for shared control , 2013, Int. J. Robotics Res..

[6]  Siddhartha S. Srinivasa,et al.  Human-Robot Mutual Adaptation in Shared Autonomy , 2017, 2017 12th ACM/IEEE International Conference on Human-Robot Interaction (HRI.

[7]  Sergey Levine,et al.  Guided Policy Search , 2013, ICML.