Supervisory teleoperation with online learning and optimal control

We present a general approach for online learning and optimal control of manipulation tasks in a supervisory teleoperation context, targeted to underwater remotely operated vehicles (ROVs). We use an online Bayesian nonparametric learning algorithm to build models of manipulation motions as task-parametrized hidden semi-Markov models (TP-HSMM) that capture the spatiotemporal characteristics of demonstrated motions in a probabilistic representation. Motions are then executed autonomously using an optimal controller, namely a model predictive control (MPC) approach in a receding horizon fashion. This way the remote system locally closes a high-frequency control loop that robustly handles noise and dynamically changing environments. Our system automates common and recurring tasks, allowing the operator to focus only on the tasks that genuinely require human intervention. We demonstrate how our solution can be used for a hot-stabbing motion in an underwater teleoperation scenario. We evaluate the performance of the system over multiple trials and compare with a state-of-the-art approach. We report that our approach generalizes well with only a few demonstrations, accurately performs the learned task and adapts online to dynamically changing task conditions.

[1]  Klaus Landzettel,et al.  Multisensory Shared Autonomy - A Key Issue In The Space Robot Technology Experiment ROTEX , 1992, Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems.

[2]  Chin-Hui Lee,et al.  Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains , 1994, IEEE Trans. Speech Audio Process..

[3]  Christian A. Mueller,et al.  Dexterous Undersea Interventions with Far Distance Onshore Supervision: the DexROV Project , 2016 .

[4]  Darwin G. Caldwell,et al.  An Approach for Imitation Learning on Riemannian Manifolds , 2017, IEEE Robotics and Automation Letters.

[5]  Robert D. Howe,et al.  Virtual Fixtures for Robotic Cardiac Surgery , 2001, MICCAI.

[6]  Ajay Kumar Tanwani,et al.  Learning Robot Manipulation Tasks With Task-Parameterized Semitied Hidden Semi-Markov Model , 2016, IEEE Robotics and Automation Letters.

[7]  Jan Peters,et al.  Probabilistic Movement Primitives , 2013, NIPS.

[8]  Allison M. Okamura,et al.  Haptic Virtual Fixtures for Robot-Assisted Manipulation , 2005, ISRR.

[9]  Sylvain Calinon,et al.  Learning assistive teleoperation behaviors from demonstration , 2016, 2016 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR).

[10]  S. Calinon,et al.  Online Incremental Learning of Manipulation Tasks for Semi-Autonomous Teleoperation , 2016 .

[11]  Louis B. Rosenberg,et al.  Virtual fixtures: Perceptual tools for telerobotic manipulation , 1993, Proceedings of IEEE Virtual Reality Annual International Symposium.

[12]  Darwin G. Caldwell,et al.  Encoding the time and space constraints of a task in explicit-duration Hidden Markov Model , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[13]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[14]  Freek Stulp,et al.  Co-manipulation with multiple probabilistic virtual guides , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[15]  Howard Jay Chizeck,et al.  Advanced telerobotic underwater manipulation using virtual fixtures and haptic rendering , 2013, 2013 OCEANS - San Diego.

[16]  Thomas B. Sheridan,et al.  Supervisory control of remote manipulation , 1967, IEEE Spectrum.

[17]  Michael I. Jordan,et al.  Revisiting k-means: New Algorithms via Bayesian Nonparametrics , 2011, ICML.

[18]  Ke Jiang,et al.  Small-Variance Asymptotics for Hidden Markov Models , 2013, NIPS.

[19]  Gregory D. Hager,et al.  Semi-autonomous telerobotic assembly over high-latency networks , 2016, 2016 11th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[20]  Sylvain Calinon,et al.  Robot Learning with Task-Parameterized Generative Models , 2015, ISRR.

[21]  Shunzheng Yu,et al.  Practical implementation of an efficient forward-backward algorithm for an explicit-duration hidden Markov model , 2006, IEEE Transactions on Signal Processing.