Variable duration movement encoding with minimal intervention control

Programming by Demonstration (PbD) offers a user-friendly way to transfer skills from human to robot. Typically, demonstration data do not contain the control inputs required to reproduce the demonstrated skill. These can be obtained from a low-level controller that tracks the modeled movement. We present a PbD approach for minimal intervention control - a control strategy that only corrects perturbations that interfere with task performance. The novelty of our approach is the probabilistic encoding of the movement duration, providing a performance measure that enables minimal intervention control in a temporal sense. This is achieved by combining a probabilistic movement encoding based on Hidden Semi-Markov Model (HSMM) with Model Predictive Control (MPC). The probabilistic model is used to construct an objective function, hereby assuming that variance is a measure for task performance. The proposed method is demonstrated in a robot experiment and compared with our earlier work.

[1]  Jun Nakanishi,et al.  Dynamical Movement Primitives: Learning Attractor Models for Motor Behaviors , 2013, Neural Computation.

[2]  Sadaoki Furui,et al.  Speaker-independent isolated word recognition using dynamic features of speech spectrum , 1986, IEEE Trans. Acoust. Speech Signal Process..

[3]  Heiga Zen,et al.  Reformulating the HMM as a trajectory model by imposing explicit relationships between static and dynamic feature vector sequences , 2007, Comput. Speech Lang..

[4]  Satoshi Nakamura,et al.  Learning, Generation and Recognition of Motions by Reference-Point-Dependent Probabilistic Models , 2011, Adv. Robotics.

[5]  Stephen E. Levinson,et al.  Continuously variable duration hidden Markov models for automatic speech recognition , 1986 .

[6]  Michael I. Jordan,et al.  Optimal feedback control as a theory of motor coordination , 2002, Nature Neuroscience.

[7]  Clifford H. Thurber,et al.  Parameter estimation and inverse problems , 2005 .

[8]  Darwin G. Caldwell,et al.  Encoding the time and space constraints of a task in explicit-duration Hidden Markov Model , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[9]  Sandra Hirche,et al.  Risk-Sensitive Optimal Feedback Control for Haptic Assistance , 2012, 2012 IEEE International Conference on Robotics and Automation.

[10]  Shunzheng Yu,et al.  Hidden semi-Markov models , 2010, Artif. Intell..

[11]  Yiannis Demiris,et al.  A nonparametric Bayesian approach toward robot learning by demonstration , 2012, Robotics Auton. Syst..

[12]  Jan Peters,et al.  Probabilistic Movement Primitives , 2013, NIPS.

[13]  ProblemsPer Christian HansenDepartment The L-curve and its use in the numerical treatment of inverse problems , 2000 .

[14]  Aude Billard,et al.  Learning control Lyapunov function to ensure stability of dynamical system-based robot reaching motions , 2014, Robotics Auton. Syst..

[15]  Alberto Bemporad,et al.  Predictive Control for Linear and Hybrid Systems , 2017 .

[16]  Brett Browning,et al.  A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..

[17]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[18]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[19]  Nikolaos G. Tsagarakis,et al.  Statistical dynamical systems for skills acquisition in humanoids , 2012, 2012 12th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2012).

[20]  Darwin G. Caldwell,et al.  A task-parameterized probabilistic model with minimal intervention control , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).