论文信息 - Extracting low-dimensional control variables for movement primitives

Extracting low-dimensional control variables for movement primitives

Movement primitives (MPs) provide a powerful framework for data driven movement generation that has been successfully applied for learning from demonstrations and robot reinforcement learning. In robotics we often want to solve a multitude of different, but related tasks. As the parameters of the primitives are typically high dimensional, a common practice for the generalization of movement primitives to new tasks is to adapt only a small set of control variables, also called meta parameters, of the primitive. Yet, for most MP representations, the encoding of these control variables is pre-coded in the representation and can not be adapted to the considered tasks. In this paper, we want to learn the encoding of task-specific control variables also from data instead of relying on fixed meta-parameter representations. We use hierarchical Bayesian models (HBMs) to estimate a low dimensional latent variable model for probabilistic movement primitives (ProMPs), which is a recent movement primitive representation. We show on two real robot datasets that ProMPs based on HBMs outperform standard ProMPs in terms of generalization and learning from a small amount of data and also allows for an intuitive analysis of the movement. We also extend our HBM by a mixture model, such that we can model different movement types in the same dataset.

Jan Peters | G. Neumann | A. Paraschos | E. Rückert | Jan Mundo

[1] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[2] Jonathan Baxter,et al. A Model of Inductive Bias Learning , 2000, J. Artif. Intell. Res..

[3] Jun Nakanishi,et al. Learning Attractor Landscapes for Learning Motor Primitives , 2002, NIPS.

[4] Emilio Bizzi,et al. Combinations of muscle synergies in the construction of a natural motor behavior , 2003, Nature Neuroscience.

[5] Massimiliano Pontil,et al. Regularized multi--task learning , 2004, KDD.

[6] Anton Schwaighofer,et al. Learning Gaussian processes from multiple tasks , 2005, ICML.

[7] Radford M. Neal. Pattern Recognition and Machine Learning , 2007, Technometrics.

[8] Lawrence Carin,et al. Multi-Task Learning for Classification with Dirichlet Process Priors , 2007, J. Mach. Learn. Res..

[9] Massimiliano Pontil,et al. Convex multi-task feature learning , 2008, Machine Learning.

[10] Jan Peters,et al. Policy Search for Motor Primitives in Robotics , 2008, NIPS 2008.

[11] Stefan Schaal,et al. Learning and generalization of motor skills by learning from demonstration , 2009, 2009 IEEE International Conference on Robotics and Automation.

[12] Hal Daumé,et al. Bayesian Multitask Learning with Latent Hierarchies , 2009, UAI.

[13] Alessandro Lazaric,et al. Bayesian Multi-Task Reinforcement Learning , 2010, ICML.

[14] A. Billard,et al. Learning Stable Nonlinear Dynamical Systems With Gaussian Mixture Models , 2011, IEEE Transactions on Robotics.

[15] Jan Peters,et al. A biomimetic approach to robot table tennis , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[16] Jun Morimoto,et al. Task-Specific Generalization of Discrete and Periodic Dynamic Movement Primitives , 2010, IEEE Transactions on Robotics.

[17] Bayesian MultiTask Reinforcement Learning , 2010 .

[18] Christoph H. Lampert,et al. Real-time detection of colored objects in multiple camera streams with off-the-shelf hardware components , 2012, Journal of Real-Time Image Processing.

[19] Hal Daumé,et al. Infinite Predictor Subspace Models for Multitask Learning , 2010, AISTATS.

[20] Aude Billard,et al. Learning Stable Nonlinear Dynamical Systems With Gaussian Mixture Models , 2011, IEEE Transactions on Robotics.

[21] Jan Peters,et al. Reinforcement Learning to Adjust Robot Movements to New Situations , 2010, IJCAI.

[22] Jacques Wainer,et al. Flexible Modeling of Latent Task Structures in Multitask Learning , 2012, ICML.

[23] Jan Peters,et al. Nonamemanuscript No. (will be inserted by the editor) Reinforcement Learning to Adjust Parametrized Motor Primitives to , 2011 .

[24] Hal Daumé,et al. Learning Task Grouping and Overlap in Multi-task Learning , 2012, ICML.

[25] Jun Morimoto,et al. On-line motion synthesis and adaptation using a trajectory database , 2012, Robotics Auton. Syst..

[26] Jan Peters,et al. Probabilistic Movement Primitives , 2013, NIPS.

[27] Carme Torras,et al. Learning Collaborative Impedance-Based Robot Behaviors , 2013, AAAI.

[28] Massimiliano Pontil,et al. Multilinear Multitask Learning , 2013, ICML.

[29] Jan Peters,et al. A probabilistic approach to robot trajectory generation , 2013, 2013 13th IEEE-RAS International Conference on Humanoid Robots (Humanoids).

[30] Jan Peters,et al. Data-Efficient Generalization of Robot Skills with Contextual Policy Search , 2013, AAAI.

[31] Eric Eaton,et al. Online Multi-Task Learning via Sparse Dictionary Optimization , 2014, AAAI.