Paper Information - Probabilistic Movement Primitives

Probabilistic Movement Primitives

Movement Primitives (MPs) are a well-established approach for representing modular and reusable robot movement generators. Many state-of-the-art robot learning successes are based on MPs, owing to their compact representation of inherently continuous and high-dimensional robot movements. A major goal in robot learning is to combine multiple MPs as building blocks in a modular control architecture to solve complex tasks. To this end, an MP representation has to allow for blending between motions, adapting to altered task variables, and co-activating multiple MPs in parallel. We present a probabilistic formulation of the MP concept that maintains a distribution over trajectories. Our probabilistic approach allows for the derivation of new operations that are essential for implementing all of the aforementioned properties in one framework. To use such a trajectory distribution for robot movement control, we analytically derive a stochastic feedback controller that reproduces the given trajectory distribution. We evaluate our approach and compare it to existing methods in several simulated and real robot scenarios.
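To make the idea of a distribution over trajectories more concrete, the following is a minimal sketch, not the authors' implementation. It assumes a single degree of freedom, a linear model y(t) = phi(t)^T w with normalized Gaussian basis functions, and a Gaussian distribution over the weights w fitted from demonstrations. None of these details is stated in the abstract; the function names, basis width, and synthetic demonstrations are hypothetical choices for illustration. Adapting to an altered task variable is illustrated by standard Gaussian conditioning on a desired via-point.

```python
import numpy as np

def rbf_features(t, n_basis=10, width=0.02):
    """Normalized Gaussian basis functions at phases t in [0, 1] (assumed basis)."""
    centers = np.linspace(0.0, 1.0, n_basis)
    phi = np.exp(-0.5 * (t[:, None] - centers[None, :]) ** 2 / width)
    return phi / phi.sum(axis=1, keepdims=True)  # shape (T, n_basis)

def fit_weight_distribution(demos, phi, ridge=1e-6):
    """Fit one weight vector per demonstration by ridge regression,
    then summarize the weights with a Gaussian (mean, covariance)."""
    A = phi.T @ phi + ridge * np.eye(phi.shape[1])
    W = np.linalg.solve(A, phi.T @ demos.T).T  # shape (n_demos, n_basis)
    mu_w = W.mean(axis=0)
    sigma_w = np.cov(W, rowvar=False) + ridge * np.eye(phi.shape[1])
    return mu_w, sigma_w

def condition_on_via_point(mu_w, sigma_w, phi_t, y_star, obs_var=1e-6):
    """Gaussian conditioning of the weight distribution on y(t) = y_star."""
    s = phi_t @ sigma_w @ phi_t + obs_var           # predicted variance at t
    gain = sigma_w @ phi_t / s                      # shape (n_basis,)
    mu_new = mu_w + gain * (y_star - phi_t @ mu_w)
    sigma_new = sigma_w - np.outer(gain, phi_t @ sigma_w)
    return mu_new, sigma_new

# Synthetic demonstrations (made-up data): sine reaches with varying amplitude.
rng = np.random.default_rng(0)
t = np.linspace(0.0, 1.0, 100)
phi = rbf_features(t)
demos = np.stack([
    (1.0 + 0.1 * rng.standard_normal()) * np.sin(np.pi * t)
    + 0.01 * rng.standard_normal(t.size)
    for _ in range(20)
])

# Trajectory distribution: mean and per-time-step variance.
mu_w, sigma_w = fit_weight_distribution(demos, phi)
mean_traj = phi @ mu_w
var_traj = np.einsum("tb,bc,tc->t", phi, sigma_w, phi)

# Adapt the primitive so that it passes near y = 1.2 at time index 50.
idx = 50
mu_c, sigma_c = condition_on_via_point(mu_w, sigma_w, phi[idx], y_star=1.2)
cond_mean = phi @ mu_c
cond_var = np.einsum("tb,bc,tc->t", phi, sigma_c, phi)
```

After conditioning, the mean trajectory passes close to the requested via-point and the variance at that time step shrinks, while the rest of the trajectory is adjusted according to the correlations captured in the weight covariance. The stochastic feedback controller mentioned in the abstract is not covered by this sketch.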
