论文信息 - Learning parameterized motor skills on a humanoid robot

Learning parameterized motor skills on a humanoid robot

We demonstrate a sample-efficient method for constructing reusable parameterized skills that can solve families of related motor tasks. Our method uses learned policies to analyze the policy space topology and learn a set of regression models which, given a novel task, appropriately parameterizes an underlying low-level controller. By identifying the disjoint charts that compose the policy manifold, the method can separately model the qualitatively different sub-skills required for solving distinct classes of tasks. Such sub-skills are useful because they can be treated as new discrete, specialized actions by higher-level planning processes. We also propose a method for reusing seemingly unsuccessful policies as additional, valid training samples for synthesizing the skill, thus accelerating learning. We evaluate our method on a humanoid iCub robot tasked with learning to accurately throw plastic balls at parameterized target locations.

[1] J. Tenenbaum,et al. A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[2] Olivier Sigaud,et al. Learning compact parameterized skills with a single regression , 2013, 2013 13th IEEE-RAS International Conference on Humanoid Robots (Humanoids).

[3] Peter Stone,et al. Autonomous transfer for reinforcement learning , 2008, AAMAS.

[4] Jun Nakanishi,et al. Learning Movement Primitives , 2005, ISRR.

[5] Peter Stone,et al. Value-Function-Based Transfer for Reinforcement Learning Using Structure Mapping , 2006, AAAI.

[6] Giulio Sandini,et al. The iCub humanoid robot: An open-systems platform for research in cognitive development , 2010, Neural Networks.

[7] Vishal Soni,et al. Reinforcement learning of hierarchical skills on the sony aibo robot , 2005, AAAI 2005.

[8] Olivier Sigaud,et al. Path Integral Policy Improvement with Covariance Matrix Adaptation , 2012, ICML.

[9] Andrew Y. Ng,et al. Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.

[10] Bruno Castro da Silva,et al. Learning Parameterized Skills , 2012, ICML.

[11] Jan Peters,et al. Information-Theoretic Motor Skill Learning , 2013, AAAI 2013.

[12] Stefan Schaal,et al. A Generalized Path Integral Control Approach to Reinforcement Learning , 2010, J. Mach. Learn. Res..

[13] Peter Stone,et al. Learning Powerful Kicks on the Aibo ERS-7: The Quest for a Striker , 2010, RoboCup.

[14] Jan Peters,et al. Nonamemanuscript No. (will be inserted by the editor) Reinforcement Learning to Adjust Parametrized Motor Primitives to , 2011 .