Paper Information - Machine Learning for Fast Quadrupedal Locomotion

Machine Learning for Fast Quadrupedal Locomotion

For a robot, the ability to get from one place to another is one of the most basic skills. However, locomotion on legged robots is a challenging multidimensional control problem. This paper presents a machine learning approach to legged locomotion, with all training done on the physical robots. The main contributions are a specification of our fully automated learning environment and a detailed empirical comparison of four different machine learning algorithms for learning quadrupedal locomotion. The resulting learned walk is considerably faster than all previously reported hand-coded walks for the same robot platform.

Peter Stone | Nate Kohl
