论文信息 - Learning Powerful Kicks on the Aibo ERS-7: The Quest for a Striker

Learning Powerful Kicks on the Aibo ERS-7: The Quest for a Striker

Coordinating complex motion sequences remains a challenging task for robotics. Machine Learning has aided this process, successfully improving motion sequences such as walking and grasping. However, to the best of our knowledge, outside of simulation, learning has never been applied to the task of kicking the ball. We apply machine learning methods to optimize kick power entirely on a real robot. The resulting learned kick is significantly more powerful than the most powerful handcoded kick of one of the most successful RoboCup four-legged league teams, and is learned in a principled manner which requires very little engineering of the parameter space. Finally, model inversion is applied to the problem of creating a parameterized kick capable of kicking the ball a specified distance.

Peter Stone | Matthew J. Hausknecht

[1] Peter Stone,et al. Machine Learning for Fast Quadrupedal Locomotion , 2004, AAAI.

[2] Alex M. Andrew,et al. ROBOT LEARNING, edited by Jonathan H. Connell and Sridhar Mahadevan, Kluwer, Boston, 1993/1997, xii+240 pp., ISBN 0-7923-9365-1 (Hardback, 218.00 Guilders, $120.00, £89.95). , 1999, Robotica (Cambridge. Print).

[3] Masahiro Fujita,et al. Evolving robust gaits with AIBO , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[4] Stephan K. Chalup,et al. Techniques for Improving Vision and Locomotion on the Sony AIBO Robot , 2003 .

[5] Peter Stone,et al. Autonomous Learning of Stable Quadruped Locomotion , 2006, RoboCup.

[6] William T. B. Uther,et al. Automatic Gait Optimisation for Quadruped Robots , 2003 .

[7] Andrea Cherubini,et al. Layered Learning for a Soccer Legged Robot Helped with a 3D Simulator , 2008, RoboCup.

[8] Yishay Mansour,et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.

[9] Peter Stone,et al. Layered learning in multiagent systems - a winning approach to robotic soccer , 2000, Intelligent robotics and autonomous agents.

[10] Nicholas K. Jong,et al. The UT Austin Villa 2004 RoboCup Four-Legged Team: Coming of Age , .

[11] Peter Stone,et al. Generalized model learning for Reinforcement Learning on a humanoid robot , 2010, 2010 IEEE International Conference on Robotics and Automation.

[12] Peter Stone,et al. Policy gradient reinforcement learning for fast quadrupedal locomotion , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[13] Javier Ruiz-del-Solar,et al. Learning to Kick the Ball Using Back to Reality , 2004, RoboCup.

[14] Manuela M. Veloso,et al. Learning and using models of kicking motions for legged robots , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[15] Manuela M. Veloso,et al. An evolutionary approach to gait learning for four-legged robots , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[16] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .

[17] Peter L. Bartlett,et al. Infinite-Horizon Policy-Gradient Estimation , 2001, J. Artif. Intell. Res..

[18] Ayumi Shinohara,et al. Autonomous Learning of Ball Trapping in the Four-Legged Robot League , 2006, RoboCup.

[19] Peter Stone,et al. Instance-Based Action Models for Fast Action Planning , 2007, RoboCup.

[20] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[21] Peter Stone,et al. The Chin Pinch: A Case Study in Skill Learning on a Legged Robot , 2006, RoboCup.

[22] Gregory S. Hornby,et al. Autonomous evolution of gaits with the Sony Quadruped Robot , 1999 .