论文信息 - Learning Ball Acquisition on a Physical Robot

Learning Ball Acquisition on a Physical Robot

For a robot to learn to improve its performance based entirely on real-world environmental feedback, the robot’s behavior specification and learning algorithm must be constructed so as to enable data-efficient learning. Building upon previous work enabling a quadrupedal robot to learn a fast walk with all of the training done on the physical robot and with no human intervention [1], we demonstrate the ability of the same robot to learn a more high-level, goal-oriented task using the same methodology. In particular, we enable the robot to learn to capture (or “grasp”) a ball. The learning occurs over about three hours of robot run time and generates a behavior that is significantly better than a baseline hand-coded behavior. Our method is fully implemented and tested on a Sony Aibo ERS-7 robot.

Peggy Fidelman | Peggy Fidelman

[1] Peter Stone,et al. Machine Learning for Fast Quadrupedal Locomotion , 2004, AAAI.

[2] William H. Press,et al. Numerical Recipes in FORTRAN - The Art of Scientific Computing, 2nd Edition , 1987 .

[3] Shimon Edelman,et al. Learning to grasp using visual information , 1996, Proceedings of IEEE International Conference on Robotics and Automation.

[4] Thomas Röfer,et al. Evolutionary Gait-Optimization Using a Fitness Function Based on Proprioception , 2004, RoboCup.

[5] Manuela M. Veloso,et al. Learning and using models of kicking motions for legged robots , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[6] Vijay Kumar,et al. Robotic grasping and contact: a review , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[7] F. A. Seiler,et al. Numerical Recipes in C: The Art of Scientific Computing , 1989 .

[8] Rodney A. Brooks,et al. Real Robots, Real Learning Problems , 1993 .

[9] Nicholas K. Jong,et al. The UT Austin Villa 2003 Four-Legged Team , 2003 .

[10] Peter Stone,et al. Policy gradient reinforcement learning for fast quadrupedal locomotion , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.