Learning Ball Acquisition on a Physical Robot

For a robot to learn to improve its performance based entirely on real-world environmental feedback, the robot’s behavior specification and learning algorithm must be constructed so as to enable data-efficient learning. Building upon previous work enabling a quadrupedal robot to learn a fast walk with all of the training done on the physical robot and with no human intervention [1], we demonstrate the ability of the same robot to learn a more high-level, goal-oriented task using the same methodology. In particular, we enable the robot to learn to capture (or “grasp”) a ball. The learning occurs over about three hours of robot run time and generates a behavior that is significantly better than a baseline hand-coded behavior. Our method is fully implemented and tested on a Sony Aibo ERS-7 robot.

[1]  Peter Stone,et al.  Machine Learning for Fast Quadrupedal Locomotion , 2004, AAAI.

[2]  William H. Press,et al.  Numerical Recipes in FORTRAN - The Art of Scientific Computing, 2nd Edition , 1987 .

[3]  Shimon Edelman,et al.  Learning to grasp using visual information , 1996, Proceedings of IEEE International Conference on Robotics and Automation.

[4]  Thomas Röfer,et al.  Evolutionary Gait-Optimization Using a Fitness Function Based on Proprioception , 2004, RoboCup.

[5]  Manuela M. Veloso,et al.  Learning and using models of kicking motions for legged robots , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[6]  Vijay Kumar,et al.  Robotic grasping and contact: a review , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[7]  F. A. Seiler,et al.  Numerical Recipes in C: The Art of Scientific Computing , 1989 .

[8]  Rodney A. Brooks,et al.  Real Robots, Real Learning Problems , 1993 .

[9]  Nicholas K. Jong,et al.  The UT Austin Villa 2003 Four-Legged Team , 2003 .

[10]  Peter Stone,et al.  Policy gradient reinforcement learning for fast quadrupedal locomotion , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.