On-line Learning of an Object Manipulation Behavior for Legged Robots

In this work we address on-line learning of behaviors for legged robots. We present a m ethodology for generating and adapting object manipulation behaviors using reinforcement learning. An AIBO four-legged robot carries cylinders to a desired position by pushing them with its frontal legs. The movement of a cylinder depends on where the robot exerts forces on it, and upon several physical characteristics (geometry, weight, floor roughness, etc.). Initially the robot doesn't know this dynamics and it learns it from its own experience. By continuously evaluating the action that minimizes the ti me to achieve the goal, and using a global configuration fitness function, the robots can complete the target configuration, and learn the tradeoff between pushing a given cylinder on a given position and moving itself to a new pushing position. Successful results on the application of this methodology are presented.

[1]  Iván Sarmiento,et al.  UChile1 2004 Team Description Paper , 2004 .

[2]  Andres El-Fakdi,et al.  Semi-online neural-Q/spl I.bar/leaming for real-time robot learning , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[3]  Minoru Asada,et al.  Purposive behavior acquisition for a real robot by vision-based reinforcement learning , 1995, Machine Learning.

[4]  Stefano Nolfi,et al.  Evolutionary Robotics: The Biology, Intelligence, and Technology of Self-Organizing Machines , 2000 .

[5]  Yoram Koren,et al.  Potential field methods and their inherent limitations for mobile robot navigation , 1991, Proceedings. 1991 IEEE International Conference on Robotics and Automation.