Cooperative behavior acquisition mechanism for a multi-robot system based on reinforcement learning in continuous space

This paper describes an approach to controlling an autonomous multi-robot system. One of the most important issues for this type of system is how to design an on-line autonomous behavior acquisition mechanism which is capable of developing each robot's role in an embedded environment. Our approach is applying reinforcement learning that uses Bayesian discrimination method for segmenting the continuous state and action spaces simultaneously. In addition to this, neural networks are provided for predicting the other robots' moves at the next time step in order to support the learning in a dynamic environment that originates from the other learning robots. The output signals are utilized as the sensory information for the reinforcement learning to increase the stability of the learning problem. A homogeneous multi-robot system is built for evaluation.

[1]  Kazuhiro Ohkura,et al.  Adaptive role development in a homogeneous connected robot group , 1999, IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028).

[2]  Seiji Yamada,et al.  Adaptive action selection without explicit communication for multi-robot box-pushing , 1999, Proceedings 1999 IEEE/RSJ International Conference on Intelligent Robots and Systems. Human and Environment Friendly Robots with High Intelligence and Emotional Quotients (Cat. No.99CH36289).

[3]  Richard S. Sutton,et al.  Reinforcement Learning , 1992, Handbook of Machine Learning.

[4]  Maja J. Mataric,et al.  Learning social behavior , 1997, Robotics Auton. Syst..

[5]  Minoru Asada,et al.  Learning from conceptual aliasing caused by direct teaching , 1999, IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028).

[6]  Manuela M. Veloso,et al.  Multiagent Systems: A Survey from a Machine Learning Perspective , 2000, Auton. Robots.

[7]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[8]  Kazuhiro Kosuge,et al.  Transportation of a single object by multiple decentralized-controlled nonholonomic mobile robots , 1999, Proceedings 1999 IEEE/RSJ International Conference on Intelligent Robots and Systems. Human and Environment Friendly Robots with High Intelligence and Emotional Quotients (Cat. No.99CH36289).

[9]  Toshio Fukuda,et al.  Micro autonomous robotic system and biologically inspired immune swarm strategy as a multi agent robotic system , 1995, Proceedings of 1995 IEEE International Conference on Robotics and Automation.

[10]  Tucker R. Balch,et al.  Motor Schema-Based Formation Control for Multiagent Robot Teams , 1995, ICMAS.

[11]  Andrew W. Moore,et al.  Memory-based Reinforcement Learning: Converging with Less Data and Less Real Time , 1993 .