Adaptive spatio-temporal organization in groups of robots

This paper presents experiments, in simulation, with a group of robots that improve their performance on a straightforward transportation task by using reinforcement learning to associate input states with a set of abstract behaviors. We show that the improvement in performance is a result of the group adapting its spatio-temporal organization to the given environment. Spatio-temporal adaptation is a general form of adaptation in that it can improve performance over a range of different tasks and environments. Hence it increases the general applicability and autonomy of robotic systems. Lastly, we present two communication strategies that improve this ability to adapt by generally improving learning rates for cooperative robots in highly dynamic domains.

[1]  Ming Tan,et al.  Multi-Agent Reinforcement Learning: Independent versus Cooperative Agents , 1997, ICML.

[2]  John S. Bay,et al.  Spatial self-organization in large populations of mobile robots , 1994, Proceedings of 1994 9th IEEE International Symposium on Intelligent Control.

[3]  Tucker R. Balch,et al.  Communication in reactive multiagent robotic systems , 1995, Auton. Robots.

[4]  Sandip Sen,et al.  Learning to Coordinate without Sharing Information , 1994, AAAI.

[5]  Maja J. Matarić Reducing Locality Through Communication in Distributed Multi-Agent Learning , 1996 .

[6]  Hitoshi Matsubara,et al.  Learning Cooperative Behavior in Multi-Agent Environment - A Case Study of Choice of Play-Plans in Soccer , 1996, PRICAI.

[7]  Maja J. Mataric,et al.  Learning social behavior , 1997, Robotics Auton. Syst..

[8]  Maja J. Mataric,et al.  Reinforcement Learning in the Multi-Robot Domain , 1997, Auton. Robots.

[9]  Maja J. Mataric,et al.  Using communication to reduce locality in distributed multiagent learning , 1997, J. Exp. Theor. Artif. Intell..

[10]  Thomas G. Dietterich Adaptive computation and machine learning , 1998 .

[11]  Tucker Balch,et al.  Reward and Diversity in Multirobot Foraging , 1999, IJCAI 1999.

[12]  Tucker R. Balch The impact of diversity on performance in multi-robot foraging , 1999, AGENTS '99.

[13]  Manuela M. Veloso,et al.  Task Decomposition, Dynamic Role Assignment, and Low-Bandwidth Communication for Real-Time Strategic Teamwork , 1999, Artif. Intell..

[14]  Dani Goldberg Design and Evaluation of Robust Behavior-Based Controllers for Distributed Multi-Robot Collection Tasks , 2001 .

[15]  Gaurav S. Sukhatme,et al.  Emergent bucket brigading: a simple mechanisms for improving performance in multi-robot constrained-space foraging tasks , 2001, AGENTS '01.

[16]  Gaurav S. Sukhatme,et al.  Most valuable player: a robot device server for distributed control , 2001, Proceedings 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems. Expanding the Societal Role of Robotics in the the Next Millennium (Cat. No.01CH37180).