Coevolution of heterogeneous multi-robot teams

Evolving multiple robots so that each robot acting independently can contribute to the maximization of a system level objective presents significant scientific challenges. For example, evolving multiple robots to maximize aggregate information in exploration domains (e.g., planetary exploration, search and rescue) requires coordination, which in turn requires the careful design of the evaluation functions. Additionally, where communication among robots is expensive (e.g., limited power or computation), the coordination must be achieved passively, without robots explicitly informing others of their states/intended actions. Coevolving robots in these situations is a potential solution to producing coordinated behavior, where the robots are coupled through their evaluation functions. In this work, we investigate coevolution in three types of domains: (i) where precisely n homogeneous robots need to perform a task; (ii) where n is the optimal number of homogeneous robots for the task; and (iii) where n is the optimal number of heterogeneous robots for the task. Our results show that coevolving robots with evaluation functions that are locally aligned with the system evaluation significantly improve performance over robots evolving using the system evaluation function directly, particularly in dynamic environments.

[1]  Stefano Nolfi,et al.  How to Evolve Autonomous Robots: Different Approaches in Evolutionary Robotics , 1994 .

[2]  Craig Boutilier,et al.  The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems , 1998, AAAI/IAAI.

[3]  Michael P. Wellman,et al.  Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm , 1998, ICML.

[4]  Kenneth A. De Jong,et al.  Modeling Variation in Cooperative Coevolution Using Evolutionary Game Theory , 2002, FOGA.

[5]  Risto Miikkulainen,et al.  Eugenic Evolution Utilizing A Domain Model , 2002, GECCO.

[6]  Michail G. Lagoudakis,et al.  Coordinated Reinforcement Learning , 2002, ICML.

[7]  Dynamics of Coalition Formation in Combinatorial Trading , 2003, IJCAI.

[8]  Xin Li,et al.  An Integrated Multilevel Learning Approach to Multiagent Coalition Formation , 2003, IJCAI.

[9]  Maja J. Mataric,et al.  Multi-robot task allocation: analyzing the complexity and optimality of key architectures , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[10]  R. Paul Wiegand,et al.  Improving Coevolutionary Search for Optimal Multiagent Behaviors , 2003, IJCAI.

[11]  David C. Parkes,et al.  An MDP-Based Approach to Online Mechanism Design , 2003, NIPS.

[12]  Kagan Tumer,et al.  Coordinating multi-rover systems: evaluation functions for dynamic and noisy environments , 2005, GECCO '05.

[13]  R. Paul Wiegand,et al.  Robustness in cooperative coevolution , 2006, GECCO '06.

[14]  R. Paul Wiegand,et al.  Biasing Coevolutionary Search for Optimal Multiagent Behaviors , 2006, IEEE Transactions on Evolutionary Computation.

[15]  Kagan Tumer,et al.  Distributed evaluation functions for fault tolerant multi-rover systems , 2006, GECCO '06.

[16]  Peter Stone,et al.  A multi-robot system for continuous area sweeping tasks , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[17]  Sarit Kraus,et al.  Enhancing MAS Cooperative Search Through Coalition Partitioning , 2007, IJCAI.

[18]  Wolfram Burgard,et al.  Robotics: Science and Systems II , 2007 .

[19]  Kagan Tumer,et al.  Efficient Evaluation Functions for Evolving Coordination , 2008, Evolutionary Computation.

[20]  Kagan Tumer,et al.  Analyzing and visualizing multiagent rewards in dynamic and stochastic domains , 2008, Autonomous Agents and Multi-Agent Systems.

[21]  Kenneth O. Stanley,et al.  Generative encoding for multiagent learning , 2008, GECCO '08.

[22]  Wolfram Burgard,et al.  Robotics: Science and Systems XV , 2010 .