Learning enabled cooperative agent behavior in an evolutionary and competitive environment

The proposed method is implemented in three steps: first, when a variation in environment is perceived, agents take appropriate actions. Second, the behaviors are stimulated and controlled through communication with other agents. Finally, the most frequently stimulated behavior is adopted as a group behavior strategy. In this paper, two different reward models, reward model 1 and reward model 2, are applied. Each reward model is designed to consider the reinforcement or constraint of behaviors. In competitive agent environments, the behavior considered to be advantageous is reinforced as adding reward values. On the contrary, the behavior considered to be disadvantageous is constrained by reducing the reward values. The validity of this strategy is verified through simulation.

[1]  Michael P. Georgeff,et al.  A Theory of Action for MultiAgent Planning , 1984, AAAI.

[2]  Nicholas R. Jennings,et al.  Co-ordination in Multi-Agent Systems , 1997, Software Agents and Soft Computing.

[3]  David R. Jefferson,et al.  An Artificial Neural Network Representation for Artificial Organisms , 1990, PPSN.

[4]  Sandip Sen,et al.  Learning and Adaptation in Multi-Agent Systems , 2006 .

[5]  Stefano Nolfi,et al.  Co-evolving predator and prey robots , 1998, Artificial Life.

[6]  Marco Dorigo,et al.  Ant system: optimization by a colony of cooperating agents , 1996, IEEE Trans. Syst. Man Cybern. Part B.

[7]  L. Darrell Whitley,et al.  Adding Learning to the Cellular Development of Neural Networks: Evolution and the Baldwin Effect , 1993, Evolutionary Computation.

[8]  Manuela M. Veloso,et al.  Multiagent Systems: A Survey from a Machine Learning Perspective , 2000, Auton. Robots.

[9]  Robert J. Collins,et al.  AntFarm: Towards Simulated Evolution , 2007 .

[10]  Nader Azarmi,et al.  Software Agents and Soft Computing Towards Enhancing Machine Intelligence , 1997, Lecture Notes in Computer Science.

[11]  C. Lee Giles,et al.  Talking Helps: Evolving Communicating Agents for the Predator-Prey Pursuit Problem , 2000, Artificial Life.

[12]  Mario Tokoro,et al.  An Adaptive Architecture for Modular Q-Learning , 1997, IJCAI.

[13]  Charles E. Taylor,et al.  Artificial Life II , 1991 .

[14]  David C. Parkes,et al.  Learning and Adaption in Multi-Agent Systems , 2006, Lecture Notes in Computer Science.

[15]  Victor R. Lesser,et al.  Cooperative Multiagent Systems: A Personal View of the State of the Art , 1999, IEEE Trans. Knowl. Data Eng..

[16]  J. Baldwin A New Factor in Evolution , 1896, The American Naturalist.

[17]  R. Collins Studies in artificial evolution , 1992 .

[18]  Jeffrey L. Elman,et al.  Learning and Evolution in Neural Networks , 1994, Adapt. Behav..

[19]  Shin Ishii,et al.  Multi-agent reinforcement learning: an approach based on the other agent's internal model , 2000, Proceedings Fourth International Conference on MultiAgent Systems.