Investigating reinforcement learning in multiagent coalition formation

In this paper we investigate the use of reinforcement learning to address the multiagent coalition formation problem in dynamic, uncertain, real-time, and noisy environments. To adapt to the complex environmental factors, we equip each agent with the case-based reinforcement learning ability which is the integration of case-based reasoning and reinforcement learning. The agent can use case-based reasoning to derive a coalition formation plan in a real-time manner based on the past experience, and then instantiate the plan adapting to the dynamic and uncertain environment with the reinforcement learning on coalition formation experience. In this paper we focus on describing multiple aspects of the application of reinforcement learning in multiagent coalition formation. We classify two types of reinforcement learning: case-oriented reinforcement learning and peerrelated reinforcement learning, corresponding to strategic, off-line learning scenario and tactical, online learning scenario respectively. An agent might learn about the others’ joint or individual behavior during coalition formation, as a result, we identify them as joint-behavior reinforcement learning and individual-behavior reinforcement learning. We embed the learning approach in a multi-phase coalition formation model and have implemented the approach.

[1]  Nicholas R. Jennings,et al.  Learning to select a coordination mechanism , 2002, AAMAS '02.

[2]  Gerhard Weiss,et al.  Multiagent Systems , 1999 .

[3]  Mario Tokoro,et al.  An Adaptive Architecture for Modular Q-Learning , 1997, IJCAI.

[4]  Daniel Kudenko,et al.  Learning in multi-agent systems , 2001, The Knowledge Engineering Review.

[5]  José M. Vidal,et al.  Learning in Multiagent Systems: An Introduction from a Game-Theoretic Perspective , 2003, Adaptive Agents and Multi-Agents Systems.

[6]  Victor R. Lesser,et al.  The Use of Meta-level Information in Learning Situation-Specific Coordination , 1997, IJCAI.

[7]  Jürgen Schmidhuber,et al.  Learning Team Strategies: Soccer Case Studies , 1998, Machine Learning.

[8]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[9]  Victor R. Lesser,et al.  Learning to Improve Coordinated Actions in Cooperative Distributed Problem-Solving Environments , 1998, Machine Learning.

[10]  Leen-Kiat Soh,et al.  Reflective Negotiating Agents for Real-Time Multisensor Target Tracking , 2001, IJCAI.

[11]  Michael P. Wellman,et al.  Online learning about other agents in a dynamic multiagent system , 1998, AGENTS '98.

[12]  Robert H. Crites,et al.  Multiagent reinforcement learning in the Iterated Prisoner's Dilemma. , 1996, Bio Systems.

[13]  MakingTuomas W. Sandholm Distributed Rational Decision Making 5.1 Introduction , 1999 .

[14]  Shin Ishii,et al.  Multi-agent reinforcement learning: an approach based on the other agent's internal model , 2000, Proceedings Fourth International Conference on MultiAgent Systems.

[15]  Xin Li,et al.  An Integrated Multilevel Learning Approach to Multiagent Coalition Formation , 2003, IJCAI.

[16]  Sarit Kraus,et al.  Methods for Task Allocation via Agent Coalition Formation , 1998, Artif. Intell..

[17]  Sandip Sen,et al.  Learning in multiagent systems , 1999 .

[18]  Victor R. Lesser,et al.  Coalition Formation among Bounded Rational Agents , 1995, IJCAI.

[19]  Tuomas Sandholm,et al.  Distributed rational decision making , 1999 .

[20]  Manuela M. Veloso,et al.  Multiagent Systems: A Survey from a Machine Learning Perspective , 2000, Auton. Robots.

[21]  Sandip Sen,et al.  Learning cases to resolve conflicts and improve group behavior , 1998, Int. J. Hum. Comput. Stud..