In this paper we describe an integrated multilevel learning approach to multiagent coalition formation in a real-time environment. In our domain, agents negotiate to form teams to solve joint problems. The agent that initiates a coalition shoulders the responsibility of overseeing and managing the formation process. A coalition formation process consists of two stages. During the initialization stage, the initiating agent identifies the candidates of its coalition, i.e., known neighbors that could help. The initiating agent negotiates with these candidates during the finalization stage to determine the neighbors that are willing to help. Since our domain is dynamic, noisy, and time-constrained, the coalitions are not optimal. However, our approach employs learning mechanisms at several levels to improve the quality of the coalition formation process. At a tactical level, we use reinforcement learning to identify viable candidates based on their potential utility to the coalition, and case-based learning to refine negotiation strategies. At a strategic level, we use distributed, cooperative casebased learning to improve general negotiation strategies. We have implemented the above three learning components and conducted experiments in multisensor target tracking and CPU re-allocation applications.
[1]
Enric Plaza,et al.
Cooperative Case-Based Reasoning
,
1996,
ECAI Workshop LDAIS / ICMAS Workshop LIOME.
[2]
Somesh Jha,et al.
Multi-Agent Coordination through Coalition Formation
,
1997,
ATAL.
[3]
Enric Plaza,et al.
Knowledge and Experience Reuse Through Communication Among Competent (Peer) Agents
,
1999,
Int. J. Softw. Eng. Knowl. Eng..
[4]
Sandip Sen,et al.
Searching for optimal coalition structures
,
2000,
Proceedings Fourth International Conference on MultiAgent Systems.
[5]
Onn Shehory,et al.
Coalition structure generation with worst case guarantees
,
2022
.
[6]
Leen-Kiat Soh,et al.
Reflective Negotiating Agents for Real-Time Multisensor Target Tracking
,
2001,
IJCAI.
[7]
Yaser Al-Onaizan,et al.
On being a teammate: experiences acquired in the design of RoboCup teams
,
1999,
AGENTS '99.
[8]
Tuomas Sandholm,et al.
Coalition Formation Processes with Belief Revision among Bounded-rational Self-interested Agents
,
1999,
J. Log. Comput..
[9]
M. V. Nagendra Prasad,et al.
Corporate Memories as Distributed Case Libraries
,
1996
.