Team formation, i.e., allocating agents to roles within a team or subteams of a team, and the reorganization of a team upon team member failure or arrival of new tasks are critical aspects of teamwork. Despite significant progress, research in multiagent team formation and reorganization has failed to provide a rigorous analysis of the computational complexities of the approaches proposed or their degree of optimality. This shortcoming has hindered quantitative comparisons of approaches or their complexity-optimality tradeoffs, e.g., is the team reorganization approach in practical teamwork models such as STEAM optimal in most cases or only as an exception? To alleviate these difficulties, this paper presents R-COM-MTDP, a formal model based on decentralized communicating POMDPs, where agents explicitly take on and change roles to (re)form teams. R-COM-MTDP significantly extends an earlier COM-MTDP model, by analyzing how agents’ roles, local states and reward decompositions gradually reduce the complexity of its policy generation from NEXP-complete to PSPACE-complete to P-complete. We also encode key role reorganization approaches (e.g., STEAM) as R-COM-MTDP policies, and compare them with a locally optimal policy derivable in R-COM-MTDP, thus, theoretically and empirically illustrating the complexity-optimality tradeoffs.
[1]
Milind Tambe,et al.
Towards Flexible Teamwork
,
1997,
J. Artif. Intell. Res..
[2]
Yaser Al-Onaizan,et al.
Experiences Acquired in the Design of RoboCup Teams: A Comparison of Two Fielded Teams
,
2001,
Autonomous Agents and Multi-Agent Systems.
[3]
Neil Immerman,et al.
The Complexity of Decentralized Control of Markov Decision Processes
,
2000,
UAI.
[4]
Wei-Min Shen,et al.
A Dynamic Distributed Constraint Satisfaction Approach to Resource Allocation
,
2001,
CP.
[5]
T. Yoshikawa.
Decomposition of dynamic team decision problems
,
1978
.
[6]
Milind Tambe,et al.
Multiagent teamwork: analyzing the optimality and complexity of key theories and models
,
2002,
AAMAS '02.
[7]
Craig Boutilier,et al.
Planning, Learning and Coordination in Multiagent Decision Processes
,
1996,
TARK.
[8]
G. Tidhar,et al.
Guided Team Selection *
,
1996
.
[9]
Tsuneo Yoshikawa.
Decomposition of dynamic team decision problems
,
1978
.
[10]
Sarit Kraus,et al.
Collaborative Plans for Complex Group Action
,
1996,
Artif. Intell..
[11]
Kee-Eung Kim,et al.
Learning to Cooperate via Policy Search
,
2000,
UAI.