Multiagent Teamwork: Hybrid Approaches

Within the multiagent community today, we see at least four competing approaches to building multiagent systems: belief-desire-intention (BDI), distributed constraint optimization (DCOP), distributed POMDPs, and auctions or game-theoretic methods. While there is exciting progress within each approach, there is a lack of cross-cutting research. This article highlights hybrid techniques for multiagent teamwork developed by the TEAMCORE research group, which for the past decade has focused on building agent teams in complex, dynamic domains. Our early work was inspired by BDI; here we present an overview of recent research that uses DCOPs and distributed POMDPs in building agent teams. Although DCOP and distributed POMDP algorithms provide promising results on their own, hybrid approaches allow us to use the complementary strengths of different techniques to create algorithms that perform better than either of their component algorithms alone. For example, in the BDI-POMDP hybrid approach, BDI team plans are exploited to improve POMDP tractability, and POMDPs are used to improve BDI team plan performance.
