Two decades of multiagent teamwork research: past, present, and future

This paper discusses some of the recent cooperative multiagent systems work in the TEAMCORE lab at the University of Southern California. Based in part on an invited talk at the CARE 2010 workshop, we highlight how and why execution-time reasoning has been supplementing, or replacing, planning-time reasoning in such systems.

[1]  Makoto Yokoo,et al.  Adopt: asynchronous distributed constraint optimization with quality guarantees , 2005, Artif. Intell..

[2]  Milind Tambe,et al.  Taking DCOP to the real world: efficient complete solutions for distributed multi-event scheduling , 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, 2004. AAMAS 2004..

[3]  Makoto Yokoo,et al.  When should there be a "Me" in "Team"?: distributed multi-agent optimization under uncertainty , 2010, AAMAS.

[4]  Makoto Yokoo,et al.  Taming Decentralized POMDPs: Towards Efficient Policy Computation for Multiagent Settings , 2003, IJCAI.

[5]  Milind Tambe,et al.  Solving Multiagent Networks using Distributed Constraint Optimization , 2008, AI Mag..

[6]  Milind Tambe,et al.  Exploiting Coordination Locales in Distributed POMDPs via Social Model Shaping , 2009, ICAPS.

[7]  R. Sun Cognition and Multi-Agent Interactions: From Cognitive Modeling to Social Simulation , 2005 .

[8]  Milind Tambe,et al.  Robust Agent Teams via Socially-Attentive Monitoring , 2000, J. Artif. Intell. Res..

[9]  Makoto Yokoo,et al.  Networked Distributed POMDPs: A Synergy of Distributed Constraint Optimization and POMDPs , 2005, IJCAI.

[10]  Nicholas R. Jennings,et al.  Decentralised coordination of low-power embedded devices using the max-sum algorithm , 2008, AAMAS.

[11]  Milind Tambe,et al.  The Communicative Multiagent Team Decision Problem: Analyzing Teamwork Theories and Models , 2011, J. Artif. Intell. Res..

[12]  Leslie Pack Kaelbling,et al.  Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..

[13]  Milind Tambe,et al.  Towards Flexible Teamwork , 1997, J. Artif. Intell. Res..

[14]  Victor R. Lesser,et al.  Multi-agent policies: from centralized ones to decentralized ones , 2002, AAMAS '02.

[15]  Manuela M. Veloso,et al.  Reasoning about joint beliefs for execution-time communication decisions , 2005, AAMAS '05.

[16]  William P. Pierskalla,et al.  A survey of maintenance models: The control and surveillance of deteriorating systems , 1976 .

[17]  Makoto Yokoo,et al.  DCOPs meet the realworld: exploring unknown reward matrices with applications to mobile sensor networks , 2009, IJCAI 2009.

[18]  Hector J. Levesque,et al.  On Acting Together , 1990, AAAI.

[19]  Shlomo Zilberstein,et al.  Bounded Policy Iteration for Decentralized POMDPs , 2005, IJCAI.

[20]  C. Sidner,et al.  Plans for Discourse , 1988 .

[21]  Milind Tambe,et al.  Distributed Sensor Networks , 2003, Multiagent Systems, Artificial Societies, and Simulated Organizations.

[22]  Milind Tambe,et al.  Quality Guarantees on k-Optimal Solutions for Distributed Constraint Optimization Problems , 2007, IJCAI.

[23]  Feng Wu,et al.  Multi-Agent Online Planning with Communication , 2009, ICAPS.

[24]  Christian Lebiere,et al.  Cognition and Multi-Agent Interaction: From Cognitive Modeling to Social Simulation , 2006 .

[25]  Milind Tambe,et al.  Evolution of a Teamwork Model , 2005 .

[26]  Milind Tambe,et al.  Distributed Algorithms for DCOP: A Graphical-Game-Based Approach , 2004, PDCS.

[27]  Reid G. Simmons,et al.  Unsupervised learning of probabilistic models for robot navigation , 1996, Proceedings of IEEE International Conference on Robotics and Automation.

[28]  Milind Tambe,et al.  Distributed Sensor Networks: A Multiagent Perspective , 2003 .

[29]  Leslie Pack Kaelbling,et al.  Acting under uncertainty: discrete Bayesian models for mobile-robot navigation , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.

[30]  Jonathan P. Pearce,et al.  KOPT : Distributed DCOP Algorithm for Arbitrary k-optima with Monotonically Increasing Utility , 2007 .

[31]  Victor R. Lesser,et al.  Solving distributed constraint optimization problems using cooperative mediation , 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, 2004. AAMAS 2004..

[32]  Neil Immerman,et al.  The Complexity of Decentralized Control of Markov Decision Processes , 2000, UAI.

[33]  Philip R. Cohen,et al.  Intentions in Communication , 1992, Language.

[34]  Rong Yang,et al.  Teamwork and Coordination under Model Uncertainty in DEC-POMDPs , 2010, Interactive Decision Theory and Game Theory.

[35]  Milind Tambe,et al.  Asynchronous algorithms for approximate distributed constraint optimization with quality bounds , 2010, AAMAS.

[36]  Claudia V. Goldman,et al.  Solving Transition Independent Decentralized Markov Decision Processes , 2004, J. Artif. Intell. Res..