Coordination of Large-Scale Multiagent Systems

Challenges arise when a group of cooperating agents is scaled to hundreds or thousands of members. In domains such as space exploration, military operations, and disaster response, groups of this size (or larger) are required to achieve extremely complex, distributed goals. To achieve those goals effectively and efficiently, members of a group need to follow a cohesive joint course of action while remaining flexible to unforeseen developments in the environment. Coordination of Large-Scale Multiagent Systems provides extensive coverage of the latest research and novel solutions being developed in the field. It describes specific systems, such as SERSE and WIZER, as well as general approaches based on game theory, optimization, and other more theoretical frameworks. It will be of interest to researchers in academia and industry, as well as to advanced-level students.
