Towards Flexible Teamwork in Persistent Teams: Extended Report

Teamwork is a critical capability in multi-agent environments. Many such environments require agents and agent teams to be persistent, i.e., to exist over long periods of time. Agents in such persistent teams are bound together by their long-term common interests and goals. This paper focuses on flexible teamwork in such persistent teams. While previous work has investigated flexible teamwork, persistent teams remain unexplored. One promising approach to flexible teamwork is model-based: agents are provided with general models of teamwork that explicitly specify their commitments in teamwork. Such models enable agents to reason autonomously about coordination. For persistent teams, however, such models may lead to coordination and communication actions that, while locally optimal, are highly problematic for the team's long-term goals. We present a decision-theoretic technique based on Markov decision processes that enables persistent teams to overcome these limitations of the model-based approach. In particular, agents reason about the expected team utilities of future team states that are projected to result from actions recommended by the teamwork model, as well as from lower-cost (or higher-cost) variations on these actions. To accommodate real-time constraints, this reasoning is done in an anytime fashion. Implemented examples from an analytic search tree and some real-world domains are presented.
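The kind of reasoning described above can be illustrated with a small sketch. It is not the paper's implementation: the states, actions, probabilities, and rewards below are hypothetical toy values, and the function names (`q_value`, `state_value`, `anytime_choice`) are assumptions introduced only for illustration. The sketch compares an action recommended by a teamwork model against a lower-cost variation by projecting expected team utility over future states of a small Markov decision process, deepening the lookahead one step at a time so that an answer is available whenever the reasoning is interrupted.

```python
from typing import Callable, Dict, List, Tuple

# (probability, next_state, immediate team reward); all values are hypothetical.
Outcome = Tuple[float, str, float]

# Toy MDP: the "recommended" action pays more immediately but leaves the
# team in a worse long-term state than the "cheaper_variant".
TRANSITIONS: Dict[Tuple[str, str], List[Outcome]] = {
    ("start", "recommended"): [(1.0, "drained", 4.0)],
    ("start", "cheaper_variant"): [(0.8, "healthy", 1.0), (0.2, "drained", 1.0)],
    ("healthy", "continue"): [(1.0, "healthy", 3.0)],
    ("drained", "continue"): [(1.0, "drained", -2.0)],
}


def actions(state: str) -> List[str]:
    """Actions available in a state of the toy MDP."""
    return [a for (s, a) in TRANSITIONS if s == state]


def q_value(state: str, action: str, depth: int) -> float:
    """Expected team utility of taking `action`, looking `depth` steps ahead."""
    total = 0.0
    for prob, nxt, reward in TRANSITIONS[(state, action)]:
        total += prob * (reward + state_value(nxt, depth - 1))
    return total


def state_value(state: str, depth: int) -> float:
    """Best expected utility obtainable from `state` within `depth` steps."""
    available = actions(state)
    if depth <= 0 or not available:
        return 0.0
    return max(q_value(state, a, depth) for a in available)


def anytime_choice(
    state: str,
    candidates: List[str],
    max_depth: int,
    should_stop: Callable[[], bool] = lambda: False,
) -> str:
    """Deepen the lookahead step by step; return the best action found so far."""
    best = candidates[0]  # fall back to the model's recommendation
    for depth in range(1, max_depth + 1):
        if should_stop():  # e.g. a real-time deadline has passed
            break
        best = max(candidates, key=lambda a: q_value(state, a, depth))
    return best


if __name__ == "__main__":
    options = ["recommended", "cheaper_variant"]
    for horizon in (1, 2, 3):
        print(horizon, anytime_choice("start", options, horizon))
```

In this toy setting a one-step lookahead favors the recommended action, while deeper lookahead favors the cheaper variation, which mirrors the gap between locally optimal choices and long-term team goals. The `should_stop` callback stands in for a real-time constraint; a fixed depth budget is used here only to keep the example deterministic.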
