Adaptive agent tracking in real-world multiagent domains: a preliminary report

Intelligent interaction in multi-agent domains frequently requires an agent to track other agents' mental states: their current goals, beliefs and intentions. Accuracy in thisagent-trackingtask is critically dependent on the accuracy of the tracker's (tracking agent's) model of the trackee (tracked agent). Unfortunately, in real-world situations, model imperfections arise due to the tracker's resource and information constraints, as well as due to trackees' dynamic behavior modification. While such model imperfections are unavoidable, a tracker must nonetheless attempt to be adaptive in its agent tracking. This article identifies key issues in adaptive agent tracking and presents an approach called DEFT. At its core, DEFT is based on discrimination-based learning. The main idea is to identify the deficiency of a model based on tracking failures, and revise the model by using features that are critical in discriminating successful and failed tracking episodes. Because in real-world situations the set of candidate discriminating features is very large, DEFT relies on knowledge-based focusing to limit the discrimination to those features that it determines were relevant in successful tracking episodes?with an autonomous explanation capability as a major source of this knowledge. This article reports on experiments with an implementation of key aspects of DEFT in a complex synthetic air-to-air combat domain.

[1]  Wei-Min Shen,et al.  Autonomous learning from the environment , 1994 .

[2]  Barbara Hayes-Roth,et al.  Multiagent Collaboration in Directed Improvisation , 1997, ICMAS.

[3]  Christina L. Bouwens The DIS Vision: A Map to the Future of Distributed Simulation , 1993 .

[4]  W. Lewis Johnson,et al.  Agents that Learn to Explain Themselves , 1994, AAAI.

[5]  Milind Tambe Recursive Agent and Agent-Group Tracking in a Real-Time Dynamic Environment , 1995, ICMAS.

[6]  Milind Tambe,et al.  RESC: An Approach for Real-time, Dynamic Agent Tracking , 1995, IJCAI.

[7]  Milind Tambe,et al.  Architectures for Agents that Track Other Agents in Multi-Agent Worlds , 1995, ATAL.

[8]  H. Simon,et al.  Fitness Requirements for Scientific Theories Containing Recursive Theoretical Terms , 1993, The British Journal for the Philosophy of Science.

[9]  John R. Anderson,et al.  Cognitive Modeling and Intelligent Tutoring , 1990, Artif. Intell..

[10]  Henry A. Kautz,et al.  Generalized Plan Recognition , 1986, AAAI.

[11]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[12]  Steven Minton,et al.  Solving Large-Scale Constraint-Satisfaction and Scheduling Problems Using a Heuristic Repair Method , 1990, AAAI.

[13]  M. F. Mar,et al.  ModSAF Behavior Simulation and Control , 1993 .

[14]  Yolanda Gil,et al.  Efficient Domain-Independent Experimentation , 1993, ICML.

[15]  Ronald L. Rivest,et al.  Inference of finite automata using homing sequences , 1989, STOC '89.

[16]  Ronald L. Rivest,et al.  Inference of finite automata using homing sequences , 1989, STOC '89.

[17]  Masaru Ishii,et al.  Cooperation by observation: the framework and basic task patterns , 1994, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.

[18]  Manuela M. Veloso,et al.  Multiagent Systems: A Survey from a Machine Learning Perspective , 2000, Auton. Robots.

[19]  Milind Tambe,et al.  Tracking Dynamic Team Activity , 1996, AAAI/IAAI, Vol. 1.

[20]  Jerome Azarewicz,et al.  Plan Recognition for Airborne Tactical Decision Making , 1986, AAAI.

[21]  James F. Cremer,et al.  The Software Architecture for Scenario Control in the Iowa Driving Simulator , 1993 .

[22]  Wei-Mein Shen Learning Finite Automata Using Local Distinguishing Experiments , 1993, IJCAI.

[23]  Fei Song,et al.  Temporal Reasoning During Plan Recognition , 1991, AAAI.

[24]  Wei-Min Shen Complementary Discrimination Learning with Decision Lists , 1992, AAAI.

[25]  Anand S. Rao,et al.  Means-End Plan Recognition - Towards a Theory of Reactive Recognition , 1994, KR.

[26]  Robert L. Shaw,et al.  Fighter Combat: Tactics and Maneuvering , 1985 .

[27]  Ken Pimentel,et al.  Virtual reality - through the new looking glass , 1993 .

[28]  Richard Reviewer-Granger Unified Theories of Cognition , 1991, Journal of Cognitive Neuroscience.

[29]  金田 重郎,et al.  C4.5: Programs for Machine Learning (書評) , 1995 .

[30]  John E. Laird,et al.  Toward Incremental Knowledge Correction for Agents in Complex Environments , 1995, Machine Intelligence 15.

[31]  Dana Angluin,et al.  Learning Regular Sets from Queries and Counterexamples , 1987, Inf. Comput..

[32]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[33]  Blake Darryl Ward,et al.  ET-SOAR: toward an ITS for theory-based representations , 1991 .

[34]  Wei-Min Shen,et al.  Complementary Discrimination Learning: A Duality Between Generalization and Discrimination , 1990, AAAI.

[35]  A. B. Loyall,et al.  Integrating Reactivity, Goals, and Emotion in a Broad Agent , 1992 .

[36]  Allen Newell,et al.  A Preliminary Analysis of the Soar Architecture as a Basis for General Intelligence , 1991, Artif. Intell..

[37]  Richard S. Sutton,et al.  Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming , 1990, ML.

[38]  Milind Tambe,et al.  Intelligent Agents for Interactive Simulation Environments , 1995, AI Mag..