Towards flexible multi-agent decision-making under time pressure

To perform rational decision-making, autonomous agents need considerable computational resources. In multi-agent settings, where other agents are present in the environment, these demands are even more severe. We investigate ways in which the agent's knowledge and the results of deliberative decision-making can be compiled to reduce the complexity of decision-making procedures and to save time in urgent situations. We use machine learning algorithms to compile decision-theoretic deliberations into condition-action rules on how to coordinate in a multi-agent environment. Using different learning algorithms, we endow a resource-bounded agent with a tapestry of decision-making tools, ranging from purely reactive to fully deliberative ones. The agent can then select a method depending on the time constraints of the particular situation. We also propose combining the decision-making tools so that, for example, more reactive methods serve as a pre-processing stage for the more accurate but slower deliberative ones. We validate our framework with experimental results in simulated coordinated defense. The experiments show that compiling the results of decision-making saves deliberation time while offering good performance in our multi-agent domain.
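
The selection and combination ideas in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the threat scores, the `Method` record, the time estimates, the toy expected-utility rule, and every function name below are hypothetical, and the sketch assumes that a slower method is the more accurate one.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence

Action = str
State = Dict[str, float]  # hypothetical: target name -> threat score


@dataclass
class Method:
    """A decision procedure with an assumed running-time estimate."""
    name: str
    est_seconds: float
    decide: Callable[[State, Sequence[Action]], Action]


def reactive(state: State, actions: Sequence[Action]) -> Action:
    """Condition-action rule: intercept the most threatening target."""
    return max(actions, key=lambda a: state[a])


def deliberative(state: State, actions: Sequence[Action]) -> Action:
    """Toy expected utility: assume the other agent most likely takes the
    most threatening target, so duplicating that choice is worth half."""
    likely_other = max(actions, key=lambda a: state[a])

    def expected_utility(a: Action) -> float:
        overlap_discount = 0.5 if a == likely_other else 1.0
        return state[a] * overlap_discount

    return max(actions, key=expected_utility)


def select_method(methods: Sequence[Method], budget: float) -> Method:
    """Pick the slowest (assumed most accurate) method that fits the budget;
    fall back to the fastest method when nothing fits."""
    feasible = [m for m in methods if m.est_seconds <= budget]
    if not feasible:
        return min(methods, key=lambda m: m.est_seconds)
    return max(feasible, key=lambda m: m.est_seconds)


def reactive_filter(state: State, actions: Sequence[Action], keep: int = 2) -> List[Action]:
    """Reactive pre-processing: keep only the `keep` highest-threat targets."""
    return sorted(actions, key=lambda a: state[a], reverse=True)[:keep]


def hybrid(state: State, actions: Sequence[Action]) -> Action:
    """Deliberate only over the actions that survive the reactive filter."""
    return deliberative(state, reactive_filter(state, actions))


state = {"A": 0.9, "B": 0.6, "C": 0.1}
actions = ["A", "B", "C"]
methods = [
    Method("reactive", 0.001, reactive),
    Method("deliberative", 1.0, deliberative),
]

urgent_choice = select_method(methods, budget=0.01).decide(state, actions)
relaxed_choice = select_method(methods, budget=5.0).decide(state, actions)
hybrid_choice = hybrid(state, actions)
```

With a tight budget the agent falls back on the reactive rule, and with a generous one it deliberates. The hybrid path shrinks the action set before deliberation, which is one way to read the pre-processing idea in the abstract.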

Sanguk Noh | Piotr J. Gmytrasiewicz
