Towards flexible multi-agent decision-making under time pressure

To perform rational decision-making, autonomous agents need considerable computational resources. In multi-agent settings, when other agents are present in the environment, these demands are even more severe. We investigate ways in which the agent's knowledge and the results of deliberative decision-making can be compiled to reduce the complexity of decision-making procedures and to save time in urgent situations. We use machine learning algorithms to compile decision-theoretic deliberations into condition-action rules on how to coordinate in a multi-agent environment. Using different learning algorithms, we endow a resource-bounded agent with a tapestry of decision making tools, ranging from purely reactive to fully deliberative ones. The agent can then select a method depending on the time constraints of the particular situation. We also propose combining the decision-making tools, so that, for example, more reactive methods serve as a pre-processing stage to the more accurate but slower deliberative decision-making ones. We validate our framework with experimental results in simulated coordinated defense. The experiments show that compiling the results of decision-making saves deliberation time while offering good performance in our multi-agent domain.

[1]  Eric Horvitz,et al.  Reasoning under Varying and Uncertain Resource Constraints , 1988, AAAI.

[2]  Stuart J. Russell,et al.  Principles of Metareasoning , 1989, Artif. Intell..

[3]  Devika Subramanian,et al.  Provably Bounded Optimal Agents , 1993, IJCAI.

[4]  R. Mike Cameron-Jones,et al.  Efficient top-down induction of logic programs , 1994, SGAR.

[5]  Eric Horvitz,et al.  Reflection and Action Under Scarce Resources: Theoretical Principles and Empirical Study , 1989, IJCAI.

[6]  H. Simon,et al.  Models of Bounded Rationality: Empirically Grounded Economic Reason , 1997 .

[7]  Sanguk Noh,et al.  Rational communicative behavior in anti-air defense , 1998, Proceedings International Conference on Multi Agent Systems (Cat. No.98EX160).

[8]  Stuart J. Russell Execution Architectures and Compilation , 1989, IJCAI.

[9]  Shlomo Zilberstein,et al.  Optimal Composition of Real-Time Systems , 1996, Artif. Intell..

[10]  Eithan Ephrati,et al.  Deriving Multi-Agent Coordination through Filtering Strategies , 1995, IJCAI.

[11]  Piotr J. Gmytrasiewicz,et al.  On Reasoning About Other Agents , 1995, ATAL.

[12]  Sanguk Noh,et al.  Agent Modeling in Antiair Defense , 1997 .

[13]  David J. Israel,et al.  Plans and resource‐bounded practical reasoning , 1988, Comput. Intell..

[14]  John Fox,et al.  Symbolic Decision Theory and Autonomous Systems , 1991, UAI 1991.

[15]  Anand S. Rao,et al.  BDI Agents: From Theory to Practice , 1995, ICMAS.

[16]  Mark S. Boddy,et al.  An Analysis of Time-Dependent Planning , 1988, AAAI.

[17]  Robert H. M. Macfadzean Surface-Based Air Defense System Analysis , 2000 .