Definition and Complexity of Some Basic Metareasoning Problems

In most real-world settings, due to limited time or other resources, an agent cannot perform all potentially useful deliberation and information-gathering actions. This leads to the metareasoning problem of selecting such actions. Decision-theoretic methods for metareasoning have been studied in AI, but there are few theoretical results on the complexity of metareasoning. We derive hardness results for three settings which most real metareasoning systems would have to encompass as special cases. In the first, the agent has to decide how to allocate its deliberation time across anytime algorithms running on different problem instances. We show this to be NP-complete. In the second, the agent has to (dynamically) allocate its deliberation or information-gathering resources across multiple actions that it has to choose among. We show this to be NP-hard even when evaluating each individual action is extremely simple. In the third, the agent has to (dynamically) choose a limited number of deliberation or information-gathering actions to disambiguate the state of the world. We show that this is NP-hard under a natural restriction, and PSPACE-hard in general.
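To make the first setting concrete, the following is a minimal sketch, not the paper's formulation or proof construction. It assumes each problem instance comes with a deterministic performance profile mapping an integer number of deliberation steps to a solution value, and that the agent wants to split an integer time budget to maximize the total value. The names `allocate_time`, `Profile`, and the example profiles are hypothetical. The dynamic program's running time grows with the numeric value of the budget, so it is only suitable for small illustrative instances.

```python
from typing import Callable, List, Sequence, Tuple

# Assumption: a performance profile maps deliberation steps to solution value.
Profile = Callable[[int], float]


def allocate_time(profiles: Sequence[Profile], budget: int) -> Tuple[float, List[int]]:
    """Return (best total value, steps given to each instance).

    best[b] holds the best total value obtainable from the instances
    processed so far when at most b steps are spent on them.
    """
    best = [0.0] * (budget + 1)
    picks: List[List[int]] = []  # picks[i][b] = steps given to instance i at budget b
    for profile in profiles:
        new_best = [float("-inf")] * (budget + 1)
        pick = [0] * (budget + 1)
        for b in range(budget + 1):
            for t in range(b + 1):
                value = best[b - t] + profile(t)
                if value > new_best[b]:
                    new_best[b] = value
                    pick[b] = t
        best = new_best
        picks.append(pick)

    # Walk backwards through the recorded choices to recover the allocation.
    allocation = [0] * len(profiles)
    remaining = budget
    for i in range(len(profiles) - 1, -1, -1):
        allocation[i] = picks[i][remaining]
        remaining -= allocation[i]
    return best[budget], allocation


# Hypothetical example profiles (illustrative only).
def saturating(t: int) -> float:
    return 3.0 * min(t, 2)          # improves for two steps, then flat


def threshold(t: int) -> float:
    return 10.0 if t >= 3 else 0.0  # worthless until three steps are spent


def linear(t: int) -> float:
    return 1.0 * t                  # steady small gain per step


if __name__ == "__main__":
    print(allocate_time([saturating, threshold, linear], 5))
    # prints (16.0, [2, 3, 0])
```

In this example the best plan spends three steps to clear the threshold profile and the remaining two on the saturating one, which a greedy rule based on the first step's marginal gain would miss.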
