Paper information - A Framework for Decision-Theoretic Planning I: Combining the Situation Calculus, Conditional Plans, Probability and Utility

This paper shows how we can combine logical representations of actions and decision theory in a manner that seems natural for both. In particular, we assume an axiomatization of the domain in terms of the situation calculus, using what is essentially Reiter's solution to the frame problem, in terms of the completion of the axioms defining the state change. Uncertainty is handled in terms of the independent choice logic, which allows for independent choices and a logic program that gives the consequences of the choices. Part of the consequences is a specification of the utility of (final) states. The robot adopts robot plans, similar to the GOLOG programming language. Within this logic, we can define the expected utility of a conditional plan, based on the axiomatization of the actions, the uncertainty and the utility. The 'planning' problem is to find the plan with the highest expected utility. This is related to recent structured representations for POMDPs; here we use stochastic situation calculus rules to specify the state transition function and the reward/value function. Finally, we show that with stochastic frame axioms, action representations in probabilistic STRIPS are exponentially larger than those using the representation proposed here.
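The notion of the expected utility of a conditional plan can be illustrated with a small sketch. It is not the paper's independent choice logic or its situation calculus axioms: it simply enumerates every combination of independent choices, runs a plan deterministically in each resulting world, and weights the utility of the final state by the world's probability. The door-and-sensor domain, the numbers, and all names (`CHOICES`, `COST`, `sense_locked`, `PLANS`, and so on) are hypothetical and chosen only for illustration.

```python
from itertools import product

# Hypothetical independent choices: each alternative maps outcomes to probabilities.
CHOICES = {
    "door": {"locked": 0.3, "unlocked": 0.7},
    "sensor": {"accurate": 0.9, "faulty": 0.1},
}

# Hypothetical action costs.
COST = {"unlock": 2.0, "open": 1.0}


def worlds(choices):
    """Yield (world, probability) for each selection of one outcome per alternative."""
    names = list(choices)
    for combo in product(*(choices[n].items() for n in names)):
        world = {n: outcome for n, (outcome, _) in zip(names, combo)}
        prob = 1.0
        for _, p in combo:
            prob *= p  # alternatives are independent, so probabilities multiply
        yield world, prob


def sense_locked(world):
    """Noisy observation: reports the truth only when the sensor is accurate."""
    truth = world["door"] == "locked"
    return truth if world["sensor"] == "accurate" else not truth


def run(plan, world):
    """Execute a conditional plan in one world and return the actions taken.

    A plan is None (stop), ("do", action, rest), or
    ("if", observation_fn, then_plan, else_plan).
    """
    actions = []
    while plan is not None:
        if plan[0] == "do":
            actions.append(plan[1])
            plan = plan[2]
        else:  # "if": branch on what the agent observes in this world
            plan = plan[2] if plan[1](world) else plan[3]
    return actions


def utility(actions, world):
    """Utility of the final state: reward 10 if the door ends up opened, minus costs."""
    unlocked = world["door"] == "unlocked"
    opened = False
    cost = 0.0
    for action in actions:
        cost += COST[action]
        if action == "unlock":
            unlocked = True
        elif action == "open" and unlocked:
            opened = True
    return (10.0 if opened else 0.0) - cost


def expected_utility(plan):
    """Sum over worlds of P(world) times the utility the plan achieves there."""
    return sum(p * utility(run(plan, w), w) for w, p in worlds(CHOICES))


OPEN = ("do", "open", None)
UNLOCK_THEN_OPEN = ("do", "unlock", OPEN)
PLANS = {
    "just_open": OPEN,
    "always_unlock": UNLOCK_THEN_OPEN,
    "sense_then_act": ("if", sense_locked, UNLOCK_THEN_OPEN, OPEN),
}

# The "planning" problem in this toy setting: pick the plan with highest expected utility.
best_name = max(PLANS, key=lambda name: expected_utility(PLANS[name]))
```

Under these assumed numbers, the plan that branches on the noisy observation scores higher than either unconditional plan, which is the kind of comparison the expected-utility definition makes possible. Exhaustive enumeration of worlds is used here only for clarity; it grows exponentially with the number of alternatives.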

David L. Poole
