论文信息 - High-Level Planning and Control with Incomplete Information Using POMDP's

High-Level Planning and Control with Incomplete Information Using POMDP's

We develop an approach to planning with incomplete information that is based on three elements: 1. a high-level language for describing the effects of actions on both the world and the agent’s beliefs that we call POMDP theories 2. a semantics that translates such theories into actual POMDPs 3. a real time dynamic programming algorithm that produces controllers from such POMDPs. We show that the resulting approach is not only clean and general but that is practical as well. We have implemented a shell that accepts POMDP theories and produces controllers, and have tested it over a number of problems. In this paper we present the main elements of the approach and report results for the ’omelette problem’ where the resulting controller exhibits a better performance than the handcrafted controller.

Blai Bonet

[1] Thomas Hedley Bruce Burrough. An approach to planning , 1953 .

[2] R. Bellman. Dynamic programming. , 1957, Science.

[3] Edward J. Sondik,et al. The optimal control of par-tially observable Markov processes , 1971 .

[4] Richard Fikes,et al. STRIPS: A New Approach to the Application of Theorem Proving to Problem Solving , 1971, IJCAI.

[5] Robert C. Moore. A Formal Theory of Knowledge and Action , 1984 .

[6] Robert C. Moore,et al. Formal Theories of the Commonsense World , 1985 .

[7] Y. Shoham. What is the frame problem , 1987 .

[8] 李幼升,et al. Ph , 1989 .

[9] Keiji Kanazawa,et al. A model for reasoning about persistence and causation , 1989 .

[10] Richard E. Korf,et al. Real-Time Heuristic Search , 1990, Artif. Intell..

[11] Raymond Reiter,et al. The Frame Problem in the Situation Calculus: A Simple Solution (Sometimes) and a Completeness Result for Goal Regression , 1991, Artificial and Mathematical Theory of Computation.