论文信息 - Repairing Decision-Theoretic Policies Using Goal-Oriented Planning

Repairing Decision-Theoretic Policies Using Goal-Oriented Planning

In this paper we address the problem of how decision-theoretic policies can be repaired. This work is motivated by observations made in robotic soccer where decision-theoretic policies become invalid due to small deviations during execution; and repairing might pay off compared to re-planning from scratch. Our policies are generated with Readylog , a derivative of Golog based on the situation calculus, which combines programming and planning for agents in dynamic domains. When an invalid policy is detected, the world state is transformed into a pddl description and a state-of-the-art pddl planner is deployed to calculate the repair plan.

Alexander Ferrein | Gerhard Lakemeyer | Christoph Mies

[1] Dimitri P. Bertsekas,et al. Dynamic Programming: Deterministic and Stochastic Models , 1987 .

[2] Alexander Ferrein,et al. On-Line Decision-Theoretic Golog for Unpredictable Domains , 2004, KI.

[3] Gerhard Lakemeyer,et al. Towards an Integration of Planning and Golog , 2007, IJCAI 2007.

[4] Alexander Ferrein,et al. Approaching A Formal Soccer Theory FromBehaviour Specifi Cations In Robotic Soccer , 2008 .

[5] Henrik Grosskreutz,et al. Probabilistic Projection and Belief Update in the pGOLOG Framework , 2000, GI Jahrestagung.

[6] Maria Fox,et al. PDDL2.1: An Extension to PDDL for Expressing Temporal Planning Domains , 2003, J. Artif. Intell. Res..

[7] Craig A. Knoblock,et al. PDDL-the planning domain definition language , 1998 .

[8] J. McCarthy. Situations, Actions, and Causal Laws , 1963 .

[9] Gerhard Lakemeyer,et al. ccGolog -- A Logical Language Dealing with Continuous Change , 2003, Log. J. IGPL.

[10] Alexander Ferrein,et al. Using Golog for Deliberation and Team Coordination in Robotic Soccer , 2005, Künstliche Intell..

[11] Giuseppe De Giacomo,et al. Execution Monitoring of High-Level Robot Programs , 1998, KR.