Synonyms

Relational Dynamic Programming; Dynamic Programming for Relational Domains; Relational Value Iteration

Definition

Decision-theoretic planning aims at constructing a policy for acting in an uncertain environment that maximizes an agent's expected utility over a sequence of steps. For this task, Markov decision processes (MDPs) have become the standard model. However, classical dynamic programming algorithms for solving MDPs require explicit state and action enumeration, which is often impractical: the number of states and actions grows exponentially with the number of domain objects and relations. In contrast, symbolic dynamic programming (SDP) algorithms seek to avoid explicit state and action enumeration through a symbolic representation of an MDP and a corresponding symbolic derivation of its solution, such as a value function. In essence, SDP algorithms exploit the symbolic structure of the MDP representation to construct a minimal logical partition of the state space required to make all necessary value distinctions.
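The following minimal sketch illustrates the underlying idea on a hypothetical toy "unload all boxes" MDP (the domain, probabilities, and names are illustrative assumptions, not part of any standard SDP system). Ground value iteration enumerates all 2^k truth assignments to loaded(b), whereas a partition-based backup works only on the logical distinctions "exactly n boxes are still loaded," which is all the value function needs to represent; full first-order SDP constructs such partitions symbolically rather than by enumeration.

```python
# Sketch: ground value iteration vs. backups over a logical partition of states.
from itertools import chain, combinations

K, GAMMA, P_SUCCESS, EPS = 3, 0.9, 0.9, 1e-8

def powerset(items):
    return chain.from_iterable(combinations(items, r) for r in range(len(items) + 1))

# --- Ground MDP: one state per subset of still-loaded boxes (2^K states) ---
ground_states = [frozenset(s) for s in powerset(range(K))]

def ground_backup(V):
    new_V = {}
    for s in ground_states:
        reward = 1.0 if not s else 0.0
        if not s:                       # goal reached: absorbing state
            future = V[s]
        else:                           # try to unload one loaded box
            succ = s - {min(s)}
            future = P_SUCCESS * V[succ] + (1 - P_SUCCESS) * V[s]
        new_V[s] = reward + GAMMA * future
    return new_V

# --- Abstract MDP: one "case" per partition {exactly n boxes loaded} ---
def abstract_backup(v):
    new_v = {}
    for n in range(K + 1):
        reward = 1.0 if n == 0 else 0.0
        future = v[0] if n == 0 else P_SUCCESS * v[n - 1] + (1 - P_SUCCESS) * v[n]
        new_v[n] = reward + GAMMA * future
    return new_v

def iterate(backup, V):
    while True:
        new_V = backup(V)
        if max(abs(new_V[x] - V[x]) for x in V) < EPS:
            return new_V
        V = new_V

V_ground = iterate(ground_backup, {s: 0.0 for s in ground_states})
v_abs = iterate(abstract_backup, {n: 0.0 for n in range(K + 1)})

# Every ground state with the same number of loaded boxes gets the same value,
# i.e. the abstract partition makes exactly the value distinctions needed.
assert all(abs(V_ground[s] - v_abs[len(s)]) < 1e-6 for s in ground_states)
print({n: round(v, 4) for n, v in sorted(v_abs.items())})
```

Here the partition-based backup touches K + 1 abstract values instead of 2^K ground states; the final assertion merely confirms that the partition captures every value distinction, which is the property SDP exploits symbolically.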