论文信息 - Envelope-based Planning in Relational MDPs

Envelope-based Planning in Relational MDPs

A mobile robot acting in the world is faced with a large amount of sensory data and uncertainty in its action outcomes. Indeed, almost all interesting sequential decision-making domains involve large state spaces and large, stochastic action sets. We investigate a way to act intelligently as quickly as possible in domains where finding a complete policy would take a hopelessly long time. This approach, Relational Envelope-based Planning (REBP) tackles large, noisy problems along two axes. First, describing a domain as a relational MDP (instead of as an atomic or propositionally-factored MDP) allows problem structure and dynamics to be captured compactly with a small set of probabilistic, relational rules. Second, an envelope-based approach to planning lets an agent begin acting quickly within a restricted part of the full state space and to judiciously expand its envelope as resources permit.

Leslie Pack Kaelbling | Natalia Hernandez-Gardiol | L. Kaelbling | Natalia Hernandez-Gardiol

[1] Leslie Pack Kaelbling,et al. Planning under Time Constraints in Stochastic Domains , 1993, Artif. Intell..

[2] Blai Bonet. High-Level Planning and Control with Incomplete Information Using POMDPs Hdctor Geffner and , 2003 .

[3] Craig Boutilier,et al. Symbolic Dynamic Programming for First-Order MDPs , 2001, IJCAI.

[4] David E. Smith,et al. Extending Graphplan to handle uncertainty and sensing actions , 1998, AAAI 1998.

[5] Carlos Guestrin,et al. Generalizing plans to new environments in relational MDPs , 2003, IJCAI 2003.

[6] Jesse Hoey,et al. SPUDD: Stochastic Planning using Decision Diagrams , 1999, UAI.

[7] Kurt Driessens,et al. Speeding Up Relational Reinforcement Learning through the Use of an Incremental First Order Decision Tree Learner , 2001, ECML.

[8] Craig A. Knoblock,et al. Combining the Expressivity of UCPOP with the Efficiency of Graphplan , 1997, ECP.

[9] Robert Givan,et al. Inductive Policy Selection for First-Order MDPs , 2002, UAI.

[10] Avrim Blum,et al. Fast Planning Through Planning Graph Analysis , 1995, IJCAI.

[11] Daniel S. Weld. Recent Advances in AI Planning , 1999, AI Mag..

[12] Bernhard Nebel,et al. Extending Planning Graphs to an ADL Subset , 1997, ECP.

[13] Bernhard Nebel,et al. Ignoring Irrelevant Facts and Operators in Plan Generation , 1997, ECP.

[14] John Langford,et al. Probabilistic Planning in the Graphplan Framework , 1999, ECP.