Sequential Monte Carlo in Probabilistic Planning Reachability Heuristics

The current best conformant probabilistic planners encode the problem as a bounded length CSP or SAT problem. While these approaches can find optimal solutions for given plan lengths, they often do not scale for large problems or plan lengths. As has been shown in classical planning, heuristic search outperforms CSP/SAT techniques (especially when a plan length is not given a priori). The problem with applying heuristic search in probabilistic planning is that effective heuristics are as yet lacking. In this work, we apply heuristic search to conformant probabilistic planning by adapting planning graph heuristics developed for non-deterministic planning. We evaluate a straight-forward application of these planning graph techniques, which amounts to exactly computing the distribution over reachable relaxed planning graph layers. Computing these distributions is costly, so we apply Sequential Monte Carlo to approximate them. We demonstrate on several domains how our approach enables our planner to far out-scale existing (optimal) probabilistic planners and still find reasonable quality solutions.

[1]  L. Li,et al.  Engineering a Conformant Probabilistic Planner , 2011, J. Artif. Intell. Res..

[2]  John Langford,et al.  Probabilistic Planning in the Graphplan Framework , 1999, ECP.

[3]  Anne Condon,et al.  On the Undecidability of Probabilistic Planning and Infinite-Horizon Partially Observable Markov Decision Problems , 1999, AAAI/IAAI.

[4]  Andrew G. Barto,et al.  Learning to Act Using Real-Time Dynamic Programming , 1995, Artif. Intell..

[5]  Daniel Bryce,et al.  Heuristic Guidance Measures for Conformant Planning , 2004, ICAPS.

[6]  R. Brafman,et al.  Contingent Planning via Heuristic Forward Search witn Implicit Belief States , 2005, ICAPS.

[7]  Daniel Bryce,et al.  Planning Graph Heuristics for Belief Space Search , 2006, J. Artif. Intell. Res..

[8]  Mausam,et al.  Concurrent Probabilistic Temporal Planning , 2005, ICAPS.

[9]  Fabio Somenzi,et al.  CUDD: CU Decision Diagram Package Release 2.2.0 , 1998 .

[10]  NebelBernhard,et al.  The FF planning system , 2001 .

[11]  Michael L. Littman,et al.  Incremental Pruning: A Simple, Fast, Exact Method for Partially Observable Markov Decision Processes , 1997, UAI.

[12]  Avrim Blum,et al.  Fast Planning Through Planning Graph Analysis , 1995, IJCAI.

[13]  Daniel Bryce,et al.  Planning in belief space with a labelled uncertainty graph , 2004, AAAI 2004.

[14]  Timothy J. Robinson,et al.  Sequential Monte Carlo Methods in Practice , 2003 .

[15]  Ronen I. Brafman,et al.  Conformant planning via heuristic forward search: A new approach , 2004, Artif. Intell..

[16]  Daniel Bryce,et al.  Cost Sensitive Reachability Heuristics for Handling State Uncertainty , 2005, UAI.

[17]  Håkan L. S. Younes,et al.  VHPOP: Versatile Heuristic Partial Order Planner , 2003, J. Artif. Intell. Res..

[18]  Ivan Serina,et al.  Planning Through Stochastic Local Search and Temporal Action Graphs in LPG , 2003, J. Artif. Intell. Res..

[19]  Bernhard Nebel,et al.  Extending Planning Graphs to an ADL Subset , 1997, ECP.

[20]  Michael L. Littman,et al.  The Computational Complexity of Probabilistic Planning , 1998, J. Artif. Intell. Res..

[21]  David E. Smith,et al.  Conformant Graphplan , 1998, AAAI/IAAI.

[22]  Nathanael Hyafil,et al.  Conformant Probabilistic Planning via CSPs , 2003, ICAPS.

[23]  Sylvie Thiébaux,et al.  Prottle: A Probabilistic Temporal Planner , 2005, AAAI.

[24]  Sebastian Thrun,et al.  Monte Carlo POMDPs , 1999, NIPS.

[25]  Randal E. Bryant,et al.  Graph-Based Algorithms for Boolean Function Manipulation , 1986, IEEE Transactions on Computers.

[26]  Michael L. Littman,et al.  MAXPLAN: A New Approach to Probabilistic Planning , 1998, AIPS.

[27]  Nathanael Hyafil,et al.  Utilizing Structured Representations and CSP's in Conformant Probabilistic Planning , 2004, ECAI.

[28]  Håkan L. S. Younes,et al.  PPDDL 1 . 0 : An Extension to PDDL for Expressing Planning Domains with Probabilistic Effects , 2004 .

[29]  Bart Selman,et al.  Encoding Plans in Propositional Logic , 1996, KR.

[30]  Neil J. Gordon,et al.  A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking , 2002, IEEE Trans. Signal Process..

[31]  Subbarao Kambhampati,et al.  Improving Temporal Flexibility of Position Constrained Metric Temporal Plans , 2003, ICAPS.

[32]  Neil J. Gordon,et al.  A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking , 2002, IEEE Trans. Signal Process..

[33]  Nicholas Kushmerick,et al.  An Algorithm for Probabilistic Least-Commitment Planning , 1994, AAAI.

[34]  Bernhard Nebel,et al.  The FF Planning System: Fast Plan Generation Through Heuristic Search , 2011, J. Artif. Intell. Res..

[35]  Patrick Fabiani,et al.  Search Space Splitting in order to Compute Admissible Heuristics in Planning , 2003, PuK.

[36]  Randal E. Bryant,et al.  Efficient implementation of a BDD package , 1991, DAC '90.

[37]  Blai Bonet,et al.  Labeled RTDP: Improving the Convergence of Real-Time Dynamic Programming , 2003, ICAPS.

[38]  Jussi Rintanen,et al.  Expressive Equivalence of Formalisms for Planning with Sensing , 2003, ICAPS.