Directed Fixed-Point Regression-Based Planning for Non-Deterministic Domains

We present a novel approach to fully-observable nondeter-ministic planning (FOND) that attempts to bridge the gap between symbolic fix-point computation and recent approaches based on forward heuristic search. Concretely, we formalize the relationship between symbolic and dynamic programming nondeterministic planners, and then exploit such connection to propose a novel family of planning algorithms that reasons over symbolic policies in a directed manner. By doing so, our proposal reasons over sets of states and executions in a succinct way (as done by symbolic planners) while biasing the reasoning with respect to the initial and goal states of the specific planning problem at hand (as done by heuristic planners). We show empirical results that prove this approach promising in settings where there is an intrinsic tension between plan efficiency and plan "robustness," a feature to be expected in nondeterministic domains.

[1]  Nils J. Nilsson,et al.  Principles of Artificial Intelligence , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[3]  Carmel Domshlak,et al.  Fault Tolerant Planning: Complexity and Compilation , 2013, ICAPS.

[4]  Robert E. Tarjan,et al.  Depth-First Search and Linear Graph Algorithms , 1972, SIAM J. Comput..

[5]  Robert P. Goldman,et al.  Using Classical Planners to Solve Nondeterministic Planning Problems , 2008, ICAPS.

[6]  Blai Bonet,et al.  Faster Heuristic Search Algorithms for Planning with Uncertainty and Full Feedback , 2003, IJCAI.

[7]  Bernhard Nebel,et al.  The FF Planning System: Fast Plan Generation Through Heuristic Search , 2011, J. Artif. Intell. Res..

[8]  Marco Pistore,et al.  Weak, strong, and strong cyclic planning via symbolic model checking , 2003, Artif. Intell..

[9]  Raymond Reiter,et al.  Knowledge in Action: Logical Foundations for Specifying and Implementing Dynamical Systems , 2001 .

[10]  Jussi Rintanen,et al.  Regression for Classical and Nondeterministic Planning , 2008, ECAI.

[11]  Robert P. Goldman,et al.  Conditional Linear Planning , 1994, AIPS.

[12]  Richard E. Korf,et al.  Depth-First Iterative-Deepening: An Optimal Admissible Tree Search , 1985, Artif. Intell..

[13]  Moshe Y. Vardi,et al.  Treewidth in Verification: Local vs. Global , 2005, LPAR.

[14]  Christian J. Muise,et al.  Improved Non-Deterministic Planning by Exploiting State Relevance , 2012, ICAPS.

[15]  Robert Givan,et al.  FF-Replan: A Baseline for Probabilistic Planning , 2007, ICAPS.

[16]  Paolo Traverso,et al.  Strong Cyclic Planning Revisited , 1999, ECP.

[17]  David H. D. Warren,et al.  Generating Conditional Plans and Programs , 1976, AISB.

[18]  Hector Geffner,et al.  Fair LTL Synthesis for Non-Deterministic Systems using Strong Cyclic Planners , 2013, IJCAI.

[19]  Farokh B. Bastani,et al.  Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence Simple and Fast Strong Cyclic Planning for Fully-Observable Nondeterministic Planning Problems �� , 2022 .

[20]  Malte Helmert,et al.  Concise finite-domain representations for PDDL planning tasks , 2009, Artif. Intell..

[21]  Blai Bonet,et al.  A Concise Introduction to Models and Methods for Automated Planning , 2013, A Concise Introduction to Models and Methods for Automated Planning.

[22]  Stefan Edelkamp,et al.  Solving Fully-Observable Non-deterministic Planning Problems via Translation into a General Game , 2009, KI.

[23]  Ugur Kuter,et al.  Incremental plan aggregation for generating policies in MDPs , 2010, AAMAS.

[24]  John G. Gibbons Knowledge in Action , 2001 .

[25]  Edmund M. Clarke,et al.  Design and Synthesis of Synchronization Skeletons Using Branching-Time Temporal Logic , 1981, Logic of Programs.

[26]  Jussi Rintanen,et al.  Complexity of Planning with Partial Observability , 2004, ICAPS.

[27]  Sylvie Thiébaux,et al.  Probabilistic planning vs replanning , 2007 .

[28]  Sebastian Sardiña,et al.  Behavior Composition as Fully Observable Non-Deterministic Planning , 2013, ICAPS.

[29]  Wheeler Ruml,et al.  Deadline-Aware Search Using On-Line Measures of Behavior , 2011, SOCS.

[30]  Mausam,et al.  Discovering hidden structure in factored MDPs , 2012, Artif. Intell..