Failure Driven Dynamic Search Control for Partial Order Planners: An Explanation Based Approach

Abstract Given the intractability of domain independent planning, the ability to control the search of a planner is vitally important. One way of doing this involves learning from search failures. This paper describes SNLP + EBL, the first implementation of an explanation based search control rule learning framework for a partial order (plan-space) planner. We will start by describing the basic learning framework of SNLP + EBL. We will then concentrate on SNLP + EBL's ability to learn from failures, and describe the results of empirical studies which demonstrate the effectiveness of the search control rules SNLP + EBL learns using our method. We then demonstrate the generality of our learning methodology by extending it to UCPOP (Penberthy and Weld, 1992), a descendant of SNLP that allows for more expressive domain theories. The resulting system, UCPOP + EBL, is used to analyze and understand the factors influencing the effectiveness of EBL. Specifically, we analyze the effect of (i) expressive action representations, (ii) domain specific failure theories and (iii) sophisticated backtracking strategies on the utility of EBL. Through empirical studies, we demonstrate that expressive action representations allow for more explicit domain representations which in turn increase the ability of EBL to learn from analytical failures, and obviate the need for domain specific failure theories. We also explore the strong affinity between dependency directed backtracking and EBL in planning.

[1]  S. Kambhampati,et al.  Learning Explanation-Based Search Control Rules for Partial Order Planning , 1994, AAAI.

[2]  Mark A. Peot,et al.  Suspending Recursion in Causal-Link Planning , 1996, AIPS.

[3]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[4]  Nils J. Nilsson,et al.  Principles of Artificial Intelligence , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Steve Ankuo Chien An explanation-based learning approach to incremental planning , 1991 .

[6]  Edwin P. D. Pednault,et al.  Generalizing Nonlinear Planning to Handle Complex Goals and Actions with Context-Dependent Effects , 1991, IJCAI.

[7]  Steven Minton,et al.  Quantitative Results Concerning the Utility of Explanation-based Learning , 1988, Artif. Intell..

[8]  Tara A. Estlin,et al.  Hybrid learning of search control for partial-order planning , 1996 .

[9]  Oren Etzioni,et al.  A Structural Theory of Explanation-Based Learning , 1993, Artif. Intell..

[10]  David A. McAllester,et al.  GSAT and Dynamic Backtracking , 1994, KR.

[11]  Edwin P. D. Pednault,et al.  Synthesizing plans that contain actions with context‐dependent effects 1 , 1988, Comput. Intell..

[12]  S. Kambhampati,et al.  Universal classical planner: an algorithm for unifying state-space and plan-space planning , 1996 .

[13]  Anthony Barrett,et al.  Partial-Order Planning: Evaluating Possible Efficiency Gains , 1994, Artificial Intelligence.

[14]  Jaime G. Carbonell,et al.  Learning effective search control knowledge: an explanation-based approach , 1988 .

[15]  Rina Dechter,et al.  Dead-End Driven Learning , 1994, AAAI.

[16]  Subbarao Kambhampati,et al.  Learning search control rules for plan-space planners: factors affecting the performance , 1996 .

[17]  Subbarao Kambhampati,et al.  Explanation-Based Generalization of Partially Ordered Plans , 1991, AAAI.

[18]  Subbarao Kambhampati,et al.  A Unified Framework for Explanation-Based Generalization of Partially Ordered and Partially Instantiated Plans , 1994, Artif. Intell..

[19]  Subbarao Kambhampati,et al.  Admissible Pruning Strategies based on plan minimality for Plan-Space Planning , 1995, IJCAI.

[20]  Oren Etzioni,et al.  Statistical methods for analyzing speedup learning experiments , 2004, Machine Learning.

[21]  Steven Minton,et al.  Total-Order and Partial-Order Planning: A Comparative Analysis , 1994, J. Artif. Intell. Res..

[22]  Mark A. Peot,et al.  Postponing Threats in Partial-Order Planning , 1993, AAAI.

[23]  Oren Etzioni,et al.  Explanation-Based Learning: A Problem Solving Perspective , 1989, Artif. Intell..

[24]  Mark A. Peot,et al.  Threat-Removal Strategies for Partial-Order Planning , 1993, AAAI.

[25]  Martha E. Pollack,et al.  Least-Cost Flaw Repair: A Plan Refinement Strategy for Partial-Order Planning , 1994, AAAI.

[26]  David A. McAllester,et al.  Systematic Nonlinear Planning , 1991, AAAI.

[27]  Subbarao Kambhampati,et al.  Formalizing Dependency Directed Backtracking and Explanation Based Learning in Refinement Search , 1996, AAAI/IAAI, Vol. 1.

[28]  Oren Etzioni,et al.  DYNAMIC: A New Role for Training Problems in EBL , 1992, ML.

[29]  Gerald DeJong,et al.  COMPOSER: A Probabilistic Solution to the Utility Problem in Speed-Up Learning , 1992, AAAI.

[30]  Austin Tate,et al.  O-Plan: The open Planning Architecture , 1991, Artif. Intell..

[31]  Gerald DeJong,et al.  Constructing Simplified Plans Via Truth Criteria Approximation , 1994, AIPS.

[32]  Daniel S. Weld,et al.  UCPOP: A Sound, Complete, Partial Order Planner for ADL , 1992, KR.

[33]  Rina Dechter,et al.  Enhancement Schemes for Constraint Processing: Backjumping, Learning, and Cutset Decomposition , 1990, Artif. Intell..

[34]  Mark Drummond,et al.  Exploiting temporal coherence in nonlinear plan construction , 1988, Comput. Intell..

[35]  Manuela M. Veloso,et al.  Incremental Learning of Control Knowledge for Nonlinear Problem Solving , 1994, ECML.

[36]  Edward P. K. Tsang,et al.  Foundations of constraint satisfaction , 1993, Computation in cognitive science.

[37]  Subbarao Kambhampati,et al.  Planning as Refinement Search: A Unified Framework for Evaluating Design Tradeoffs in Partial-Order Planning , 1995, Artif. Intell..

[38]  Norman M. Sadeh,et al.  Backtracking Techniques for the Job Shop Scheduling Constraint Satisfaction Problem , 1995, Artif. Intell..

[39]  Brian Drabble,et al.  Third International Conference on Artificial Intelligence Planning Systems (AIPS-96) , 1997, AI Mag..