Incremental Learning of Control Knowledge for Improvement of Planning Efficiency and Plan Quality

General-purpose planners use domain-independent search heuristics to generate solutions for problems in a variety of different domains. However, as heuristics they are, there are situations in which these heuristics do not produce the expected effective guidance, and the planner performs inefficiently or obtains solutions of poor quality. Learning from experience can help to identify the particular situations for which the domain-independent heuristics need to be overridden. In this paper, we present a system, HAMLET, that learns control knowledge and incrementally refines it, allowing the planner not only to solve efficiently complex problems, but also generate solutions of good quality. We claim that incremental learning of control knowledge and consideration of the quality of the solutions are two fundamental research directions towards the goal of applying planning techniques to real-world problems. We show empirical results in a complex domain that show the promise of our approach to support our claims.

[1]  Jaime G. Carbonell,et al.  Control Knowledge to Improve Plan Quality , 1994, AIPS.

[2]  Pat Langley,et al.  Learning Effective Search Heuristics , 1983, IJCAI.

[3]  Manuela M. Veloso,et al.  Linkability: Examining Causal Link Commitments in Partial-order Planning , 1994, AIPS.

[4]  Subbarao Kambhampati,et al.  Explanation-Based Generalization of Partially Ordered Plans , 1991, AAAI.

[5]  Manuela M. Veloso,et al.  Incremental Learning of Control Knowledge for Nonlinear Problem Solving , 1994, ECML.

[6]  Oren Etzioni,et al.  DYNAMIC: A New Role for Training Problems in EBL , 1992, ML.

[7]  Raymond J. Mooney,et al.  Combining FOIL and EBG to Speed-up Logic Programs , 1993, IJCAI.

[8]  Ingrid Zukerman,et al.  Learning Search Control Rules for Planning: An Inductive Approach , 1991, ML.

[9]  David A. McAllester,et al.  Systematic Nonlinear Planning , 1991, AAAI.

[10]  Nils J. Nilsson,et al.  Artificial Intelligence , 1974, IFIP Congress.

[11]  David Ruby,et al.  Learning Episodes for Optimization , 1992, ML.

[12]  Tom M. Mitchell,et al.  Learning by experimentation: acquiring and refining problem-solving heuristics , 1993 .

[13]  Manuela Veloso Learning by analogical reasoning in general problem-solving , 1992 .

[14]  Prasad Tadepalli,et al.  Lazy ExplanationBased Learning: A Solution to the Intractable Theory Problem , 1989, IJCAI.

[15]  Neeraj Bhatanagar Learning by incomplete explanations of failures in recursive domains , 1992, ICML 1992.

[16]  Manuela Veloso,et al.  Multiple Target Concept Learning and Revision In Nonlinear Problem Solving , 1994 .

[17]  Jaime G. Carbonell,et al.  Learning effective search control knowledge: an explanation-based approach , 1988 .

[18]  David Chapman,et al.  Planning for Conjunctive Goals , 1987, Artif. Intell..

[19]  Oren Etzioni,et al.  PRODIGY4.0: The Manual and Tutorial , 1992 .

[20]  Manuela Veloso Nonlinear problem solving using intelligent casual-commitment , 1989 .

[21]  Oren Etzioni,et al.  Why EBL Produces Overly-Specific Knowledge: A Critique of the PRODIGY Approaches , 1992, ML.

[22]  Manuela M. Veloso,et al.  The Need for Different Domain-independent Heuristics , 1994, AIPS.

[23]  David Warren,et al.  Warplan: a system for generating plans , 1974 .