Paper information - Acquiring domain-specific planners by example

Acquiring domain-specific planners by example

Abstract: Intelligent problem solving requires the ability to select actions autonomously from a specific state to reach objectives. Planning algorithms provide approaches to look ahead and select a complete sequence of actions. Given a domain description consisting of the preconditions and effects of the actions the planner can take, an initial state, and a goal, a planning program returns a sequence of actions that transforms the initial state into a state in which the goal is satisfied. Classical planning research has addressed this problem in a domain-independent manner: the same algorithm generates a complete plan for any domain specification. This generality comes at a cost, which domain-independent planners incur either as high search effort or as tedious hand-coded domain knowledge. Previous approaches to efficient general-purpose planning have focused on reducing the search involved in an existing general-purpose planning algorithm. An interesting alternative is to use example plans in a particular domain to demonstrate how to solve problems in that domain, and to use that information to solve new problems without relying on a domain-independent planner. Others have used example plans for case-based planning, but the retrieval and adaptation mechanisms were still domain-independent and efficiency was still a concern. In my thesis, I propose to introduce algorithms to extract complex, repeating processes, in the form of domain-specific planning programs, from example plans. I will investigate the application of these learned programs to modelling agent preferences and choices. I will also investigate how the programs can be used, extended, and repaired dynamically as an agent encounters new problems and acquires new experience. Finally, I will compare the template-based planning paradigm to existing general-purpose and domain-specific planning programs with a full evaluation on new and existing planning domains.
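The classical planning setup the abstract describes (actions with preconditions and effects, an initial state, and a goal) can be illustrated with a minimal sketch. This is a plain breadth-first forward search over STRIPS-style actions, that is, the domain-independent baseline the abstract contrasts against, not the template-based method proposed in the thesis. The `Action` type, the `plan` function, and the two-block domain are all hypothetical names chosen for illustration.

```python
from collections import deque
from typing import NamedTuple


class Action(NamedTuple):
    """A STRIPS-style action (hypothetical representation for illustration)."""
    name: str
    preconditions: frozenset   # facts that must hold before the action
    add_effects: frozenset     # facts made true by the action
    delete_effects: frozenset  # facts made false by the action


def plan(initial_state, goal, actions):
    """Breadth-first forward search.

    Returns a list of action names that transforms the initial state
    into a state satisfying the goal, or None if no such plan exists.
    """
    start = frozenset(initial_state)
    goal = frozenset(goal)
    frontier = deque([(start, [])])
    visited = {start}
    while frontier:
        state, steps = frontier.popleft()
        if goal <= state:  # every goal fact holds in this state
            return steps
        for action in actions:
            if action.preconditions <= state:  # action is applicable
                successor = (state - action.delete_effects) | action.add_effects
                if successor not in visited:
                    visited.add(successor)
                    frontier.append((successor, steps + [action.name]))
    return None


# Hypothetical two-block domain: block A starts on block B; the goal is B on A.
actions = [
    Action(
        name="unstack-A-from-B",
        preconditions=frozenset({"on-A-B", "clear-A"}),
        add_effects=frozenset({"on-table-A", "clear-B"}),
        delete_effects=frozenset({"on-A-B"}),
    ),
    Action(
        name="stack-B-on-A",
        preconditions=frozenset({"clear-A", "clear-B", "on-table-A", "on-table-B"}),
        add_effects=frozenset({"on-B-A"}),
        delete_effects=frozenset({"clear-A", "on-table-B"}),
    ),
]
initial = {"on-A-B", "clear-A", "on-table-B"}
goal = {"on-B-A"}

result = plan(initial, goal, actions)
print(result)  # ['unstack-A-from-B', 'stack-B-on-A']
```

The search works for any domain expressed in this form, which is the generality the abstract refers to. Its cost is that the number of visited states can grow very quickly with problem size, which is the search effort that domain-specific planning programs are meant to avoid.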

Manuela Veloso | Elly Winner

[1] Daniel S. Weld,et al. UCPOP: A Sound, Complete, Partial Order Planner for ADL , 1992, KR.

[2] Douglas R. Smith,et al. KIDS - A Knowledge-Based Software Development System , 1991 .

[3] Manuela M. Veloso,et al. OBDD-based Universal Planning for Synchronized Agents in Non-Deterministic Domains , 2000, J. Artif. Intell. Res..

[4] Earl D. Sacerdoti. Planning in a hierarchy of abstraction spaces , 1973, IJCAI.

[5] David E. Smith,et al. Conditional Effects in Graphplan , 1998, AIPS.

[6] Yumi Iwasaki,et al. The concept and implementation of skeletal plans , 1985, Journal of Automated Reasoning.

[7] Henry A. Kautz,et al. Generalized Plan Recognition , 1986, AAAI.

[8] Mark A. Peot,et al. Suspending Recursion in Causal-Link Planning , 1996, AIPS.

[9] Ralph Bergmann. Knowledge Acquisition by Generating Skeletal Plans from Real World Cases , 1991, Contemporary Knowledge Engineering and Cognition.

[10] Charles W. Anderson,et al. Strategy Learning with Multilayer Connectionist Representations , 1987 .

[11] Manuela M. Veloso,et al. Planning and Learning by Analogical Reasoning , 1994, Lecture Notes in Computer Science.

[12] Manuela M. Veloso,et al. Prodigy/Analogy: Analogical Reasoning in General Problem Solving , 1993, EWCBR.

[13] Zohar Manna,et al. Fundamentals of Deductive Program Synthesis , 1992, IEEE Trans. Software Eng..

[14] Marcel Schoppers,et al. Universal Plans for Reactive Robots in Unpredictable Environments , 1987, IJCAI.

[15] Michael Anthony Bauer,et al. Programming by Examples , 1986, Artif. Intell..

[16] Robert S. Williams. Learning to Program by Examining and Modifying Cases , 1988, ML.

[17] James A. Hendler,et al. A Validation-Structure-Based Theory of Plan Modification and Reuse , 1992, Artif. Intell..

[18] David Leake,et al. Case-Based Reasoning: Experiences, Lessons and Future Directions , 1996 .

[19] Jaime G. Carbonell,et al. Learning effective search control knowledge: an explanation-based approach , 1988 .

[20] Daniel S. Weld. An Introduction to Least Commitment Planning , 1994, AI Mag..

[21] Craig A. Knoblock. Automatically Generating Abstractions for Planning , 1994, Artif. Intell..

[22] Richard Fikes,et al. Learning and Executing Generalized Robot Plans , 1993, Artif. Intell..

[23] Michael Luck,et al. Multi-Agent Systems and Applications , 2001, Lecture Notes in Computer Science.

[24] Manuela M. Veloso,et al. The Lumberjack Algorithm for Learning Linked Decision Forests , 2000, PRICAI.

[25] Mark A. Peot,et al. Postponing Threats in Partial-Order Planning , 1993, AAAI.

[26] Jaime G. Carbonell,et al. Towards a General Framework for Composing Disjunctive and Iterative Macro-operators , 1989, IJCAI.

[27] Paolo Traverso,et al. Strong Planning in Non-Deterministic Domains Via Model Checking , 1998, AIPS.

[28] Paolo Traverso,et al. Automatic OBDD-Based Generation of Universal Plans in Non-Deterministic Domains , 1998, AAAI/IAAI.

[29] Roni Khardon,et al. Learning Action Strategies for Planning Domains , 1999, Artif. Intell..

[30] Subbarao Kambhampati,et al. Explanation-Based Generalization of Partially Ordered Plans , 1991, AAAI.

[31] Richard C. Waters,et al. Approaches to Automatic Programming , 1993, Adv. Comput..

[32] Craig A. Knoblock. Learning Abstraction Hierarchies for Problem Solving , 1990, AAAI.

[33] Ute Schmid,et al. Inductive Synthesis of Functional Programs , 2003, Lecture Notes in Computer Science.

[34] Fritz Wysotzki,et al. Applying Inductive Program Synthesis to Macro Learning , 2000, AIPS.

[35] James A. Hendler,et al. Flexible reuse and modification in hierarchical planning: a validation structure-based approach , 1989 .

[36] Christer Bäckström. Finding Least Constrained Plans and Optimal Parallel Executions is Harder than We Thought , 1994 .

[37] Richard Fikes,et al. STRIPS: A New Approach to the Application of Theorem Proving to Problem Solving , 1971, IJCAI.

[38] Blai Bonet,et al. The AIPS-98 Planning Competition , 2000, AI Mag..

[39] J. Harris. Where is the child's environment? A group socialization theory of development. , 1995 .

[40] Steven Minton,et al. Selectively Generalizing Plans for Problem-Solving , 1985, IJCAI.

[41] Xuemei Wang,et al. Learning Planning Operators by Observation and Practice , 1994, AIPS.

[42] Manuela M. Veloso,et al. Analyzing Plans with Conditional Effects , 2002, AIPS.

[43] J. Shavlik. Acquiring Recursive and Iterative Concepts with Explanation-Based Learning , 1990, Machine Learning.

[44] M. Veloso,et al. Nonlinear Planning with Parallel Resource Allocation , 1990 .

[45] Edwin P. D. Pednault,et al. Formulating Multiagent, Dynamic-World Problems in the Classical Planning Framework , 1987 .

[46] S. Kambhampati,et al. Learning Explanation-Based Search Control Rules for Partial Order Planning , 1994, AAAI.

[47] Robert Wilensky,et al. A model for planning in complex situations , 1981 .

[48] Larry D. Pyeatt,et al. Decision Tree Function Approximation in Reinforcement Learning , 1999 .

[49] Pierre Régnier,et al. Complete Determination of Parallel Actions and Temporal Optimization in Linear Plans of Action , 1991, EWSP.

[50] Thomas G. Dietterich. What is machine learning? , 2020, Archives of Disease in Childhood.

[51] De,et al. Relational Reinforcement Learning , 2001, Encyclopedia of Machine Learning and Data Mining.

[52] Richard E. Korf,et al. Macro-Operators: A Weak Method for Learning , 1985, Artif. Intell..

[53] Jaime G. Carbonell,et al. Learning by experimentation: the operator refinement method , 1990 .

[54] Kristian J. Hammond,et al. CHEF: A Model of Case-Based Planning , 1986, AAAI.

[55] Manuela Veloso,et al. Automatically Acquiring Planning Templates from Example Plans , 2002 .

[56] Luc De Raedt,et al. Inductive Logic Programming: Theory and Methods , 1994, J. Log. Program..

[57] Pedro M. Domingos,et al. Programming by demonstration: a machine learning approach , 2001 .

[58] Avrim Blum,et al. Fast Planning Through Planning Graph Analysis , 1995, IJCAI.