The role of self-models in learning to plan

We argue that in order to learn to plan effectively, an agent needs an explicit model of its own planning and plan execution processes. Given such a model, the agent can pinpoint the elements of these processes that are responsible for an observed failure to perform as expected, which in turn enables the formulation of a repair designed to ensure that similar failures do not occur in the future. We have constructed simple models of a number of important components of an intentional agent, including threat detection, execution scheduling, and projection, and applied them to learning within the context of competitive games such as chess and checkers.

[1]  Michael Freed,et al.  A model-based approach to the construction of adaptive case-based planning systems , 1991 .

[2]  Gregg Collins,et al.  An adaptive model of decision-making in planning , 1989, IJCAI 1989.

[3]  Brian C. Williams,et al.  Diagnosing Multiple Faults , 1987, Artif. Intell..

[4]  Kristian J. Hammond,et al.  Opportunistic memory , 1989, IJCAI 1989.

[5]  Kristian J. Hammond,et al.  Case-Based Planning: Viewing Planning as a Memory Task , 1989 .

[6]  Gerald Jay Sussman,et al.  A Computer Model of Skill Acquisition , 1975 .

[7]  Jaime G. Carbonell,et al.  Derivational analogy: a theory of reconstructive problem solving and expertise acquisition , 1993 .

[8]  Robert James Firby,et al.  Adaptive execution in complex dynamic worlds , 1989 .

[9]  L. Birnbaum,et al.  The transfer of experience across planning domains through the acquisition of abstract strategies , 1988 .

[10]  David E. Wilkins,et al.  Domain-Independent Planning: Representation and Plan Generation , 1984, Artif. Intell..

[11]  Janet L. Kolodner Capitalizing on Failure through Case-Based Inference , 1987 .

[12]  Richard Fikes,et al.  STRIPS: A New Approach to the Application of Theorem Proving to Problem Solving , 1971, IJCAI.

[13]  Gerald J. Sussman,et al.  Forward Reasoning and Dependency-Directed Backtracking in a System for Computer-Aided Circuit Analysis , 1976, Artif. Intell..

[14]  David Chapman,et al.  Pengi: An Implementation of a Theory of Activity , 1987, AAAI.

[15]  Earl D. Sacerdoti,et al.  Planning in a Hierarchy of Abstraction Spaces , 1974, IJCAI.

[16]  Rodney A. Brooks,et al.  A Robust Layered Control Syste For A Mobile Robot , 2022 .

[17]  Jon Doyle,et al.  A Truth Maintenance System , 1979, Artif. Intell..

[18]  C SchankRoger,et al.  Dynamic Memory: A Theory of Reminding and Learning in Computers and People , 1983 .

[19]  Michael Freed,et al.  Plan Debugging in an Intentional System , 1991, IJCAI.

[20]  G. Collins,et al.  Transcending inductive category formation in learning , 1986, Behavioral and Brain Sciences.

[21]  Michael Freed,et al.  Model-Based Diagnosis of Planning Failures , 1990, AAAI.

[22]  Reid G. Simmons,et al.  A Theory of Debugging Plans and Interpretations , 1988, AAAI.

[23]  Earl David Sacerdoti,et al.  A Structure for Plans and Behavior , 1977 .

[24]  E. Feigenbaum,et al.  Computers and Thought , 1963 .

[25]  F. Hayes-Roth Using Proofs and Refutations to Learn from Experience , 1983 .

[26]  John R. Anderson,et al.  MACHINE LEARNING An Artificial Intelligence Approach , 2009 .

[27]  Allen Newell,et al.  GPS, a program that simulates human thought , 1995 .

[28]  Randall Davis,et al.  Diagnostic Reasoning Based on Structure and Behavior , 1984, Artif. Intell..