Learning Recursive Control Programs from Problem Solving

In this paper, we propose a new representation for physical control -- teleoreactive logic programs -- along with an interpreter that uses them to achieve goals. In addition, we present a new learning method that acquires recursive forms of these structures from traces of successful problem solving. We report experiments in three different domains that demonstrate the generality of this approach. In closing, we review related work on learning complex skills and discuss directions for future research on this topic.

[1]  Nils J. Nilsson,et al.  Teleo-Reactive Programs for Agent Control , 1993, J. Artif. Intell. Res..

[2]  Pat Langley,et al.  An architecture for persistent reactive behavior , 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, 2004. AAMAS 2004..

[3]  Pat Langley,et al.  A CONSTRAINED ARCHITECTURE FOR LEARNING AND PROBLEM SOLVING , 2005, Comput. Intell..

[4]  Fahiem Bacchus,et al.  The AIPS '00 Planning Competition , 2001, AI Mag..

[5]  Pat Langley,et al.  Learning Teleoreactive Logic Programs from Problem Solving , 2005, ILP.

[6]  Jude W. Shavlik,et al.  Acquiring Recursive Concepts with Explanation-Based Learning , 1989, IJCAI.

[7]  Roland Olsson,et al.  Inductive Functional Programming Using Incremental Program Transformation , 1995, Artif. Intell..

[8]  Stacy Marsella,et al.  A Method for Biasing the Learning of Nonterminal Reduction Rules , 1993 .

[9]  Raymond J. Mooney,et al.  The Effect of Rule Use on the Utility of Explanation-Based Learning , 1989, IJCAI.

[10]  Pat Langley,et al.  Learning Teleoreactive Logic Programs by Observation , 2005 .

[11]  Hector Muñoz-Avila,et al.  SHOP: Simple Hierarchical Ordered Planner , 1999, IJCAI.

[12]  Claude Sammut,et al.  Automatic construction of reactive control systems using symbolic machine learning , 1996, The Knowledge Engineering Review.

[13]  Fahiem Bacchus,et al.  AIPS 2000 Planning Competition: The Fifth International Conference on Artificial Intelligence Planning and Scheduling Systems , 2001 .

[14]  James A. Hendler,et al.  HTN Planning: Complexity and Expressivity , 1994, AAAI.

[15]  Glenn A. Iba,et al.  A heuristic approach to the discovery of macro-operators , 2004, Machine Learning.

[16]  Allen Newell,et al.  GPS, a program that simulates human thought , 1995 .

[17]  Jaime G. Carbonell,et al.  Derivational Analogy in PRODIGY: Automating Case Acquisition, Storage, and Utilization , 1993, Machine Learning.

[18]  Scott Benson,et al.  Inductive Learning of Reactive Action Models , 1995, ICML.

[19]  Steven Minton,et al.  Quantitative Results Concerning the Utility of Explanation-based Learning , 1988, Artif. Intell..

[20]  Pat Langley,et al.  Learning, development, and production systems , 1987 .

[21]  Robert Givan,et al.  Learning Domain-Specific Control Knowledge from Random Walks , 2004, ICAPS.

[22]  Richard S. Sutton,et al.  Reinforcement Learning , 1992, Handbook of Machine Learning.

[23]  Allen Newell,et al.  Chunking in Soar: The anatomy of a general learning mechanism , 1985, Machine Learning.

[24]  Ute Schmid A Cognitive Model of Learning by Doing , 2005 .

[25]  Steven Minton,et al.  Machine Learning Methods for Planning , 1994 .

[26]  Raymond J. Mooney,et al.  Combining FOIL and EBG to Speed-up Logic Programs , 1993, IJCAI.

[27]  William F. Clocksin,et al.  Programming in Prolog , 1981, Springer Berlin Heidelberg.

[28]  David Ruby,et al.  SteppingStone: An Empirical and Analytical Evaluation , 1991, AAAI.

[29]  Tom M. Mitchell,et al.  Learning from Solution Paths: An Approach to the Credit Assignment Problem , 1982, AI Mag..

[30]  Prasad Tadepalli,et al.  Learning Goal-Decomposition Rules Using Exercises , 1997, AAAI/IAAI.

[31]  Roni Khardon,et al.  Learning Action Strategies for Planning Domains , 1999, Artif. Intell..

[32]  P. Langley,et al.  Production system models of learning and development , 1987 .

[33]  Andrew G. Barto,et al.  Reinforcement learning , 1998 .

[34]  Tara A. Estlin,et al.  Learning to Improve both Efficiency and Quality of Planning , 1997, IJCAI.