Learning STRIPS Operators from Noisy and Incomplete Observations

Agents learning to act autonomously in real-world domains must acquire a model of the dynamics of the domain in which they operate. Learning domain dynamics can be challenging, especially where an agent only has partial access to the world state, and/or noisy external sensors. Even in standard STRIPS domains, existing approaches cannot learn from noisy, incomplete observations typical of real-world domains. We propose a method which learns STRIPS action models in such domains, by decomposing the problem into first learning a transition function between states in the form of a set of classifiers, and then deriving explicit STRIPS rules from the classifiers' parameters. We evaluate our approach on simulated standard planning domains from the International Planning Competition, and show that it learns useful domain descriptions from noisy, incomplete observations.

[1]  Richard Fikes,et al.  STRIPS: A New Approach to the Application of Theorem Proving to Problem Solving , 1971, IJCAI.

[2]  Tom M. Mitchell,et al.  Generalization as Search , 2002 .

[3]  Leslie G. Valiant,et al.  Learning Disjunction of Conjunctions , 1985, IJCAI.

[4]  Yolanda Gil,et al.  Learning by Experimentation: Incremental Refinement of Incomplete Planning Domains , 1994, International Conference on Machine Learning.

[5]  Xuemei Wang,et al.  Learning by Observation and Practice: An Incremental Approach for Planning Operator Acquisition , 1995, ICML.

[6]  Scott Sherwood Benson,et al.  Learning action models for reactive autonomous agents , 1996 .

[7]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[8]  Yoav Freund,et al.  Large Margin Classification Using the Perceptron Algorithm , 1998, COLT' 98.

[9]  Ken Sadohara,et al.  Learning of Boolean Functions Using Support Vector Machines , 2001, ALT.

[10]  Rocco A. Servedio,et al.  Efficiency versus Convergence of Boolean Kernels for On-Line Learning Algorithms , 2001, NIPS.

[11]  Giorgio Metta,et al.  Early integration of vision and manipulation , 2003, Proceedings of the International Joint Conference on Neural Networks, 2003..

[12]  Giorgio Metta,et al.  Better Vision through Manipulation , 2003, Adapt. Behav..

[13]  Rocco A. Servedio,et al.  Maximum Margin Algorithms with Boolean Kernels , 2005, J. Mach. Learn. Res..

[14]  Eyal Amir,et al.  Learning Partially Observable Deterministic Action Models , 2005, IJCAI.

[15]  David Haussler,et al.  Learning Conjunctive Concepts in Structural Domains , 1989, Machine Learning.

[16]  Maurice Bruynooghe,et al.  Online Learning and Exploiting Relational Models in Reinforcement Learning , 2007, IJCAI.

[17]  Maya Cakmak,et al.  From primitive behaviors to goal-directed behavior using affordances , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[18]  Qiang Yang,et al.  Learning action models from plan examples using weighted MAX-SAT , 2007, Artif. Intell..

[19]  Roni Khardon,et al.  Noise Tolerant Variants of the Perceptron Algorithm , 2007, J. Mach. Learn. Res..

[20]  Peter Geibel,et al.  Learning Models of Relational MDPs Using Graph Kernels , 2007, MICAI.

[21]  L. P. Kaelbling,et al.  Learning Symbolic Models of Stochastic Domains , 2007, J. Artif. Intell. Res..

[22]  Benjamin Kuipers,et al.  The initial development of object knowledge by a learning robot , 2008, Robotics Auton. Syst..

[23]  Mark Steedman,et al.  Learning action effects in partially observable domains , 2010, ECAI.

[24]  Céline Rouveirol,et al.  Incremental Learning of Relational Action Models in Noisy Environments , 2010, ILP.

[25]  Qiang Yang,et al.  Learning complex action models with quantifiers and logical implications , 2010, Artif. Intell..

[26]  Kira Mourão Learning action representations using kernel perceptrons , 2012 .

[27]  Aixia Guo,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2014 .