Learning Action Models from Disordered and Noisy Plan Traces

There is increasing awareness in the planning community that the burden of specifying complete domain models is too high, which impedes the applicability of planning technology in many real-world domains. Although there have many learning systems that help automatically learning domain models, most existing work assumes that the input traces are completely correct. A more realistic situation is that the plan traces are disordered and noisy, such as plan traces described by natural language. In this paper we propose and evaluate an approach for doing this. Our approach takes as input a set of plan traces with disordered actions and noise and outputs action models that can best explain the plan traces. We use a MAX-SAT framework for learning, where the constraints are derived from the given plan traces. Unlike traditional action models learners, the states in plan traces can be partially observable and noisy as well as the actions in plan traces can be disordered and parallel. We demonstrate the effectiveness of our approach through a systematic empirical evaluation with both IPC domains and the real-world dataset extracted from natural language documents.

[1]  Qiang Yang,et al.  Cross-Domain Action-Model Acquisition for Planning via Web Search , 2011, ICAPS.

[2]  Subbarao Kambhampati,et al.  Model-Lite Case-Based Planning , 2012, AAAI.

[3]  Brendan Juba,et al.  Efficient, Safe, and Probably Approximately Complete Learning of Action Models , 2017, IJCAI.

[4]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[5]  Qiang Yang,et al.  Action-model acquisition for planning via transfer learning , 2014, Artif. Intell..

[6]  Mark Steedman,et al.  Learning STRIPS Operators from Noisy and Incomplete Observations , 2012, UAI.

[7]  Qiang Yang,et al.  Learning complex action models with quantifiers and logical implications , 2010, Artif. Intell..

[8]  Subbarao Kambhampati,et al.  Action-Model Acquisition from Noisy Plan Traces , 2013, IJCAI.

[9]  João Fernando Ferreira,et al.  Framer: Planning Models from Natural Language Action Descriptions , 2017, ICAPS.

[10]  Stephen Cresswell,et al.  Domain Model Acquisition in the Presence of Static Relations in the LOP System , 2015, ICAPS.

[11]  Subbarao Kambhampati,et al.  Model-lite planning: Case-based vs. model-based approaches , 2017, Artif. Intell..

[12]  Qiang Yang,et al.  Learning action models from plan examples using weighted MAX-SAT , 2007, Artif. Intell..

[13]  Christer Bäckström,et al.  Plan Reordering and Parallel Execution - A Parameterized Complexity View , 2017, AAAI.

[14]  Yolanda Gil,et al.  Learning by Experimentation: Incremental Refinement of Incomplete Planning Domains , 1994, International Conference on Machine Learning.

[15]  L. P. Kaelbling,et al.  Learning Symbolic Models of Stochastic Domains , 2007, J. Artif. Intell. Res..

[16]  Hector Muñoz-Avila,et al.  Learning hierarchical task network domains from partially observed plan traces , 2014, Artif. Intell..

[17]  Subbarao Kambhampati,et al.  Extracting Action Sequences from Texts Based on Deep Reinforcement Learning , 2018, IJCAI.

[18]  Eyal Amir,et al.  Learning Partially Observable Deterministic Action Models , 2005, IJCAI.

[19]  Daniel Bryce,et al.  Maintaining Evolving Domain Models , 2016, IJCAI.

[20]  Richard Fikes,et al.  STRIPS: A New Approach to the Application of Theorem Proving to Problem Solving , 1971, IJCAI.

[21]  George Konidaris,et al.  Learning Portable Representations for High-Level Planning , 2019, ICML.

[22]  Fahiem Bacchus,et al.  Reduced Cost Fixing for Maximum Satisfiability , 2018, IJCAI.

[23]  Piergiorgio Bertoli,et al.  Web Service Composition as Planning, Revisited: In Between Background Theories and Initial State Uncertainty , 2007, AAAI.

[24]  Eva Onaindia,et al.  Learning STRIPS Action Models with Classical Planning , 2018, ICAPS.

[25]  Alex S. Fukunaga,et al.  Classical Planning in Deep Latent Space: Bridging the Subsymbolic-Symbolic Boundary , 2017, AAAI.

[26]  Eva Onaindia,et al.  Learning action models with minimal observability , 2019, Artif. Intell..

[27]  Leslie Pack Kaelbling,et al.  Learning Planning Rules in Noisy Stochastic Worlds , 2005, AAAI.

[28]  John D. Lafferty,et al.  Inducing Features of Random Fields , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[29]  Felip Manyà,et al.  New Inference Rules for Max-SAT , 2007, J. Artif. Intell. Res..

[30]  Piergiorgio Bertoli,et al.  Automated composition of Web services via planning in asynchronous domains , 2005, Artif. Intell..

[31]  Regina Barzilay,et al.  Learning High-Level Planning from Text , 2012, ACL.

[32]  Leslie Pack Kaelbling,et al.  From Skills to Symbols: Learning Symbolic Representations for Abstract High-Level Planning , 2018, J. Artif. Intell. Res..

[33]  Hiroshi Kajino,et al.  Towards Stable Symbol Grounding with Zero-Suppressed State AutoEncoder , 2019, ICAPS.