On Control Knowledge Acquisition by Exploiting Human-Computer Interaction

In the last decade, there has been a strong and increasing interest on building fast planning systems. From it, we have seen an enormous improvement in relation to the solvability horizon of current planning techniques. Most of these techniques rely on the translation of the predicate logic description of domains into propositional representations of them. Then, a fast search procedure is applied for obtaining solutions to planning problems. While this is an important step towards solving the planning problem, we believe older planners that are based on predicate logic computation can still be competitive when they are combined with learning capabilities. This is so, because they can acquire and use more abstract representations of control knowledge (CK) which are easier to describe and maintain by humans and/or automatic systems than the ones based on propositional logic. In this paper, we present the results we have obtained with a relatively "old" planner, Prodigy4.0, powered with CK that has been acquired using a mixed human-computer collaboration. We advocate for the initial generation of CK by a learning system, and its later refinement by a human. In fact, we can iterate this process until the desired result has been achieved. We show results in the logistics domain used at AIPS'00 that are at the same level when compared with other techniques in the Track on hand-tailored planning systems (Track2).

[1]  Blai Bonet,et al.  Planning as heuristic search , 2001, Artif. Intell..

[2]  Hector Muñoz-Avila,et al.  SHOP: Simple Hierarchical Ordered Planner , 1999, IJCAI.

[3]  Craig A. Knoblock,et al.  Planning by Rewriting , 2001, J. Artif. Intell. Res..

[4]  Jaime G. Carbonell,et al.  Control Knowledge to Improve Plan Quality , 1994, AIPS.

[5]  Fangzhen Lin,et al.  From Causal Theories to Successor State Axioms and STRIPS-Like Systems , 2000, AAAI/IAAI.

[6]  Tara A. Estlin,et al.  Multi-Strategy Learning of Search Control for Partial-Order Planning , 1996, AAAI/IAAI, Vol. 1.

[7]  Sira Vegas,et al.  Quality-based Learning for Planning , 2001 .

[8]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[9]  Malik Ghallab,et al.  Representation and Control in IxTeT, a Temporal Planner , 1994, AIPS.

[10]  Pedro Isasi Viñuela,et al.  Genetic Programming and Deductive-Inductive Learning: A Multi-Strategy Approach , 1998, ICML.

[11]  D. Meziat,et al.  An Artificial Intelligence Planner for Satellite Operations , 2001 .

[12]  Craig A. Knoblock,et al.  Flexible and Scalable Query Planning in Distributed and Heterogeneous Environments , 1998, AIPS.

[13]  David Ruby,et al.  Learning Episodes for Optimization , 1992, ML.

[14]  Hector Geffner,et al.  Heuristic Planning with Time and Resources , 2014 .

[15]  Peter Dayan,et al.  Technical Note: Q-Learning , 2004, Machine Learning.

[16]  Manuela M. Veloso,et al.  Lazy Incremental Learning of Control Knowledge for Efficiently Obtaining Quality Plans , 1997, Artificial Intelligence Review.

[17]  Daniel S. Weld,et al.  UCPOP: A Sound, Complete, Partial Order Planner for ADL , 1992, KR.

[18]  Pedro Isasi Viñuela,et al.  Knowledge Representation Issues in Control Knowledge Learning , 2000, ICML.

[19]  Steve Hanks,et al.  Flaw Selection Strategies for Value-Directed Planning , 1996, AIPS.

[20]  Eugene Fink,et al.  Integrating planning and learning: the PRODIGY architecture , 1995, J. Exp. Theor. Artif. Intell..

[21]  Masahiko Iwamoto,et al.  A Planner with Quality Goal and Its Speed-up Learning for Optimization Problem , 1994, AIPS.

[22]  Fahiem Bacchus,et al.  Using temporal logics to express search control knowledge for planning , 2000, Artif. Intell..

[23]  Tara A. Estlin,et al.  Learning to Improve both Efficiency and Quality of Planning , 1997, IJCAI.