Exploring the Synergy between Two Modular Learning Techniques for Automated Planning

Over the last decade, efforts to improve the operational performance of domain-independent automated planners have focused on developing complex techniques that merge a range of different strategies. This quest for operational advantage, driven by the regular international planning competitions, has made it difficult to study, understand and predict which combinations of techniques will have which effects on a planner’s behaviour in a particular application domain. In this paper, we consider two machine learning techniques for improving planner performance, and exploit a modular approach to their combination in order to facilitate the analysis of the impact of each individual component. We believe this can contribute to the development of more transparent planning engines, designed using modular, interchangeable, and well-founded components. Specifically, we combine two previously unrelated learning techniques, entanglements and relational decision trees, to guide a “vanilla” search algorithm. We report on a large experimental analysis which demonstrates the effectiveness of the approach in terms of performance improvements, resulting in a very competitive planning configuration despite the use of a more modular and transparent architecture. The analysis also gives insights into the strengths and weaknesses of the considered approaches, which will help their future exploitation.
