Features, Projections, and Representation Change for Generalized Planning

Generalized planning is concerned with the characterization and computation of plans that solve many instances at once. In the standard formulation, a generalized plan is a mapping from feature or observation histories into actions, assuming that the instances share a common pool of features and actions. This assumption, however, excludes the standard relational planning domains where actions and objects change across instances. In this work, we extend the formulation of generalized planning to such domains. This is achieved by projecting the actions over the features, resulting in a common set of abstract actions which can be tested for soundness and completeness, and which can be used for generating general policies such as "if the gripper is empty, pick the clear block above x and place it on the table" that achieve the goal clear(x) in any Blocksworld instance. In this policy, "pick the clear block above x" is an abstract action that may represent the action Unstack(a, b) in one situation and the action Unstack(b, c) in another. Transformations are also introduced for computing such policies by means of fully observable non-deterministic (FOND) planners. The value of generalized representations for learning general policies is also discussed.

[1]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[2]  G. Carpenter,et al.  Behavioral and Brain Sciences , 1999 .

[3]  Marco Pistore,et al.  Weak, strong, and strong cyclic planning via symbolic model checking , 2003, Artif. Intell..

[4]  Neil Immerman,et al.  A new representation and associated algorithms for generalized planning , 2011, Artif. Intell..

[5]  David Chapman,et al.  Penguins Can Make Cake , 1989, AI Mag..

[6]  S. Crawford,et al.  Volume 1 , 2012, Journal of Diabetes Investigation.

[7]  Yuxiao Hu,et al.  Generalized Planning: Synthesizing Plans that Work for Multiple Environments , 2011, IJCAI.

[8]  Hector J. Levesque,et al.  Foundations for Generalized Planning in Unbounded Stochastic Domains , 2016, KR.

[9]  Robert Mattmüller,et al.  Pattern-Database Heuristics for Partially Observable Nondeterministic Planning , 2010, KI.

[10]  Hector Geffner,et al.  Learning Generalized Policies from Planning Examples Using Concept Languages , 2004, Applied Intelligence.

[11]  Giuseppe De Giacomo,et al.  Generalized Planning: Non-Deterministic Abstractions and Trajectory Constraints , 2017, IJCAI.

[12]  Nils J. Nilsson,et al.  Artificial Intelligence , 1974, IFIP Congress.

[13]  Blai Bonet,et al.  Automatic Derivation of Memoryless Policies and Finite-State Controllers Using Classical Planners , 2009, ICAPS.

[14]  Neil Immerman,et al.  Qualitative Numeric Planning , 2011, AAAI.

[15]  Rajesh P. N. Rao,et al.  Embodiment is the foundation, not a level , 1996, Behavioral and Brain Sciences.

[16]  Neil Immerman,et al.  Learning Generalized Plans Using Abstract Counting , 2008, AAAI.

[17]  Malte Helmert,et al.  Pattern Database Heuristics for Fully Observable Nondeterministic Planning , 2010, ICAPS.

[18]  Craig Boutilier,et al.  Symbolic Dynamic Programming for First-Order MDPs , 2001, IJCAI.

[19]  Razvan Pascanu,et al.  Learning to Navigate in Complex Environments , 2016, ICLR.

[20]  Martijn van Otterlo,et al.  Solving Relational and First-Order Logical Markov Decision Processes: A Survey , 2012, Reinforcement Learning.

[21]  Roni Khardon,et al.  First Order Decision Diagrams for Relational MDPs , 2007, IJCAI.

[22]  Malte Helmert,et al.  Decidability and Undecidability Results for Planning with Numerical State Variables , 2002, PuK.

[23]  Javier Segovia Aguas,et al.  Generalized Planning with Procedural Domain Control Knowledge , 2016, ICAPS.

[24]  Pieter Abbeel,et al.  Learning Generalized Reactive Policies using Deep Neural Networks , 2017, ICAPS.

[25]  D. Ballard,et al.  Memory Representations in Natural Tasks , 1995, Journal of Cognitive Neuroscience.

[26]  Robert Givan,et al.  Approximate Policy Iteration with a Policy Language Bias , 2003, NIPS.

[27]  Hector Geffner,et al.  Width and Serialization of Classical Planning Problems , 2012, ECAI.