Transfer Planning for Temporal Logic Tasks

This paper proposes an optimal control synthesis algorithm for Linear Temporal Logic (LTL) tasks that exploits experience from solving similar LTL tasks before. The key idea is to appropriately decompose complex LTL tasks into simpler subtasks and define sets of skills, or plans, needed to solve these subtasks. These skills can be stored in a library of reusable skills and can be used to quickly synthesize plans for new tasks that have not been encountered before. Our proposed method is inspired by literature on multi-task learning and can be used to transfer experience between different LTL tasks. It amounts to a new paradigm in model-checking and optimal control synthesis methods that to this date do not use prior experience to solve planning problems. We present numerical experiments that show that our approach generally outperforms these methods in terms of time to generate feasible plans. We also show that our proposed algorithm is probabilistically complete and asymptotically optimal.

[1]  Michael M. Zavlanos,et al.  Intermittent connectivity control in mobile robot networks , 2015, 2015 49th Asilomar Conference on Signals, Systems and Computers.

[2]  Christel Baier,et al.  Principles of model checking , 2008 .

[3]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[4]  Ufuk Topcu,et al.  Optimization-based trajectory generation with linear temporal logic specifications , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[5]  James A. Hendler,et al.  UMCP: A Sound and Complete Procedure for Hierarchical Task-network Planning , 1994, AIPS.

[6]  Michael M. Zavlanos,et al.  Sampling-Based Optimal Control Synthesis for Multirobot Systems Under Global Temporal Tasks , 2017, IEEE Transactions on Automatic Control.

[7]  Michael M. Zavlanos,et al.  Temporal Logic Optimal Control for Large-Scale Multi-Robot Systems: 10400 States and Beyond , 2018, 2018 IEEE Conference on Decision and Control (CDC).

[8]  Yushan Chen,et al.  Formal Approach to the Deployment of Distributed Robotic Teams , 2012, IEEE Transactions on Robotics.

[9]  Calin Belta,et al.  Sampling-based temporal logic path planning , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[10]  Paulo Tabuada,et al.  Linear temporal logic motion planning for teams of underactuated robots using satisfiability modulo convex programming , 2017, 2017 IEEE 56th Annual Conference on Decision and Control (CDC).

[11]  Fahiem Bacchus,et al.  Using temporal logics to express search control knowledge for planning , 2000, Artif. Intell..

[12]  Michael M. Zavlanos,et al.  Sampling-Based Control Synthesis for Multi-robot Systems under Global Temporal Specifications , 2017, 2017 ACM/IEEE 8th International Conference on Cyber-Physical Systems (ICCPS).

[13]  Calin Belta,et al.  Optimal path planning for surveillance with temporal-logic constraints* , 2011, Int. J. Robotics Res..

[14]  Sheila A. McIlraith,et al.  Using Reward Machines for High-Level Task Specification and Decomposition in Reinforcement Learning , 2018, ICML.