Combined Task and Motion Planning under Partial Observability: An Optimization-Based Approach

We propose a novel approach to Combined Task and Motion Planning (TAMP) under partial observability. Previous optimization-based TAMP methods [1][2] compute optimal plans and paths assuming full observability. However, partial observability requires the solution to be a policy that reacts to the observations that the agent receives. We consider a formulation where observations introduce additional branching in the symbolic decision tree. The solution is now given by a reactive policy on the symbolic level together with a path tree that describes the branchings of optimal motion depending on the observations. Our method works in two stages: First, the symbolic policy is optimized using approximate path costs estimated from independent optimizations of trajectory pieces. Second, we fix the best symbolic policy and optimize a joint trajectory tree. We test our approach on object manipulation and autonomous driving examples. We also compare the algorithm’s performance to a state-of-the-art TAMP planner in fully observable cases.

[1]  Leslie Pack Kaelbling,et al.  A constraint-based method for solving sequential manipulation planning problems , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[2]  Leslie Pack Kaelbling,et al.  Integrated task and motion planning in belief space , 2013, Int. J. Robotics Res..

[3]  Manuel Lopes,et al.  Multi-bound tree search for logic-geometric programming in cooperative manipulation domains , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[4]  Emilio Frazzoli,et al.  A Survey of Motion Planning and Control Techniques for Self-Driving Urban Vehicles , 2016, IEEE Transactions on Intelligent Vehicles.

[5]  Alessandro Saffiotti,et al.  Constraint propagation on interval bounds for dealing with geometric backtracking , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[6]  Swarat Chaudhuri,et al.  Incremental Task and Motion Planning: A Constraint-Based Approach , 2016, Robotics: Science and Systems.

[7]  Ronen I. Brafman,et al.  R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning , 2001, J. Mach. Learn. Res..

[8]  Marc Toussaint,et al.  Logic-Geometric Programming: An Optimization-Based Approach to Combined Task and Motion Planning , 2015, IJCAI.

[9]  Christos Katrakazas,et al.  Real-time motion planning methods for autonomous on-road driving: State-of-the-art and future research directions , 2015 .

[10]  Alessandro Saffiotti,et al.  Efficiently combining task and motion planning using geometric constraints , 2014, Int. J. Robotics Res..

[11]  Marc Toussaint,et al.  Multi-Agent Task and Motion Planning: An Optimization based Approach , 2018 .