论文信息 - Combined Task and Motion Planning under Partial Observability: An Optimization-Based Approach

Combined Task and Motion Planning under Partial Observability: An Optimization-Based Approach

We propose a novel approach to Combined Task and Motion Planning (TAMP) under partial observability. Previous optimization-based TAMP methods [1][2] compute optimal plans and paths assuming full observability. However, partial observability requires the solution to be a policy that reacts to the observations that the agent receives. We consider a formulation where observations introduce additional branching in the symbolic decision tree. The solution is now given by a reactive policy on the symbolic level together with a path tree that describes the branchings of optimal motion depending on the observations. Our method works in two stages: First, the symbolic policy is optimized using approximate path costs estimated from independent optimizations of trajectory pieces. Second, we fix the best symbolic policy and optimize a joint trajectory tree. We test our approach on object manipulation and autonomous driving examples. We also compare the algorithm’s performance to a state-of-the-art TAMP planner in fully observable cases.

Marc Toussaint | Camille Phiquepal

[1] Leslie Pack Kaelbling,et al. A constraint-based method for solving sequential manipulation planning problems , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[2] Leslie Pack Kaelbling,et al. Integrated task and motion planning in belief space , 2013, Int. J. Robotics Res..

[3] Manuel Lopes,et al. Multi-bound tree search for logic-geometric programming in cooperative manipulation domains , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[4] Emilio Frazzoli,et al. A Survey of Motion Planning and Control Techniques for Self-Driving Urban Vehicles , 2016, IEEE Transactions on Intelligent Vehicles.

[5] Alessandro Saffiotti,et al. Constraint propagation on interval bounds for dealing with geometric backtracking , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[6] Swarat Chaudhuri,et al. Incremental Task and Motion Planning: A Constraint-Based Approach , 2016, Robotics: Science and Systems.

[7] Ronen I. Brafman,et al. R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning , 2001, J. Mach. Learn. Res..

[8] Marc Toussaint,et al. Logic-Geometric Programming: An Optimization-Based Approach to Combined Task and Motion Planning , 2015, IJCAI.

[9] Christos Katrakazas,et al. Real-time motion planning methods for autonomous on-road driving: State-of-the-art and future research directions , 2015 .

[10] Alessandro Saffiotti,et al. Efficiently combining task and motion planning using geometric constraints , 2014, Int. J. Robotics Res..

[11] Marc Toussaint,et al. Multi-Agent Task and Motion Planning: An Optimization based Approach , 2018 .