论文信息 - Gradient-Based Mixed Planning with Discrete and Continuous Actions

Gradient-Based Mixed Planning with Discrete and Continuous Actions

Dealing with planning problems with both discrete logical relations and continuous numeric changes in real-world dynamic environments is challenging. Existing numeric planning systems for the problem often discretize numeric variables or impose convex quadratic constraints on numeric variables, which harms the performance when solving the problem. In this paper, we propose a novel algorithm framework to solve the numeric planning problems mixed with discrete and continuous actions based on gradient descent. We cast the numeric planning with discrete and continuous actions as an optimization problem by integrating a heuristic function based on discrete effects. Specifically, we propose a gradient-based framework to simultaneously optimize continuous parameters and actions of candidate plans. The framework is combined with a heuristic module to estimate the best plan candidate to transit initial state to the goal based on relaxation. We repeatedly update numeric parameters and compute candidate plan until it converges to a valid plan to the planning problem. In the empirical study, we exhibit that our algorithm framework is both effective and efficient, especially when solving non-convex planning problems.

Subbarao Kambhampati | Hankz Hankui Zhuo | Kebing Jin | Hai Wan | Zhanhao Xiao

[1] Pieter Abbeel,et al. Combined task and motion planning through an extensible planner-independent interface layer , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[2] Andrew Coles,et al. Temporal Planning with Preferences and Time-Dependent Continuous Costs , 2012, ICAPS.

[3] J.W. Langelaan. Tree-based trajectory planning to exploit atmospheric energy , 2008, 2008 American Control Conference.

[4] Leslie Pack Kaelbling,et al. Hierarchical task and motion planning in the now , 2011, 2011 IEEE International Conference on Robotics and Automation.

[5] Patrik Haslum,et al. Improving Delete Relaxation Heuristics Through Explicitly Represented Conjunctions , 2014, J. Artif. Intell. Res..

[6] Daniele Magazzeni,et al. A universal planning system for hybrid domains , 2011, Applied Intelligence.

[7] Bernhard Nebel,et al. The FF Planning System: Fast Plan Generation Through Heuristic Search , 2011, J. Artif. Intell. Res..

[8] Jörg Hoffmann,et al. FF: The Fast-Forward Planning System , 2001, AI Mag..

[9] Pieter Abbeel,et al. Learning Plannable Representations with Causal InfoGAN , 2018, NeurIPS.

[10] Maria Fox,et al. A Compilation of the Full PDDL+ Language into SMT , 2016, ICAPS.

[11] Robert Mattmüller,et al. Using the Context-enhanced Additive Heuristic for Temporal and Numeric Planning , 2009, ICAPS.

[12] Kimon P. Valavanis,et al. Evolutionary algorithm based offline/online path planner for UAV navigation , 2003, IEEE Trans. Syst. Man Cybern. Part B.

[13] Shuxiang Guo,et al. Path-planning optimization of underwater microrobots in 3-D space by PSO Approach , 2009, 2009 IEEE International Conference on Robotics and Biomimetics (ROBIO).

[14] Henry Kautz,et al. Integer optimization models of AI planning problems , 2000, The Knowledge Engineering Review.

[15] Razvan Pascanu,et al. On the difficulty of training recurrent neural networks , 2012, ICML.

[16] Maria Fox,et al. Heuristic Planning for Hybrid Systems , 2016, AAAI.

[17] Gregory D. Hager,et al. Sampling-Based Motion and Symbolic Action Planning with geometric and differential constraints , 2010, 2010 IEEE International Conference on Robotics and Automation.

[18] Leslie Pack Kaelbling,et al. A constraint-based method for solving sequential manipulation planning problems , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[19] Rachid Alami,et al. A Hybrid Approach to Intricate Motion, Manipulation and Task Planning , 2009, Int. J. Robotics Res..

[20] Sepp Hochreiter,et al. The Vanishing Gradient Problem During Learning Recurrent Neural Nets and Problem Solutions , 1998, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[21] Patrik Haslum,et al. Interval-Based Relaxation for General Numeric Planning , 2016, ECAI.

[22] Dongyan Zhao,et al. Plan-And-Write: Towards Better Automatic Storytelling , 2018, AAAI.

[23] Mark O. Riedl,et al. Controllable Neural Story Generation via Reinforcement Learning , 2018, ArXiv.

[24] Andrew Coles,et al. COLIN: Planning with Continuous Linear Numeric Change , 2012, J. Artif. Intell. Res..

[25] Humbert Fiorino,et al. HDDL: An Extension to PDDL for Expressing Hierarchical Planning Problems , 2020, AAAI.

[26] Paul J. Werbos,et al. Backpropagation Through Time: What It Does and How to Do It , 1990, Proc. IEEE.

[27] Fuchun Sun,et al. Evolutionary route planner for unmanned air vehicles , 2005, IEEE Transactions on Robotics.

[28] Erez Karpas,et al. ScottyActivity: Mixed Discrete-Continuous Planning with Convex Optimization , 2018, J. Artif. Intell. Res..

[29] Otthein Herzog,et al. Planning with Numeric Key Performance Indicators over Dynamic Organizations of Intelligent Agents , 2014, MATES.

[30] Maria Fox,et al. Modelling Mixed Discrete-Continuous Domains for Planning , 2006, J. Artif. Intell. Res..

[31] Scott Sanner,et al. Scalable Planning with Tensorflow for Hybrid Nonlinear Domains , 2017, NIPS.

[32] Ronald J. Williams,et al. A Learning Algorithm for Continually Running Fully Recurrent Neural Networks , 1989, Neural Computation.

[33] Maria Fox,et al. Planning Using Actions with Control Parameters , 2016, ECAI.

[34] Luis Gracia,et al. A supervisory loop approach to fulfill workspace constraints in redundant robots , 2012, Robotics Auton. Syst..

[35] Maria Fox,et al. PDDL2.1: An Extension to PDDL for Expressing Temporal Planning Domains , 2003, J. Artif. Intell. Res..

[36] Leslie Pack Kaelbling,et al. From Skills to Symbols: Learning Symbolic Representations for Abstract High-Level Planning , 2018, J. Artif. Intell. Res..

[37] Aluizio F. R. Araújo,et al. A partially recurrent neural network to perform trajectory planning, inverse kinematics, and inverse dynamics , 1998, SMC'98 Conference Proceedings. 1998 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.98CH36218).

[38] Jianqiang Li,et al. A Hybrid Path Planning Method in Unmanned Air/Ground Vehicle (UAV/UGV) Cooperative Systems , 2016, IEEE Transactions on Vehicular Technology.

[39] Avrim Blum,et al. Fast Planning Through Planning Graph Analysis , 1995, IJCAI.

[40] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[41] Scott Sanner,et al. Metric Hybrid Factored Planning in Nonlinear Domains with Constraint Generation , 2019, CPAIOR.

[42] Shuai Li,et al. Independently Recurrent Neural Network (IndRNN): Building A Longer and Deeper RNN , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[43] Ying Lin,et al. Automatic path planning for autonomous underwater vehicles based on an adaptive differential evolution , 2014, GECCO.

[44] Subbarao Kambhampati,et al. A Hybrid Linear Programming and Relaxed Plan Heuristic for Partial Satisfaction Planning Problems , 2007, ICAPS.

[45] Patrik Haslum,et al. Optimal Planning with Global Numerical State Constraints , 2014, ICAPS.

[46] Michael I. Jordan. Serial Order: A Parallel Distributed Processing Approach , 1997 .

[47] Brian C. Williams,et al. Generative Planning for Hybrid Systems Based on Flow Tubes , 2008, ICAPS.

[48] nbsp,et al. Minimum-time Approach to Obstacle Avoidance Constrained by Envelope Protection for Autonomous UAVs , 2009 .

[49] A. Caiti,et al. Evolutionary path planning for autonomous underwater vehicles in a variable ocean , 2004, IEEE Journal of Oceanic Engineering.

[50] Amanda Jane Coles,et al. Mixed Discrete Continuous Non-Linear Planning through Piecewise Linear Approximation , 2019, ICAPS.

[51] Jörg Hoffmann,et al. The Metric-FF Planning System: Translating ''Ignoring Delete Lists'' to Numeric State Variables , 2003, J. Artif. Intell. Res..

[52] Dai Yun-tao,et al. Path Planning of Underwater Vehicle in 3D Space Based on Ant Colony Algorithm , 2008 .

[53] Andrew Coles,et al. Forward-Chaining Partial-Order Planning , 2010, ICAPS.

[54] Enrique,et al. Generative multi-robot task and motion planning over long horizons , 2018 .

[55] Lydia E. Kavraki,et al. Informing Multi-Modal Planning with Synergistic Discrete Leads , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[56] Malte Helmert,et al. The Fast Downward Planning System , 2006, J. Artif. Intell. Res..

[57] Te-Tan Liao,et al. A hybrid strategy for the time- and energy-efficient trajectory planning of parallel platform manipulators , 2011 .

[58] Jeffrey L. Elman,et al. Finding Structure in Time , 1990, Cogn. Sci..

[59] Subbarao Kambhampati,et al. Improving Temporal Flexibility of Position Constrained Metric Temporal Plans , 2003, ICAPS.

[60] Ivan Serina,et al. LPG: A Planner Based on Local Search for Planning Graphs with Action Costs , 2002, AIPS.

[61] Andrew Coles,et al. A Hybrid Relaxed Planning Graph'LP Heuristic for Numeric Planning Domains , 2008, ICAPS.

[62] Mingyue Ding,et al. Phase Angle-Encoded and Quantum-Behaved Particle Swarm Optimization Applied to Three-Dimensional Route Planning for UAV , 2012, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[63] J J Hopfield,et al. Neural networks and physical systems with emergent collective computational abilities. , 1982, Proceedings of the National Academy of Sciences of the United States of America.