论文信息 - Squash-Box Feasibility Driven Differential Dynamic Programming

Squash-Box Feasibility Driven Differential Dynamic Programming

Recently, Differential Dynamic Programming (DDP) and other similar algorithms have become the solvers of choice when performing non-linear Model Predictive Control (nMPC) with modern robotic devices. The reason is that they have a lower computational cost per iteration when compared with off-the-shelf Non-Linear Programming (NLP) solvers, which enables its online operation. However, they cannot handle constraints, and are known to have poor convergence capabilities. In this paper, we propose a method to solve the optimal control problem with control bounds through a squashing function (i.e., a sigmoid, which is bounded by construction). It has been shown that a naive use of squashing functions damage the convergence rate. To tackle this, we first propose to add a quadratic barrier that avoids the difficulty of the plateau produced by the sigmoid. Second, we add an outer loop that adapts both the sigmoid and the barrier; it makes the optimal control problem with the squashing function converge to the original control-bounded problem. To validate our method, we present simulation results for different types of platforms including a multi-rotor, a biped, a quadruped and a humanoid robot.

J. Solà | Carlos Mastalli | Josep Marti-Saumell | Angel Santamaria-Navarro

[1] D. Mayne. A Second-order Gradient Method for Determining Optimal Trajectories of Non-linear Discrete-time Systems , 1966 .

[2] D K Smith,et al. Numerical Optimization , 2001, J. Oper. Res. Soc..

[3] Michael A. Saunders,et al. SNOPT: An SQP Algorithm for Large-Scale Constrained Optimization , 2002, SIAM J. Optim..

[4] Emanuel Todorov,et al. Iterative Linear Quadratic Regulator Design for Nonlinear Biological Movement Systems , 2004, ICINCO.

[5] Lorenz T. Biegler,et al. On the implementation of an interior-point filter line-search algorithm for large-scale nonlinear programming , 2006, Math. Program..

[6] Jorge Nocedal,et al. Knitro: An Integrated Package for Nonlinear Optimization , 2006 .

[7] G. Lantoine,et al. A Hybrid Differential Dynamic Programming Algorithm for Robust Low-Thrust Optimization , 2008 .

[8] John T. Betts,et al. Practical Methods for Optimal Control and Estimation Using Nonlinear Programming , 2009 .

[9] Yuval Tassa,et al. Control-limited differential dynamic programming , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[10] Olivier Stasse,et al. Whole-body model-predictive control applied to the HRP-2 humanoid , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[11] Roland Siegwart,et al. Fast nonlinear Model Predictive Control for unified trajectory optimization and tracking , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[12] C. Karen Liu,et al. Differential dynamic programming with nonlinear constraints , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[13] Moritz Diehl,et al. A Family of Iterative Gauss-Newton Shooting Methods for Nonlinear Optimal Control , 2017, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[14] Nicolas Mansard,et al. Differential Dynamic Programming for Multi-Phase Rigid Contact Dynamics , 2018, 2018 IEEE-RAS 18th International Conference on Humanoid Robots (Humanoids).

[15] Zachary Manchester,et al. ALTRO: A Fast Solver for Constrained Trajectory Optimization , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[16] Ruben Grandia,et al. Feedback MPC for Torque-Controlled Legged Robots , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[17] S. Vijayakumar,et al. Crocoddyl: An Efficient and Versatile Framework for Multi-Contact Optimal Control , 2019, 2020 IEEE International Conference on Robotics and Automation (ICRA).