Differential dynamic programming and Newton's method for discrete optimal control problems

The purpose of this paper is to draw a detailed comparison between Newton's method, as applied to discrete-time, unconstrained optimal control problems, and the second-order method known as differential dynamic programming (DDP). The main outcomes of the comparison are: (i) DDP does not coincide with Newton's method, but (ii) the methods are close enough that they have the same convergence rate, namely, quadratic.The comparison also reveals some other facts of theoretical and computational interest. For example, the methods differ only in that Newton's method operates on a linear approximation of the state at a certain point at which DDP operates on the exact value. This would suggest that DDP ought to be more accurate, an anticipation borne out in our computational example. Also, the positive definiteness of the Hessian of the objective function is easy to check within the framework of DDP. This enables one to propose a modification of DDP, so that a descent direction is produced at each iteration, regardless of the Hessian.

[1]  A. Booth Numerical Methods , 1957, Nature.

[2]  P. B. Coaker,et al.  Applied Dynamic Programming , 1964 .

[3]  H. Halkin A Maximum Principle of the Pontryagin Type for Systems Described by Nonlinear Difference Equations , 1966 .

[4]  D. Mayne A Second-order Gradient Method for Determining Optimal Trajectories of Non-linear Discrete-time Systems , 1966 .

[5]  Robert E. Larson,et al.  State increment dynamic programming , 1968 .

[6]  James M. Ortega,et al.  Iterative solution of nonlinear equations in several variables , 2014, Computer science and applied mathematics.

[7]  M. Ciletti,et al.  The computation and theory of optimal control , 1972 .

[8]  David Q. Mayne,et al.  Differential dynamic programming , 1972, The Mathematical Gazette.

[9]  Åke Björck,et al.  Numerical Methods , 2021, Markov Renewal and Piecewise Deterministic Processes.

[10]  Daniel Matthys Murray,et al.  DIFFERENTIAL DYNAMIC PROGRAMMING FOR THE EFFICIENT SOLUTION OF OPTIMAL CONTROL PROBLEMS , 1978 .

[11]  S. Yakowitz,et al.  Principles and procedures of numerical analysis , 1978 .

[12]  K. Ohno A new approach to differential dynamic programming for discrete time systems , 1978 .

[13]  S. Yakowitz,et al.  Constrained differential dynamic programming and its application to multireservoir control , 1979 .

[14]  Sidney J. Yakowitz,et al.  The application of optimal control methodology to nonlinear programming problems , 1981, Math. Program..

[15]  S. Yakowitz Dynamic programming applications in water resources , 1982 .

[16]  Sidney J. Yakowitz,et al.  Convergence rate analysis of the state increment dynamic programming method , 1983, Autom..