An efficient trust region method for unconstrained discrete-time optimal control problems

Discrete-time optimal control (DTOC) problems are large-scale optimization problems with a dynamic structure. In previous work this structure has been exploited to provide very fast and efficient local procedures. Two examples are the differential dynamic programming (DDP) algorithm and the stagewise Newton procedure. Both require only O(N) operations per iteration, where N is the number of timesteps, and both exhibit a quadratic convergence rate. However, most algorithms in this category do not have a satisfactory global convergence strategy. The most popular global strategy is shifting, which sometimes works poorly because the shifting element is not adjusted automatically. In this paper we propose a method that combines the trust region idea with the local stagewise Newton's method. The proposed method inherits the advantages of both: it has strong global and local convergence properties yet remains economical. Preliminary numerical results are presented to illustrate the behavior of the proposed algorithm. We also collect in the Appendix some DTOC problems that have appeared in the literature.
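To make the trust region idea concrete, the following is a minimal sketch of a generic trust-region Newton iteration on a one-dimensional function. It is not the stagewise algorithm proposed in the paper and does not exploit any DTOC structure. The function name `trust_region_minimize`, the test function, and the thresholds 0.25, 0.75, and 0.1 are illustrative assumptions. The point it shows is the automatic adjustment: the radius shrinks or grows according to how well the quadratic model predicted the actual decrease.

```python
import math


def trust_region_minimize(f, grad, hess, x0, radius=1.0, tol=1e-8, max_iter=100):
    """Generic 1-D trust-region Newton sketch (hypothetical helper, not the paper's method)."""
    x = x0
    for _ in range(max_iter):
        g, h = grad(x), hess(x)
        if abs(g) < tol:
            break
        # Subproblem: minimize g*s + 0.5*h*s^2 subject to |s| <= radius.
        if h > 0 and abs(g / h) <= radius:
            s = -g / h                     # the full Newton step fits inside the region
        else:
            s = -math.copysign(radius, g)  # otherwise move downhill to the boundary
        predicted = -(g * s + 0.5 * h * s * s)  # decrease promised by the model (> 0)
        actual = f(x) - f(x + s)                # decrease actually obtained
        rho = actual / predicted
        # Adjust the radius automatically (threshold values are assumed, conventional choices).
        if rho < 0.25:
            radius *= 0.25
        elif rho > 0.75 and abs(s) >= radius:
            radius *= 2.0
        # Accept the step only if it achieved a reasonable fraction of the predicted decrease.
        if rho > 0.1:
            x = x + s
    return x


# Assumed nonconvex test function with minimizers at x = -1 and x = +1.
def f(x):
    return x**4 - 2 * x**2


def grad(x):
    return 4 * x**3 - 4 * x


def hess(x):
    return 12 * x**2 - 4


x_star = trust_region_minimize(f, grad, hess, x0=0.1)
print(x_star)
```

At the starting point x = 0.1 the second derivative is negative, so an unmodified Newton step would head toward the local maximum at 0. The trust-region step instead moves to the boundary of the region in the downhill direction, after which ordinary Newton steps take over.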
