Gradient dynamic programming for stochastic optimal control of multidimensional water resources systems

A new computational algorithm is presented for the solution of discrete time linearly constrained stochastic optimal control problems decomposable in stages. The algorithm, designated gradient dynamic programming, is a backward moving stagewise optimization. The main innovations over conventional discrete dynamic programming (DDP) are in the functional representation of the cost-to-go function and the solution of the single-stage problem. The cost-to-go function (assumed to be of requisite smoothness) is approximated within each element defined by the discretization scheme by the lowest-order polynomial which preserve its values and the values of its gradient with respect to the state variables at all nodes of the discretization grid. The improved accuracy of this Hermitian interpolation scheme reduces the effect of discretization error and allows the use of coarser grids which reduces the dimensionality of the problem. At each stage, the optimal control is determined on each node of the discretized state space using a constrained Newton-type optimization procedure which has quadratic rate of convergence. The set of constraints which act as equalities is determined from an active set strategy which converges under lenient convexity requirements. This method of solving the single-stage optimization is much more efficient than the conventional way based on enumeration or iterative methods with linear rate of convergence. Once the optimal control is determined, the cost-to-go function and its gradient with respect to the state variables is calculated to be used at the next stage. The proposed technique permits the efficient optimization of stochastic systems whose high dimensionality does not permit solution under the conventional DDP framework and for which successive approximation methods are not directly applicable due to stochasticity. Results for a four-reservoir example are presented. The purpose of this paper is to present a new computational algorithm for the stochastic optimization of sequential decision problems. One important and extensively studied class of such problems in the area of water resources is the discrete time optimal control of multireservoir systems under stochastic inflows. Other applications include the optimal design and operation of sewer systems [e.g., Mays and Wenzel, 1976; Labadie et al., 1980], the optimal conjunctive utilization of surface and groundwater resources [e.g., Buras, 1972], and the minimum cost water quality maintenance in rivers [e.g., Dracup and Fogarty, 1974; Chang and Yeh, 1973], to mention only a few of the water resources applications and pertinent references. An extensive review of dynamic programming applications in water resources can be found in the works by Yakowitz [1982] and Yeh [1985]. Before we proceed with the

[1]  P. B. Coaker,et al.  Applied Dynamic Programming , 1964 .

[2]  Robert E. Larson,et al.  State increment dynamic programming , 1968 .

[3]  E. Lee,et al.  APPLYING GRADIENT PROJECTION AND CONJUGATE GRADIENT TO THE OPTIMUM OPERATION OF RESERVOIRS1 , 1970 .

[4]  V. T. Chow,et al.  Discrete Differential Dynamic Programing Approach to Water Resources Systems Optimization , 1971 .

[5]  Nathan Buras Scientific allocation of water resources , 1971 .

[6]  David Q. Mayne,et al.  Differential dynamic programming , 1972, The Mathematical Gazette.

[7]  William W.-G. Yeh,et al.  OPTIMAL ALLOCATION OF ARTIFICIAL AERATION ALONG A POLLUTED STREAM USING DYNAMIC PROGRAMMING1 , 1973 .

[8]  E. Polak Introduction to linear and nonlinear programming , 1973 .

[9]  Optimal Planning for a Thermal Discharge Treatment System , 1974 .

[10]  R. H. Hicks,et al.  A nonlinear programming approach to a very large hydroelectric system optimization , 1974, Math. Program..

[11]  David R. Maidment,et al.  Computer time and memory requirements for DP and DDDP in water resource systems analysis , 1975 .

[12]  L. Mays,et al.  Optimal design of multilevel branching sewer systems , 1976 .

[13]  Dimitri P. Bertsekas,et al.  Numerical methods for constrained optimization , 1976 .

[14]  James W. Daniel,et al.  Splines and efficiency in dynamic programming , 1976 .

[15]  Robert E. Larson,et al.  Principles of Dynamic Programming , 1978 .

[16]  Leon Lapidus,et al.  Studies in approximation methods—I: Splines and control via discrete dynamic programming , 1978 .

[17]  William W.-G. Yeh,et al.  A NONLINEAR PROGRAMMING ALGORITHM FOR REAL‐TIME HOURLY RESERVOIR OPERATIONS , 1979 .

[18]  S. Yakowitz,et al.  Constrained differential dynamic programming and its application to multireservoir control , 1979 .

[19]  Melanie L. Lenard,et al.  A computational study of active set strategies in nonlinear programming with linear constraints , 1979, Math. Program..

[20]  S. Gal Optimal management of a multireservoir water supply system , 1979 .

[21]  John W. Labadie,et al.  Optimal Control of Unsteady Combined Sewer Flow , 1980 .

[22]  Hermann Engles,et al.  Numerical quadrature and cubature , 1980 .

[23]  S. Yakowitz Dynamic programming applications in water resources , 1982 .

[24]  Dan Weiner,et al.  A stochastic dynamic programing model for the operation of the Mediterranean-Dead Sea Project , 1982 .

[25]  B. F. Sule,et al.  Stochastic dynamic programming models for reservoir operation optimization , 1984 .

[26]  William W.-G. Yeh,et al.  Reservoir Management and Operations Models: A State‐of‐the‐Art Review , 1985 .

[27]  Ian C. Goulter,et al.  Practical implications in the use of stochastic dynamic programming for reservoir operation , 1985 .

[28]  M. Papageorgiou Optimal Multireservoir Network Control by the Discrete Maximum Principle , 1985 .

[29]  P. Kitanidis,et al.  Error analysis of conventional discrete and gradient dynamic programming , 1987 .

[30]  P. Kitanidis A first-order approximation to stochastic optimal control of reservoirs , 1987 .

[31]  R. Fletcher Practical Methods of Optimization , 1988 .