A trajectory optimization technique for multidimensional nonlinear processes is presented. Problems which are cast in a discrete-time mold are considered. The method is based on dynamic programming and employs a combination of the technique of functional approximation and the method of region-limiting strategies. The cost function at each stage is approximated by a quadratic polynomial in a region which is restricted to be of a size judged appropriate to reduce the error in the approximation. Minimal costs are evaluated at a set of points, called base points. A new control trajectory and an improved state trajectory are then generated within an extrapolation region. The iterative application of this procedure yields an optimal trajectory. Contained in the algorithm is a simple procedure which eliminates matrix inversion to determine the coefficients of the approximating polynomial. The present algorithm is applicable to problems with one bounded control action. It accounts for inequality constraints on state variables in a straightforward manner. The algorithm is applied to solve a number of trajectory optimization problems.
[1]
D. Jacobson,et al.
A transformation technique for optimal control problems with a state variable inequality constraint
,
1969
.
[2]
David G. Luenberger,et al.
Reducing the Memory Requirements of Dynamic Programming
,
1968,
Oper. Res..
[3]
R. Larson,et al.
A survey of dynamic programming computational procedures
,
1967,
IEEE Transactions on Automatic Control.
[4]
D. C. Collins,et al.
Dimensional approximation in dynamic programming by structural decomposition
,
1970
.
[5]
D. Jacobson,et al.
A discrete-time differential dynamic programming algorithm with application to optimal orbit transfer
,
1970
.
[6]
Jr. Henry C. Thacher.
Generalization of Concepts Related to Linear Dependence
,
1958
.
[7]
W. E. Milne,et al.
INTERPOLATION IN SEVERAL VARIABLES
,
1960
.
[8]
G. Whaples.
A Note on Degree-n Independence
,
1958
.