Learning Techniques and Neural Networks for the Solution of N-Stage Nonlinear Nonquadratic Optimal Control Problems

This paper deals with the problem of designing closed-loop feed-forward control strategies to drive the state of a dynamic system (in general, nonlinear) so as to track any desired trajectory joining the points of given compact sets, while minimizing a certain cost function (in general, nonquadratic). Due to the generality of the problem, conventional methods (e.g., dynamic programming, maximum principle, etc.) are difficult to apply. Then, an approximate solution is sought by constraining control strategies to take on the structure of multi-layer feed-forward neural networks. After discussing the approximation properties of neural control strategies, a particular neural architecture is presented, which is based on what has been called the “Linear-Structure Preserving Principle” (the LISP principle). The original functional problem is then reduced to a nonlinear programming one, and backpropagation is applied to derive the optimal values of the synaptic weights. Recursive equations to compute the gradient components are presented, which generalize the classical adjoint system equations of N-stage optimal control theory. Simulation results related to non-LQ problems show the effectiveness of the proposed method.

[1]  Arthur E. Bryson,et al.  Applied Optimal Control , 1969 .

[2]  P. Werbos,et al.  Beyond Regression : "New Tools for Prediction and Analysis in the Behavioral Sciences , 1974 .

[3]  Robert E. Larson,et al.  State increment dynamic programming , 1968 .

[4]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[5]  G. Siouris,et al.  Optimum systems control , 1979, Proceedings of the IEEE.

[6]  M. Athans,et al.  The design of suboptimal linear time-varying systems , 1968 .

[7]  Thomas Parisini,et al.  Team theory and backpropagation for dynamic routing in communication networks , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[8]  Hecht-Nielsen Theory of the backpropagation neural network , 1989 .

[9]  Thomas Parisini,et al.  Multi-Layer Neural Networks for the Optimal Control of Nonlinear Dynamic Systems , 1991 .

[10]  Kumpati S. Narendra,et al.  Adaptation and learning in automatic systems , 1974 .

[11]  P. Makila,et al.  Computational methods for parametric LQ problems--A survey , 1987 .

[12]  Andrew P. Sage,et al.  Closed Loop Optimization of Fixed Configuration Systems , 1966 .

[13]  Derrick H. Nguyen,et al.  Neural networks for self-learning control systems , 1990 .

[14]  Barry J. Wythoff,et al.  Backpropagation neural networks , 1993 .

[15]  Andrew R. Barron,et al.  Universal approximation bounds for superpositions of a sigmoidal function , 1993, IEEE Trans. Inf. Theory.