Online Adaptive Critic Flight Control

A nonlinear control system comprising a network of networks is taught by the use of a two-phase learning procedure realized through novel training techniques and an adaptive critic design. The neural network controller is trained algebraically, offline, by the observation that its gradients must equal corresponding linear gain matrices at chosen operating points. Online learning by a dual heuristic adaptive critic architecture optimizes performance incrementally over time by accounting for plant dynamics and nonlinear effects that are revealed during large, coupled motions. The method is implemented to control the six-degree-of-freedom simulation of a business jet aircraft over its full operating envelope. The result is a controller that improves its performance while unexpected conditions, such as unmodeled dynamics, parameter variations, and control failures, are experienced for the first time.

[1]  Bernard Etkin,et al.  Dynamics of Atmospheric Flight , 1972 .

[2]  Silvia Ferrari,et al.  Algebraic and adaptive learning in neural control systems , 2002 .

[3]  Robert C. Nelson,et al.  Flight Stability and Automatic Control , 1989 .

[4]  Robert F. Stengel,et al.  Optimal Control and Estimation , 1994 .

[5]  S. F. R. F. Stengel 3 Model-Based Adaptive Critic Designs , 2004 .

[6]  Li Zhi Solution of the Matrix Equation AX-XB=C , 2001 .

[7]  Richard H. Bartels,et al.  Algorithm 432 [C2]: Solution of the matrix equation AX + XB = C [F4] , 1972, Commun. ACM.

[8]  R. Bellman Dynamic programming. , 1957, Science.

[9]  Andrew R. Barron,et al.  Universal approximation bounds for superpositions of a sigmoidal function , 1993, IEEE Trans. Inf. Theory.

[10]  Michael Athans,et al.  Guaranteed properties of gain scheduled control for linear parameter-varying plants , 1991, Autom..

[11]  Paul J. Werbos,et al.  Approximate dynamic programming for real-time control and neural modeling , 1992 .

[12]  P J Webros BACKPROPAGATION THROUGH TIME: WHAT IT DOES AND HOW TO DO IT , 1990 .

[13]  Richard D. Braatz,et al.  On the "Identification and control of dynamical systems using neural networks" , 1997, IEEE Trans. Neural Networks.

[14]  Juri Kalviste Spherical mapping and analysis of aircraft angles for maneuvering flight , 1987 .

[15]  George G. Lendaris,et al.  Adaptive critic design for intelligent steering and speed control of a 2-axle vehicle , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[16]  K. K. Kumar,et al.  Immunized adaptive critics for level 2 intelligent control , 1997, 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation.

[17]  T. T. Shannon,et al.  Application considerations for the DHP methodology , 1998, 1998 IEEE International Joint Conference on Neural Networks Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36227).

[18]  Robert F. Stengel,et al.  Algebraic training of a neural network , 2001, Proceedings of the 2001 American Control Conference. (Cat. No.01CH37148).

[19]  Martin A. Riedmiller,et al.  A direct adaptive method for faster backpropagation learning: the RPROP algorithm , 1993, IEEE International Conference on Neural Networks.

[20]  Ronald A. Howard,et al.  Dynamic Programming and Markov Processes , 1960 .

[21]  Donald E. Kirk,et al.  Optimal control theory : an introduction , 1970 .

[22]  S. N. Balakrishnan,et al.  Adaptive-critic based neural networks for aircraft optimal control , 1996 .

[23]  Robert F. Stengel,et al.  Restructurable control using proportional-integral implicit model following , 1990 .

[24]  James C. Neidhoefer,et al.  Immunized adaptive critics , 1997, Proceedings of International Conference on Neural Networks (ICNN'97).

[25]  Jennie Si,et al.  Online learning control by association and reinforcement , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[26]  R. Stengel,et al.  Classical/Neural Synthesis of Nonlinear Control Systems , 2000 .

[27]  Christopher I. Marrison,et al.  Design of Robust Control Systems for a Hypersonic Aircraft , 1998 .