Reinforcement learning based controller synthesis for flexible aircraft wings

Aeroelastic study of flight vehicles has been a subject of great interest and research in the last several years. Aileron reversal and flutter related problems are due in part to the elasticity of a typical airplane. Structural dynamics of an aircraft wing due to its aeroelastic nature are characterized by partial differential equations. Controller design for these systems is very complex as compared to lumped parameter systems defined by ordinary differential equations. In this paper, a stabilizing statefeedback controller design approach is presented for the heave dynamics of a wing-fuselage model. In this study, a continuous actuator in the spatial domain is assumed. A control methodology is developed by combining the technique of “proper orthogonal decomposition” and approximate dynamic programming. The proper orthogonal decomposition technique is used to obtain a low-order nonlinear lumped parameter model of the infinite dimensional system. Then a near optimal controller is designed using the single-network-adaptive-critic technique. Furthermore, to add robustness to the nominal single-network-adaptive-critic controller against matched uncertainties, an identifier based adaptive controller is proposed. Simulation results demonstrate the effectiveness of the single-network-adaptive-critic controller augmented with adaptive controller for infinite dimensional systems.

[1]  Johnny Evers,et al.  Target tracking strategies for a nonlinear, flexible aircraft-inspired model , 2011, Proceedings of the 2011 American Control Conference.

[2]  S. Balakrishnan,et al.  Optimal dynamic inversion control design for a class of nonlinear distributed parameter systems with continuous and discrete actuators , 2007 .

[3]  S. N. Balakrishnan,et al.  Adaptive-critic based neural networks for aircraft optimal control , 1996 .

[4]  Anneli Folkesson,et al.  Numerical methods for engineers , 2007 .

[5]  Johnny Evers,et al.  Sensitivities & functional gains for a flexible aircraft-inspired model , 2010, Proceedings of the 2010 American Control Conference.

[6]  Radhakant Padhi,et al.  A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems , 2006, Neural Networks.

[7]  P. Christofides,et al.  Nonlinear and Robust Control of PDE Systems: Methods and Applications to Transport-Reaction Processes , 2002 .

[8]  John A. Burns,et al.  Optimal sensor location for robust control of distributed parameter systems , 1994, Proceedings of 1994 33rd IEEE Conference on Decision and Control.

[9]  Radhakant Padhi,et al.  Robust/Optimal Temperature Profile Control of a Re-Entry Vehicle Using Neural Networks , 2006 .

[10]  I. Lasiecka Control of systems governed by partial differential equations: a historical perspective , 1995, Proceedings of 1995 34th IEEE Conference on Decision and Control.

[11]  Petros A. Ioannou,et al.  Robust Adaptive Control , 2012 .

[12]  Miroslav Krstic,et al.  Adaptive control of PDES , 2007 .

[13]  S. S. Ravindran,et al.  Adaptive Reduced-Order Controllers for a Thermal Flow System Using Proper Orthogonal Decomposition , 2001, SIAM J. Sci. Comput..

[14]  P. Holmes,et al.  Turbulence, Coherent Structures, Dynamical Systems and Symmetry , 1996 .

[15]  Radhakant Padhi,et al.  PROPER ORTHOGONAL DECOMPOSITION BASED OPTIMAL CONTROL DESIGN OF HEAT EQUATION WITH DISCRETE ACTUATORS USING NEURAL NETWORKS , 2002 .

[16]  Frank L. Lewis,et al.  Applied Optimal Control and Estimation , 1992 .

[17]  George M. Siouris,et al.  Applied Optimal Control: Optimization, Estimation, and Control , 1979, IEEE Transactions on Systems, Man, and Cybernetics.

[18]  Keum-Shik Hong,et al.  Model reference adaptive control of a time-varying parabolic system , 2000 .

[19]  S.N. Balakrishnan,et al.  Optimal beaver population management using reduced order distributed parameter model and single network adaptive critics , 2004, Proceedings of the 2004 American Control Conference.

[20]  Michael A. Demetriou,et al.  Model Reference Adaptive Control of Distributed Parameter Systems , 1998 .

[21]  Ulrich Eggers,et al.  Introduction To Infinite Dimensional Linear Systems Theory , 2016 .

[22]  Sivaguru S. Ravindran,et al.  Proper Orthogonal Decomposition in Optimal Control of Fluids , 1999 .

[23]  Michael A. Demetriou,et al.  On‐line robust parameter identification for parabolic systems , 2001 .

[24]  Keum-Shik Hong,et al.  Direct adaptive control of parabolic systems: algorithm synthesis, and convergence and stability analysis , 1993, Proceedings of 32nd IEEE Conference on Decision and Control.

[25]  Donald A. Sofge,et al.  Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches , 1992 .

[26]  L. Sirovich Turbulence and the dynamics of coherent structures. I. Coherent structures , 1987 .

[27]  Radhakant Padhi,et al.  Proper orthogonal decomposition based feedback optimal control synthesis of distributed parameter systems using neural networks , 2002, Proceedings of the 2002 American Control Conference (IEEE Cat. No.CH37301).