Attitude Control with Auxiliary Structure Based on Adaptive Dynamic Programming for Reentry Vehicles

This paper presents an attitude control scheme that incorporates adaptive dynamic programming (ADP) for reentry vehicles subject to strong nonlinearity and disturbances. First, the nonlinear attitude dynamics are divided into inner and outer loops according to time-scale separation and the cascade control principle, and a general sliding mode control method is used to construct the main controllers for both loops. Because the main controllers are limited in handling nonlinearity and sudden disturbances, an ADP structure is introduced into the outer attitude loop as an auxiliary. The ADP structure uses neural network estimators to minimize the cost function and generates optimal signals through online learning, compensating for the main controllers' limited adaptation speed and accuracy. Stability is then analyzed by the Lyapunov method, and a parameter selection strategy for the ADP structure is derived to guide implementation. In addition, techniques for speeding up ADP training are proposed. Finally, simulation results show that, for the nonlinear vehicle system, the control strategy with ADP adapts better and responds faster than the one without ADP.
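
The cascade layout described in the abstract can be illustrated with a minimal single-axis sketch. Everything in it is an assumption made for illustration: the toy plant, the gains, the constant disturbance `d`, and the function names are hypothetical and not taken from the paper. In particular, the auxiliary signal `u_aux` is produced by a simple integral-style update that stands in for the ADP structure; it is not an actor-critic network and only shows where an auxiliary signal enters the outer loop.

```python
import numpy as np


def sat(x, width):
    """Boundary-layer saturation used in place of sign() to limit chattering."""
    return float(np.clip(x / width, -1.0, 1.0))


def simulate(use_aux, n_steps=2000, dt=0.01):
    """Toy single-axis cascade: outer loop (angle) -> inner loop (rate).

    All numbers below are illustrative assumptions, not values from the paper.
    Returns the final tracking error theta - theta_ref in radians.
    """
    J = 1.0                  # assumed moment of inertia
    k1, eps1 = 2.0, 0.1      # outer-loop sliding mode gains
    k2, eps2 = 10.0, 0.1     # inner-loop sliding mode gains (faster loop)
    phi = 0.05               # boundary-layer width
    gamma = 2.0              # adaptation rate of the stand-in auxiliary term
    theta_ref = 0.2          # constant attitude command [rad]
    d = 0.05                 # constant disturbance acting on the outer loop

    theta, omega, u_aux = 0.0, 0.0, 0.0
    for _ in range(n_steps):
        e = theta - theta_ref
        if use_aux:
            # Stand-in for the ADP output: an integral-style correction.
            u_aux -= gamma * e * dt
        # Outer loop: main sliding mode law plus the auxiliary signal.
        omega_cmd = -k1 * e - eps1 * sat(e, phi) + u_aux
        # Inner loop: track the commanded rate.
        s2 = omega - omega_cmd
        torque = J * (-k2 * s2 - eps2 * sat(s2, phi))
        # Forward-Euler integration of the toy dynamics.
        theta += (omega + d) * dt
        omega += (torque / J) * dt
    return theta - theta_ref


err_plain = simulate(use_aux=False)
err_aux = simulate(use_aux=True)
print(f"final error without auxiliary term: {err_plain:.5f} rad")
print(f"final error with auxiliary term:    {err_aux:.5f} rad")
```

In this toy setting the main controllers alone settle with a small constant offset caused by the disturbance, while the auxiliary term drives that offset toward zero. A learned ADP signal would occupy the same slot as `u_aux`.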
