Composite Observer-Based Optimal Attitude-Tracking Control With Reinforcement Learning for Hypersonic Vehicles

This article proposes an observer-based reinforcement learning (RL) control approach to address the optimal attitude-tracking problem and application for hypersonic vehicles in the reentry phase. Due to the unknown uncertainty and nonlinearity caused by parameter perturbation and external disturbance, accurate model information of hypersonic vehicles in the reentry phase is generally unavailable. For this reason, a novel synchronous estimation is proposed to construct a composite observer for hypersonic vehicles, which consists of a neural-network (NN)-based Luenberger-type observer and a synchronous disturbance observer. This solves the identification problem of nonlinear dynamics in the reference control and realizes the estimation of the system state when unknown nonlinear dynamics and unknown disturbance exist at the same time. By synthesizing the information from the composite observer, an RL tracking controller is developed to solve the optimal attitude-tracking control problem. To improve the convergence performance of critic network weights, concurrent learning is employed to replace the traditional persistent excitation condition with a historical experience replay manner. In addition, this article proves that the weight estimation error is bounded when the learning rate satisfies the given sufficient condition. Finally, the numerical simulation demonstrates the effectiveness and superiority of the proposed approaches to attitude-tracking control systems for hypersonic vehicles.

[1]  Yuyan Guo,et al.  Asymmetric integral BLF based state‐constrained flight control using NN and DOB , 2022, International Journal of Robust and Nonlinear Control.

[2]  Jingcheng Wang,et al.  Goal representation adaptive critic design for discrete-time uncertain systems subjected to input constraints: The event-triggered case , 2022, Neurocomputing.

[3]  Xiaowei Zhao,et al.  Optimal Tracking Control for Uncertain Nonlinear Systems With Prescribed Performance via Critic-Only ADP , 2022, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[4]  Shan Xue,et al.  Constrained Event-Triggered H∞ Control Based on Adaptive Dynamic Programming With Concurrent Learning , 2022, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[5]  S. Tong,et al.  Observer-Based Neuro-Adaptive Optimized Control of Strict-Feedback Nonlinear Systems With State Constraints , 2021, IEEE Transactions on Neural Networks and Learning Systems.

[6]  Bo Xu,et al.  Intelligent Control of Flexible Hypersonic Flight Dynamics With Input Dead Zone Using Singular Perturbation Decomposition. , 2021, IEEE Transactions on Neural Networks and Learning Systems.

[7]  Xiaokui Yue,et al.  Review of control and guidance technology on hypersonic vehicle , 2021, Chinese Journal of Aeronautics.

[8]  Derui Ding,et al.  Neural‐network‐based control for discrete‐time nonlinear systems with denial‐of‐service attack: The adaptive event‐triggered case , 2021, International Journal of Robust and Nonlinear Control.

[9]  Xiangwei Bu,et al.  Adaptive dynamic programing design for the neural control of hypersonic flight vehicles , 2021, J. Frankl. Inst..

[10]  Lu Liu,et al.  Distributed Path Following of Multiple Under-Actuated Autonomous Surface Vehicles Based on Data-Driven Neural Predictors via Integral Concurrent Learning , 2021, IEEE Transactions on Neural Networks and Learning Systems.

[11]  Junfei Qiao,et al.  Data-Driven Iterative Adaptive Critic Control Toward an Urban Wastewater Treatment Plant , 2021, IEEE Transactions on Industrial Electronics.

[12]  Ding Wang,et al.  Neural optimal tracking control of constrained nonaffine systems with a wastewater treatment application , 2021, Neural Networks.

[13]  Lei Guo,et al.  Fault-Tolerant Optimal Control for Discrete-Time Nonlinear System Subjected to Input Saturation: A Dynamic Event-Triggered Approach , 2021, IEEE Transactions on Cybernetics.

[14]  Xingling Shao,et al.  Fault-Tolerant Quantized Control for Flexible Air-Breathing Hypersonic Vehicles With Appointed-Time Tracking Performances , 2021, IEEE Transactions on Aerospace and Electronic Systems.

[15]  Michael Basin,et al.  Practical Terminal Sliding Mode Control of Nonlinear Uncertain Active Suspension Systems With Adaptive Disturbance Observer , 2021, IEEE/ASME Transactions on Mechatronics.

[16]  Yunfei Mu,et al.  Decentralized Tracking Optimization Control for Partially Unknown Fuzzy Interconnected Systems via Reinforcement Learning Method , 2021, IEEE transactions on fuzzy systems.

[17]  Rong Su,et al.  Adaptive Resilient Event-Triggered Control Design of Autonomous Vehicles With an Iterative Single Critic Learning Framework , 2021, IEEE Transactions on Neural Networks and Learning Systems.

[18]  Shuyi Shao,et al.  Adaptive Neural Discrete-Time Fractional-Order Control for a UAV System With Prescribed Performance Using Disturbance Observer , 2021, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[19]  Haibo He,et al.  Decentralized Event-Triggered Control for a Class of Nonlinear-Interconnected Systems Using Reinforcement Learning , 2019, IEEE Transactions on Cybernetics.

[20]  Ding Wang,et al.  Adaptive-critic-based hybrid intelligent optimal tracking for a class of nonlinear discrete-time systems , 2021, Eng. Appl. Artif. Intell..

[21]  Xiao Han,et al.  Online policy iteration ADP-based attitude-tracking control for hypersonic vehicles , 2020 .

[22]  Haibo He,et al.  Adaptive Critic Learning and Experience Replay for Decentralized Event-Triggered Control of Nonlinear Interconnected Systems , 2020, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[23]  Xiangpeng Xie,et al.  Observer-Based State Estimation of Discrete-Time Fuzzy Systems Based on a Joint Switching Mechanism for Adjacent Instants , 2020, IEEE Transactions on Cybernetics.

[24]  Dan Wang,et al.  Output-Feedback Cooperative Formation Maneuvering of Autonomous Surface Vehicles With Connectivity Preservation and Collision Avoidance , 2020, IEEE Transactions on Cybernetics.

[25]  Frank L. Lewis,et al.  Robust optimal control for a class of nonlinear systems with unknown disturbances based on disturbance observer and policy iteration , 2020, Neurocomputing.

[26]  Raul Ordonez,et al.  Gradient-Based Discrete-Time Concurrent Learning for Standalone Function Approximation , 2020, IEEE Transactions on Automatic Control.

[27]  Chaomin Luo,et al.  Reinforcement Learning-Based Optimal Stabilization for Unknown Nonlinear Systems Subject to Inputs With Uncertain Constraints , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[28]  Yuanqing Xia,et al.  Six-DOF Spacecraft Optimal Trajectory Planning and Real-Time Attitude Control: A Deep Neural Network-Based Approach , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[29]  Qing-Long Han,et al.  Neural-Network-Based Output-Feedback Control Under Round-Robin Scheduling Protocols , 2019, IEEE Transactions on Cybernetics.

[30]  Haibo He,et al.  Reinforcement learning for robust adaptive control of partially unknown nonlinear systems subject to unmatched uncertainties , 2018, Inf. Sci..

[31]  Haibo He,et al.  Self-learning robust optimal control for continuous-time nonlinear systems with mismatched disturbances , 2018, Neural Networks.

[32]  Haibo He,et al.  Adaptive Critic Nonlinear Robust Control: A Survey , 2017, IEEE Transactions on Cybernetics.

[33]  Haibo He,et al.  Air-Breathing Hypersonic Vehicle Tracking Control Based on Adaptive Dynamic Programming , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[34]  Qichao Zhang,et al.  Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics , 2016, IEEE Transactions on Cybernetics.

[35]  Frank L. Lewis,et al.  Actor–Critic-Based Optimal Tracking for Partially Unknown Nonlinear Discrete-Time Systems , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[36]  Fuchun Sun,et al.  Discrete-time hypersonic flight control based on extreme learning machine , 2014, Neurocomputing.

[37]  Frank L. Lewis,et al.  Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems , 2014, Autom..

[38]  Fuchun Sun,et al.  Adaptive discrete-time controller design with neural network for hypersonic flight vehicle via back-stepping , 2011, Int. J. Control.

[39]  Girish Chowdhary,et al.  Concurrent learning for convergence in adaptive control without persistency of excitation , 2010, 49th IEEE Conference on Decision and Control (CDC).

[40]  Shahriar Keshmiri,et al.  Modeling and simulation of a generic hypersonic vehicle , 2007 .

[41]  M. Mirmirani,et al.  Development of an Aerodynamic Database for a Generic Hypersonic Air Vehicle , 2005 .

[42]  Yun Li,et al.  PID control system analysis, design, and technology , 2005, IEEE Transactions on Control Systems Technology.

[43]  Petros A. Ioannou,et al.  Adaptive Sliding Mode Control Design fo ra Hypersonic Flight Vehicle , 2004 .

[44]  Alberto Bemporad,et al.  The explicit linear quadratic regulator for constrained systems , 2003, Autom..

[45]  Robert F. Stengel,et al.  Robust Nonlinear Control of a Hypersonic Aircraft , 1999 .

[46]  B. Pasik-Duncan,et al.  Adaptive Control , 1996, IEEE Control Systems.

[47]  Stephen P. Boyd,et al.  Necessary and sufficient conditions for parameter convergence in adaptive control , 1986, Autom..

[48]  Jessy W. Grizzle,et al.  Feedback Linearization of Discrete-Time Systems , 1986 .