论文信息 - Actor-Critic Reinforcement Learning Control of Non-Strict Feedback Nonaffine Dynamic Systems

Actor-Critic Reinforcement Learning Control of Non-Strict Feedback Nonaffine Dynamic Systems

The most focuses of the existing actor-critic reinforcement learning control (ARLC) are on dealing with continuous affine systems or discrete nonaffine systems. In this paper, I propose a new ARLC method for continuous nonaffine dynamic systems subject to unknown dynamics and external disturbances. A new input-to-state stable system is developed to establish an augmented dynamic system, from which I further get a strict-feedback affine model that is convenient for control designing based on a model transformation approach. The Nussbaum function is connected with a fuzzy approximation to devise an actor network whose tracking performance is further enhanced via strengthening signals generated by a fuzzy critic network. The stability of the closed-loop control system is guaranteed by the Lyapunov synthesis. Finally, the comparison simulation results are presented to verify the design.

Xiangwei Bu | Xiangwei Bu

[1] Qiong Wang,et al. Concise Neural Nonaffine Control of Air-Breathing Hypersonic Vehicles Subject to Parametric Uncertainties , 2017 .

[2] Jiaqi Huang,et al. Guaranteeing preselected tracking quality for air-breathing hypersonic non-affine models with an unknown control direction via concise neural control , 2016, J. Frankl. Inst..

[3] S. R. Nekoo,et al. Finite-time state-dependent Riccati equation for time-varying nonaffine systems: rigid and flexible joint manipulator control. , 2015, ISA transactions.

[4] Shaocheng Tong,et al. Adaptive Neural Networks Prescribed Performance Control Design for Switched Interconnected Uncertain Nonlinear Systems , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[5] Haibo He,et al. Air-Breathing Hypersonic Vehicle Tracking Control Based on Adaptive Dynamic Programming , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[6] Barjeev Tyagi,et al. Adaptive Critic Design Using Policy Iteration Technique for LTI Systems: A Comprehensive Performance Analysis , 2015, Journal of Control, Automation and Electrical Systems.

[7] Xiangwei Bu,et al. Guaranteeing prescribed performance for air-breathing hypersonic vehicles via an adaptive non-affine tracking controller , 2018, Acta Astronautica.

[8] Haibo He,et al. Adaptive Critic Nonlinear Robust Control: A Survey , 2017, IEEE Transactions on Cybernetics.

[9] Haibo He,et al. Adaptive Learning and Control for MIMO System Based on Adaptive Dynamic Programming , 2011, IEEE Transactions on Neural Networks.

[10] Xiangwei Bu,et al. Guaranteeing prescribed output tracking performance for air-breathing hypersonic vehicles via non-affine back-stepping control design , 2017 .

[11] Frank L. Lewis,et al. Adaptive critic design using non‐linear network structures , 2003 .

[12] Qiuye Sun,et al. Adaptive critic design-based robust neural network control for nonlinear distributed parameter systems with unknown dynamics , 2015, Neurocomputing.

[13] Xiangwei Bu,et al. Air-Breathing Hypersonic Vehicles Funnel Control Using Neural Approximation of Non-affine Dynamics , 2018, IEEE/ASME Transactions on Mechatronics.

[14] Zhong-Ping Jiang,et al. Adaptive dynamic programming and optimal control of nonlinear nonaffine systems , 2014, Autom..

[15] Robert F. Stengel,et al. Online Adaptive Critic Flight Control , 2004 .

[16] Frank L. Lewis,et al. Discrete-Time Local Value Iteration Adaptive Dynamic Programming: Convergence Analysis , 2018, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[17] Chuan-Kai Lin. Robust adaptive critic control of nonlinear systems using fuzzy basis function networks: An LMI approach , 2007, Inf. Sci..

[18] Qinglai Wei,et al. Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming , 2012, Autom..

[19] Xiangwei Bu,et al. A new prescribed performance control approach for uncertain nonlinear dynamic systems via back-stepping , 2018, J. Frankl. Inst..

[20] Chuan-Kai Lin,et al. Adaptive critic autopilot design of Bank-to-turn missiles using fuzzy basis function networks , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[21] Shuzhi Sam Ge,et al. A direct method for robust adaptive nonlinear control with guaranteed transient performance , 1999 .

[22] Youxian Sun,et al. Adaptive neural control of high-order uncertain nonaffine systems: A transformation to affine systems approach , 2014, Autom..

[23] Wee Chin Wong,et al. Approximate dynamic programming approach for process control , 2009 .

[24] Wei-Song Lin,et al. Adaptive critic motion control design of autonomous wheeled mobile robot by dual heuristic programming , 2008, Autom..

[25] Shaocheng Tong,et al. Adaptive Fuzzy Tracking Control Design for SISO Uncertain Nonstrict Feedback Nonlinear Systems , 2016, IEEE Transactions on Fuzzy Systems.

[26] Derong Liu,et al. Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning , 2014, Neural Networks.

[27] Niket S. Kaisare,et al. Approximate dynamic programming based control of hyperbolic PDE systems using reduced-order models from method of characteristics , 2013, Comput. Chem. Eng..

[28] Qiuye Sun,et al. Adaptive dynamic programming-based optimal control of unknown nonaffine nonlinear discrete-time systems with proof of convergence , 2012, Neurocomputing.

[29] Yun Zhang,et al. Neural Network Learning and Robust Stabilization of Nonlinear Systems With Dynamic Uncertainties , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[30] Rui Zhang,et al. Tracking differentiator design for the robust backstepping control of a flexible air-breathing hypersonic vehicle , 2015, J. Frankl. Inst..

[31] Zhongke Shi,et al. DOB-Based Neural Control of Flexible Hypersonic Flight Vehicle Considering Wind Effects , 2017, IEEE Transactions on Industrial Electronics.