NN Reinforcement Learning Adaptive Control for a Class of Nonstrict-Feedback Discrete-Time Systems

This article investigates the adaptive reinforcement learning (RL) optimal control design problem for a class of nonstrict-feedback discrete-time systems. Based on the approximation ability of neural networks (NNs) and the RL control design technique, an adaptive backstepping RL optimal controller and a minimal learning parameter (MLP) adaptive RL optimal controller are developed by establishing a novel strategic utility function and introducing external function terms. It is proved that the proposed adaptive RL optimal controllers guarantee that all signals in the closed-loop systems are semiglobally uniformly ultimately bounded (SGUUB). The main feature is that the proposed schemes solve an optimal control problem for nonstrict-feedback discrete-time systems that previously reported methods cannot handle. Furthermore, the proposed MLP adaptive optimal control scheme reduces the number of adaptive laws and thus decreases the computational complexity. Finally, simulation results illustrate the validity of the proposed optimal control schemes.
