Adaptive Reinforcement Learning Neural Network Control for Uncertain Nonlinear System With Input Saturation

In this paper, an adaptive neural network (NN) control problem is investigated for discrete-time nonlinear systems with input saturation. Radial-basis-function (RBF) NNs, including critic NNs and action NNs, are employed to approximate the utility functions and system uncertainties, respectively. In the previous works, a gradient descent scheme is applied to update weight vectors, which may lead to local optimal problem. To circumvent this problem, a multigradient recursive (MGR) reinforcement learning scheme is proposed, which utilizes both the current gradient and the past gradients. As a consequence, the MGR scheme not only eliminates the local optimal problem but also guarantees faster convergence rate than the gradient descent scheme. Moreover, the constraint of actuator input saturation is considered. The closed-loop system stability is developed by using the Lyapunov stability theory, and it is proved that all the signals in the closed-loop system are semiglobal uniformly ultimately bounded (SGUUB). Finally, the effectiveness of the proposed approach is further validated via some simulation results.

[1]  Tahereh Binazadeh,et al.  Application of neural network and genetic algorithm in identification of a model of a variable mass underwater vehicle , 2015 .

[2]  Junsheng Ren,et al.  Adaptive fuzzy robust tracking controller design via small gain approach and its application , 2003, IEEE Trans. Fuzzy Syst..

[3]  F.L. Lewis,et al.  Reinforcement learning and adaptive dynamic programming for feedback control , 2009, IEEE Circuits and Systems Magazine.

[4]  Frank L. Lewis,et al.  Adaptive Optimal Control of Unknown Constrained-Input Systems Using Policy Iteration and Neural Networks , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[5]  Feng Ding,et al.  Recursive Least Squares and Multi-innovation Stochastic Gradient Parameter Estimation Methods for Signal Modeling , 2017, Circuits Syst. Signal Process..

[6]  Shuzhi Sam Ge,et al.  Adaptive NN control for a class of strict-feedback discrete-time nonlinear systems , 2003, Autom..

[7]  Wei He,et al.  Cooperative Adaptive Event-Triggered Control for Multiagent Systems With Actuator Failures , 2019, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[8]  Shaocheng Tong,et al.  Observer-Based Adaptive Fuzzy Fault-Tolerant Optimal Control for SISO Nonlinear Systems , 2019, IEEE Transactions on Cybernetics.

[9]  Zhanshan Wang,et al.  Data-Based Optimal Control of Multiagent Systems: A Reinforcement Learning Design Approach , 2017, IEEE Transactions on Cybernetics.

[10]  Zhongke Shi,et al.  Reinforcement Learning Output Feedback NN Control Using Deterministic Learning Technique , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[11]  Qinmin Yang,et al.  Reinforcement Learning Controller Design for Affine Nonlinear Discrete-Time Systems using Online Approximators , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[12]  Peng Shi,et al.  Novel Neural Control for a Class of Uncertain Pure-Feedback Systems , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[13]  Kevin M. Passino,et al.  Decentralized adaptive control of nonlinear systems using radial basis neural networks , 1999, IEEE Trans. Autom. Control..

[14]  James E. Steck,et al.  Adaptive Feedback Control by Constrained Approximate Dynamic Programming , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[15]  Junwu Zhu,et al.  Robust Gene Circuit Control Design for Time-Delayed Genetic Regulatory Networks Without SUM Regulatory Logic , 2018, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[16]  Yaohong Qu,et al.  Distributed Fault-Tolerant Cooperative Control for Multi-UAVs Under Actuator Fault and Input Saturation , 2019, IEEE Transactions on Control Systems Technology.

[17]  Anthony J. Calise,et al.  Adaptive output feedback control of nonlinear systems using neural networks , 2001, Autom..

[18]  Frank L. Lewis,et al.  Actor-Critic Off-Policy Learning for Optimal Control of Multiple-Model Discrete-Time Systems , 2018, IEEE Transactions on Cybernetics.

[19]  Changyin Sun,et al.  Adaptive Neural Impedance Control of a Robotic Manipulator With Input Saturation , 2016, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[20]  Frank L. Lewis,et al.  Multiple Actor-Critic Structures for Continuous-Time Optimal Control Using Input-Output Data , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[21]  Hongjing Liang,et al.  Adaptive Fuzzy Event-Triggered Control for Stochastic Nonlinear Systems With Full State Constraints and Actuator Faults , 2019, IEEE Transactions on Fuzzy Systems.

[22]  R. Bellman FUNCTIONAL EQUATIONS IN THE THEORY OF DYNAMIC PROGRAMMING. VI. A DIRECT CONVERGENCE PROOF , 1957 .

[23]  Junwu Zhu,et al.  Filter Design with Adaptation to Time-Delay Parameters for Genetic Regulatory Networks , 2018, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[24]  Jing Na,et al.  Finite-Time Convergence Adaptive Neural Network Control for Nonlinear Servo Systems , 2020, IEEE Transactions on Cybernetics.

[25]  V. Borkar,et al.  A unified framework for hybrid control: model and optimal control theory , 1998, IEEE Trans. Autom. Control..

[26]  Shaocheng Tong,et al.  Optimal Control-Based Adaptive NN Design for a Class of Nonlinear Discrete-Time Block-Triangular Systems , 2016, IEEE Transactions on Cybernetics.

[27]  Xiaoguang Liu,et al.  Optimized Adaptive Nonlinear Tracking Control Using Actor–Critic Reinforcement Learning Strategy , 2019, IEEE Transactions on Industrial Informatics.

[28]  Tieshan Li,et al.  Multi-Innovation Gradient Iterative Locally Weighted Learning Identification for A Nonlinear Ship Maneuvering System , 2018 .

[29]  Xiaofeng Liao,et al.  Reinforcement Learning for Constrained Energy Trading Games With Incomplete Information , 2017, IEEE Transactions on Cybernetics.

[30]  Shaocheng Tong,et al.  Adaptive Fuzzy Robust Fault-Tolerant Optimal Control for Nonlinear Large-Scale Systems , 2018, IEEE Transactions on Fuzzy Systems.

[31]  Bart De Schutter,et al.  A Comprehensive Survey of Multiagent Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[32]  Qinglai Wei,et al.  Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming , 2012, Autom..

[33]  Frank L. Lewis,et al.  Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach , 2005, Autom..

[34]  Shaocheng Tong,et al.  Adaptive fuzzy output-feedback control for output constrained nonlinear systems in the presence of input saturation , 2014, Fuzzy Sets Syst..

[35]  Karthikeyan Rajagopal,et al.  Neural Network-Based Solutions for Stochastic Optimal Control Using Path Integrals , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[36]  Haibo He,et al.  Data-Driven Tracking Control With Adaptive Dynamic Programming for a Class of Continuous-Time Nonlinear Systems , 2017, IEEE Transactions on Cybernetics.

[37]  Feng Ding,et al.  Performance analysis of multi-innovation gradient type identification methods , 2007, Autom..

[38]  Young Hoon Joo,et al.  Adaptive Synchronization of Reaction–Diffusion Neural Networks and Its Application to Secure Communication , 2020, IEEE Transactions on Cybernetics.

[39]  Guido Herrmann,et al.  Robust adaptive finite‐time parameter estimation and control for robotic systems , 2015 .

[40]  Xiong Yang,et al.  Adaptive Critic Designs for Event-Triggered Robust Control of Nonlinear Systems With Unknown Dynamics , 2019, IEEE Transactions on Cybernetics.

[41]  Qi Zhou,et al.  Observer-Based Adaptive Event-Triggered Control for Nonstrict-Feedback Nonlinear Systems With Output Constraint and Actuator Failures , 2019, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[42]  Michael J. Frank,et al.  Genetic triple dissociation reveals multiple roles for dopamine in reinforcement learning , 2007, Proceedings of the National Academy of Sciences.

[43]  Mingyu Wang,et al.  Approximation-Based Adaptive Tracking Control for MIMO Nonlinear Systems With Input Saturation , 2015, IEEE Transactions on Cybernetics.

[44]  Shuzhi Sam Ge,et al.  Adaptive Tracking Control of Surface Vessel Using Optimized Backstepping Technique , 2019, IEEE Transactions on Cybernetics.

[45]  Shaocheng Tong,et al.  A DSC Approach to Robust Adaptive NN Tracking Control for Strict-Feedback Nonlinear Systems , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[46]  Qing-Guo Wang,et al.  Stability Analysis of Discrete-Time Neural Networks With Time-Varying Delay via an Extended Reciprocally Convex Matrix Inequality , 2017, IEEE Transactions on Cybernetics.