Performance‐guaranteed containment control for pure‐feedback multi agent systems via reinforcement learning algorithm

In this article, a performance‐guaranteed containment control scheme based on a reinforcement learning (RL) algorithm is proposed for a class of pure‐feedback multi‐agent systems (MASs) with unmeasurable states. The unknown nonlinear functions are approximated by neural networks (NNs), and an adaptive NN state observer is designed for state estimation. Based on the estimated states, the algebraic loop problem is removed by introducing filtered signals, and the actor‐critic architecture of the RL algorithm is employed to obtain the optimal controller within the backstepping framework. Unlike many optimal control strategies, which derive the actor and critic updating laws from a computationally involved gradient descent algorithm, this article obtains them through a simpler mechanism based on the uniqueness of the optimal solution. In addition, a predefined performance function and an improved error transformation technique are used to keep the containment error within a prescribed boundary. The stability of the closed‐loop system is demonstrated using Lyapunov stability theory and graph theory. Finally, the effectiveness of the proposed method is verified by a simulation example.
