Containment control of heterogeneous systems with active leaders of bounded unknown control using reinforcement learning

This paper solves the containment problem for multi-agent systems on undirected graphs with multiple active leaders using off-policy reinforcement learning (RL). The leaders are active in the sense that their dynamics include a bounded control input that is unknown to all followers, and the followers are heterogeneous, each with different dynamics. Both the steady state and the transient trajectory of each agent are taken into account, so that the proposed containment control is optimal. Inhomogeneous algebraic Riccati equations (AREs) are derived to obtain the optimal containment control protocol. To remove the need for knowledge of the agents' dynamics, an off-policy RL algorithm is developed that solves the inhomogeneous AREs online in real time, without requiring any knowledge of those dynamics. Finally, a simulation example illustrates the effectiveness of the proposed algorithm.
