Optimal Containment Control of Unknown Heterogeneous Systems With Active Leaders

This brief presents a partially model-free solution to the distributed containment control of multiagent systems using off-policy reinforcement learning (RL). The followers are assumed to be heterogeneous with different dynamics, and the leaders are assumed to be active in the sense that their control inputs can be nonzero. Optimality is explicitly imposed in solving the containment problem to not only drive the agents’ states into a convex hull of the leaders’ states but also minimize their transient responses. Inhomogeneous algebraic Riccati equations (AREs) are derived to solve the optimal containment control with active leaders. The resulting control protocol for each agent depends on its own state and an estimation of an interior point inside the convex hull spanned by the leaders. This estimation is provided by designing a distributed observer for a trajectory inside the convex hull of active leaders. Only the knowledge of the leaders’ dynamics is required by the observer. An off-policy RL algorithm is developed to solve the inhomogeneous AREs online in real time without requiring any knowledge of the followers’ dynamics. Finally, a simulation example is presented to show the effectiveness of the presented algorithm.

[1]  Changyin Sun,et al.  Adaptive Neural Network Control of a Flapping Wing Micro Aerial Vehicle With Disturbance Observer , 2017, IEEE Transactions on Cybernetics.

[2]  Lorenzo Marconi,et al.  Robust Output Synchronization of a Network of Heterogeneous Nonlinear Agents Via Nonlinear Regulation Theory , 2014, IEEE Transactions on Automatic Control.

[3]  Frank L. Lewis,et al.  Cooperative Control of Multi-Agent Systems: Optimal and Adaptive Design Approaches , 2013 .

[4]  Zhong-Ping Jiang,et al.  A Distributed Control Approach to A Robust Output Regulation Problem for Multi-Agent Linear Systems , 2010, IEEE Transactions on Automatic Control.

[5]  Frank L. Lewis,et al.  Output Containment Control of Linear Heterogeneous Multi-Agent Systems Using Internal Model Principle , 2017, IEEE Transactions on Cybernetics.

[6]  Lin Huang,et al.  Consensus of Multiagent Systems and Synchronization of Complex Networks: A Unified Viewpoint , 2016, IEEE Transactions on Circuits and Systems I: Regular Papers.

[7]  Warren E. Dixon,et al.  Robust containment control in a leader–follower network of uncertain Euler–Lagrange systems , 2016 .

[8]  Wei Ren,et al.  Distributed consensus of linear multi-agent systems with adaptive dynamic protocols , 2011, Autom..

[9]  Warren E. Dixon,et al.  Efficient model-based reinforcement learning for approximate online optimal control , 2015, Autom..

[10]  Frank L. Lewis,et al.  Optimal model-free output synchronization of heterogeneous systems using off-policy reinforcement learning , 2016, Autom..

[11]  Frank L. Lewis,et al.  $ {H}_{ {\infty }}$ Tracking Control of Completely Unknown Continuous-Time Systems via Off-Policy Reinforcement Learning , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[12]  Randal W. Beard,et al.  Consensus seeking in multiagent systems under dynamically changing interaction topologies , 2005, IEEE Transactions on Automatic Control.

[13]  Z. Qu,et al.  Cooperative Control of Dynamical Systems: Applications to Autonomous Vehicles , 2009 .

[14]  Kai-Yew Lum,et al.  Multiagent Information Fusion and Cooperative Control in Target Search , 2013, IEEE Transactions on Control Systems Technology.

[15]  Frank Allgöwer,et al.  An internal model principle is necessary and sufficient for linear output synchronization , 2011, Autom..

[16]  Zhong-Ping Jiang,et al.  Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics , 2012, Autom..

[17]  Claudio De Persis,et al.  On the Internal Model Principle in the Coordination of Nonlinear Systems , 2014, IEEE Transactions on Control of Network Systems.

[18]  Frank L. Lewis,et al.  Linear Quadratic Tracking Control of Partially-Unknown Continuous-Time Systems Using Reinforcement Learning , 2014, IEEE Transactions on Automatic Control.

[19]  Warren E. Dixon,et al.  Containment control for a social network with state-dependent connectivity , 2014, Autom..

[20]  Yongcan Cao,et al.  Distributed Coordination of Multi-agent Networks: Emergent Problems, Models, and Issues , 2010 .

[21]  Frank L. Lewis,et al.  On constructing Lyapunov functions for multi-agent systems , 2015, Autom..

[22]  Warren E. Dixon,et al.  Leader-follower containment control over directed random graphs , 2016, Autom..

[23]  Ziyang Meng,et al.  Distributed Containment Control for Multiple Autonomous Vehicles With Double-Integrator Dynamics: Algorithms and Experiments , 2011, IEEE Transactions on Control Systems Technology.

[24]  Yixin Yin,et al.  Hamiltonian-Driven Adaptive Dynamic Programming for Continuous Nonlinear Dynamical Systems , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[25]  Shankar Sastry,et al.  A calculus for computing Filippov's differential inclusion with application to the variable structure control of robot manipulators , 1986, 1986 25th IEEE Conference on Decision and Control.

[26]  B. Paden,et al.  Lyapunov stability theory of nonsmooth systems , 1993, Proceedings of 32nd IEEE Conference on Decision and Control.

[27]  Mohammad Ali Badamchizadeh,et al.  Containment control of heterogeneous linear multi-agent systems , 2015, Autom..

[28]  Guanghui Wen,et al.  Containment of Higher-Order Multi-Leader Multi-Agent Systems: A Dynamic Output Approach , 2016, IEEE Transactions on Automatic Control.

[29]  Lihua Xie,et al.  Distributed Tracking Control for Linear Multiagent Systems With a Leader of Bounded Unknown Input , 2013, IEEE Transactions on Automatic Control.

[30]  Gang Feng,et al.  Containment control of linear multi‐agent systems with multiple leaders of bounded inputs using distributed continuous controllers , 2013, ArXiv.

[31]  Warren E. Dixon,et al.  Model-Based Reinforcement Learning in Differential Graphical Games , 2017, IEEE Transactions on Control of Network Systems.

[32]  Wenwu Yu,et al.  Second-Order Consensus for Multiagent Systems With Directed Topologies and Nonlinear Dynamics , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).