Heterogeneous Formation Control of Multiple Rotorcrafts with Unknown Dynamics using Reinforcement Learning*

In this paper, a distributed model-free solution to the leader-follower formation control of heterogeneous multi-agent system is proposed using reinforcement learning. The multi-agent system consists of multiple rotorcrafts, including a virtual leader and multiple followers, and no knowledge of the dynamics of leaders and followers is assumed to be known a priori. The formation controller problem is first formulated as an optimal output regulation problem. A discounted performance function is then introduced to guarantee that the tracking error asymptotically converges to zero, and an online off-policy reinforcement learning algorithm is finally proposed to solve the optimal output problem online and using data generated along the agents’ trajectories. A simulation example is provided to validate the effectiveness of the proposed control method.

[1]  Ali Saberi,et al.  Output synchronization for heterogeneous networks of introspective right‐invertible agents , 2014 .

[2]  Frank L. Lewis,et al.  Optimal model-free output synchronization of heterogeneous systems using off-policy reinforcement learning , 2016, Autom..

[3]  Maarouf Saad,et al.  Robust formation control without velocity measurement of the leader robot , 2013 .

[4]  Lu Liu,et al.  Cooperative Output Regulation of Heterogeneous Linear Multi-Agent Systems by Event-Triggered Control , 2017, IEEE Transactions on Cybernetics.

[5]  Jie Lin,et al.  Coordination of groups of mobile autonomous agents using nearest neighbor rules , 2003, IEEE Trans. Autom. Control..

[6]  Anbal Ollero,et al.  Multiple Heterogeneous Unmanned Aerial Vehicles , 2008 .

[7]  Frank L. Lewis,et al.  Linear Quadratic Tracking Control of Partially-Unknown Continuous-Time Systems Using Reinforcement Learning , 2014, IEEE Transactions on Automatic Control.

[8]  F.L. Lewis,et al.  Reinforcement learning and adaptive dynamic programming for feedback control , 2009, IEEE Circuits and Systems Magazine.

[9]  Frank L. Lewis,et al.  Cooperative Control of Multi-Agent Systems: Optimal and Adaptive Design Approaches , 2013 .

[10]  Frank L. Lewis,et al.  Leader-to-Formation Stability of Multiagent Systems: An Adaptive Optimal Control Approach , 2018, IEEE Transactions on Automatic Control.

[11]  Nikhil Nigam,et al.  Control of Multiple UAVs for Persistent Surveillance: Algorithm and Flight Test Results , 2012, IEEE Transactions on Control Systems Technology.

[12]  J. B. Park,et al.  Adaptive formation control in absence of leader's velocity information , 2010 .

[13]  D. Ghose,et al.  Search using multiple UAVs with flight time constraints , 2004, IEEE Transactions on Aerospace and Electronic Systems.

[14]  Jie Zhang,et al.  Formation Control of Heterogeneous Discrete-Time Nonlinear Multi-Agent Systems With Uncertainties , 2017, IEEE Transactions on Industrial Electronics.

[15]  Lili Wang,et al.  Distributed Formation Control of Multi-Agent Systems Using Complex Laplacian , 2014, IEEE Transactions on Automatic Control.

[16]  Sergiu-Dan Stan,et al.  A Novel Robust Decentralized Adaptive Fuzzy Control for Swarm Formation of Multiagent Systems , 2012, IEEE Transactions on Industrial Electronics.

[17]  Ming Xin,et al.  Integrated Optimal Formation Control of Multiple Unmanned Aerial Vehicles , 2012, IEEE Transactions on Control Systems Technology.

[18]  Lihua Xie,et al.  Decentralized Multi-UAV Flight Autonomy for Moving Convoys Search and Track , 2017, IEEE Transactions on Control Systems Technology.

[19]  Yang Zheng,et al.  Robust control of heterogeneous vehicular platoon with uncertain dynamics and communication delay , 2016 .

[20]  Dongbing Gu,et al.  Robust Team Formation Control for Quadrotors , 2018, IEEE Transactions on Control Systems Technology.

[21]  Karl Henrik Johansson,et al.  Distributed Event-Triggered Control for Multi-Agent Systems , 2012, IEEE Transactions on Automatic Control.

[22]  Frank L. Lewis,et al.  $ {H}_{ {\infty }}$ Tracking Control of Completely Unknown Continuous-Time Systems via Off-Policy Reinforcement Learning , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[23]  Zhong-Ping Jiang,et al.  Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics , 2012, Autom..