An Optimal Tracking Approach to Formation Control of Nonlinear Multi-Agent Systems

Formation control of network of multi-agent systems with heterogeneous nonlinear dynamics is formulated as an optimal tracking problem and a decentralized controller is developed using the framework of ‘adaptive critics’ to solve the optimal control problem. The reference signal is assumed available only in online implementation so its dynamics is unavailable for offline training of the neurocontroller. However, this issue is resolved by using a re-optimizatio n of the network output through retraining of the neurocontroller online. Finally, the developed controller is applied to the formation control of multispacecraft orbiting the Earth at different orbits and seeking consensus on their position to dock. I. Introduction HE formation control of network of multi-agent systems with heterogeneous nonlinear dynamics has many application and benefits compared to single agent systems; designing a decentralized controller for such a network of agents though is a challenging problem. Using a decentralized controller, each agent is supposed to have access to its neighbors only while the mission for the whole system is to achieve a desired formation. Solutions to the network of linear agents have been developed by several different authors. A method developed in Ref. 1, called behavior-based decentralized control, performs the multi-agent control for network of agents with ring-wise communication topology by considering two desired behaviors: formation keeping and goal seeking. The role of the communication topology in the formation’s stability is shown in Ref. 2 through the appearance of the eigenvalues of Laplacian matrix of the communication graph in the dynamics of the formation. In Ref. 3 also, the stability of the formation/consensus is shown to be related to the eigenvalues of the Laplacian matrix and for the double-integrator system, the authors of Ref. 3 have designed a controller based on the eigenvalues of this matrix. Minimization of some performance index to reach a consensus is the approach used in Ref. 4 and Ref. 5 for single integrator and double integrator dynamics, respectively. Many of the developed methods in the literature are

[1]  J. Nazuno Haykin, Simon. Neural networks: A comprehensive foundation, Prentice Hall, Inc. Segunda Edición, 1999 , 2000 .

[2]  Randal W. Beard,et al.  Decentralized Scheme for Spacecraft Formation Flying via the Virtual Structure Approach , 2004 .

[3]  Gerardo Lafferriere,et al.  Decentralized control of vehicle formations , 2005, Syst. Control. Lett..

[4]  Chi-Tsong Chen,et al.  Linear System Theory and Design , 1995 .

[5]  Liu Jian-zhen,et al.  Reference model based consensus control of second-order multi-agent systems ⁄ , 2011 .

[6]  Marcel J. Sidi,et al.  Spacecraft Dynamics and Control: A Practical Engineering Approach , 1997 .

[7]  Lei Yang,et al.  Direct Heuristic Dynamic Programming for Nonlinear Tracking Control With Filtered Tracking Error , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[8]  F. Xiao,et al.  Consensus problems for high-dimensional multi-agent systems , 2007 .

[9]  Frank L. Lewis,et al.  Discrete-Time Nonlinear HJB Solution Using Approximate Dynamic Programming: Convergence Proof , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[10]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[11]  Drs Sustainment Online optimal control of nonlinear discrete-time systems using approximate dynamic programming , 2011 .

[12]  Charles R. Johnson,et al.  Topics in Matrix Analysis , 1991 .

[13]  Daizhan Cheng,et al.  Consensus of multi-agent linear dynamic systems† , 2008 .

[14]  Radhakant Padhi,et al.  A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems , 2006, Neural Networks.

[15]  Qinmin Yang,et al.  Near Optimal Neural Network-based Output Feedback Control of Affine Nonlinear Discrete-Time Systems , 2007, 2007 IEEE 22nd International Symposium on Intelligent Control.

[16]  Xue Liu,et al.  Asymptotically stable adaptive critic design for uncertain nonlinear systems , 2009, 2009 American Control Conference.

[17]  Guang-Ren Duan,et al.  Consensus seeking in multiagent cooperative control systems with bounded control input , 2011 .

[18]  Frank L. Lewis,et al.  Cooperative adaptive control for synchronization of second‐order systems with unknown nonlinearities , 2011 .

[19]  W. He,et al.  Consensus control for high-order multi-agent systems [Brief Paper] , 2011 .

[20]  Warren E. Dixon,et al.  Asymptotic tracking by a reinforcement learning-based adaptive critic controller , 2011 .

[21]  Randal W. Beard,et al.  A decentralized approach to formation maneuvers , 2003, IEEE Trans. Robotics Autom..

[22]  M. Xin,et al.  Multi-agent consensus algorithm with obstacle avoidance via optimal control approach , 2011 .

[23]  Carl D. Meyer,et al.  Matrix Analysis and Applied Linear Algebra , 2000 .

[24]  R. Beard,et al.  Formation feedback control for multiple spacecraft via virtual structures , 2004 .

[25]  Richard M. Murray,et al.  Information flow and cooperative control of vehicle formations , 2004, IEEE Transactions on Automatic Control.

[26]  Yongcan Cao,et al.  Optimal Linear-Consensus Algorithms: An LQR Perspective , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[27]  Huaguang Zhang,et al.  A Novel Infinite-Time Optimal Tracking Control Scheme for a Class of Discrete-Time Nonlinear Systems via the Greedy HDP Iteration Algorithm , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[28]  Richard M. Murray,et al.  Consensus Protocols for Undirected Networks of Dynamic Agents with Communication Time-Delays , 2003 .

[29]  Guisheng Zhai,et al.  A matrix inequality based design method for consensus problems in multi-agent systems , 2009, Int. J. Appl. Math. Comput. Sci..

[30]  Hyungbo Shim,et al.  Consensus of high-order linear systems using dynamic output feedback compensator: Low gain approach , 2009, Autom..

[31]  Richard M. Murray,et al.  Consensus problems in networks of agents with switching topology and time-delays , 2004, IEEE Transactions on Automatic Control.