A game theory approach to multi-agent team cooperation

The main goal of this work is to design a team of agents that can accomplish consensus over a common value for the agents' output in a cooperative manner. First, a semi-decentralized optimal control strategy introduced recently by the authors is utilized which is based on minimization of individual costs using local information. Cooperative game theory is then used to ensure team cooperation by considering a combination of individual costs as a team cost function. Minimization of this cost function results in a set of Pareto-efficient solutions. The choice of Nash-bargaining solution among the set of Pareto-efficient solutions guarantees the minimum individual cost. The Nash-bargaining solution is obtained by maximizing the product of the difference between the costs achieved through the optimal control strategy and the one obtained through the Pareto-efficient solution. The latter solution results in a lower cost for each agent at the expense of requiring full information set. To avoid this drawback additional constraints are added to the structure of the controller by using the linear matrix inequality (LMI) formulation of the minimization problem. Consequently, although the controller is designed to minimize a unique team cost function, it only uses the available information set for each agent.

[1]  Xun Yu Zhou,et al.  Linear matrix inequalities, Riccati equations, and indefinite stochastic linear quadratic controls , 2000, IEEE Trans. Autom. Control..

[2]  Kevin M. Passino,et al.  Stability analysis of swarms , 2003, IEEE Trans. Autom. Control..

[3]  Stephen P. Boyd,et al.  Linear Matrix Inequalities in Systems and Control Theory , 1994 .

[4]  Elham Semsar-Kazerooni,et al.  Optimal consensus algorithms for cooperative team of agents subject to partial information , 2008, Autom..

[5]  Stephen P. Boyd,et al.  Distributed optimization for cooperative agents: application to formation flight , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).

[6]  J. Willems Least squares stationary optimal control and the algebraic Riccati equation , 1971 .

[7]  Richard M. Murray,et al.  Consensus problems in networks of agents with switching topology and time-delays , 2004, IEEE Transactions on Automatic Control.

[8]  Bruno Sinopoli,et al.  Distributed control applications within sensor networks , 2003, Proc. IEEE.

[9]  R. Beard,et al.  Formation feedback control for multiple spacecraft via virtual structures , 2004 .

[10]  Elham Semsar-Kazerooni,et al.  Optimal Control and Game Theoretic Approaches to Cooperative Control of a Team of Multi-Vehicle Unmanned Systems , 2007, 2007 IEEE International Conference on Networking, Sensing and Control.

[11]  Richard M. Murray,et al.  Information flow and cooperative control of vehicle formations , 2004, IEEE Transactions on Automatic Control.

[12]  Rodney Teo,et al.  Decentralized overlapping control of a formation of unmanned aerial vehicles , 2004, Autom..

[13]  E. Yaz Linear Matrix Inequalities In System And Control Theory , 1998, Proceedings of the IEEE.

[14]  Jie Lin,et al.  Coordination of groups of mobile autonomous agents using nearest neighbor rules , 2003, IEEE Trans. Autom. Control..

[15]  R. Pesenti,et al.  Mechanism Design for Optimal Consensus Problems , 2006, Proceedings of the 45th IEEE Conference on Decision and Control.

[16]  Naomi Ehrich Leonard,et al.  Collective motion: bistability and trajectory tracking , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).

[17]  J. A. Fax Optimal and Cooperative Control of Vehicle Formations , 2002 .

[18]  F. Xiao,et al.  Consensus problems for high-dimensional multi-agent systems , 2007 .

[19]  Jacob Engwerda,et al.  LQ Dynamic Optimization and Differential Games , 2005 .

[20]  C. Tomlin,et al.  Decentralized optimization, with application to multiple aircraft coordination , 2002, Proceedings of the 41st IEEE Conference on Decision and Control, 2002..