Decentralised dynamic games for large population stochastic multi-agent systems

In this study, the authors consider the large population dynamic games where each agent evolves according to a dynamic equation containing the input average of all agents. The long time average (LTA) cost that each agent aims to minimise is coupled with other agents’ states via a population state average (PSA), which is also known as the mean field term. In order to design decentralised controls, the Nash certainty equivalence is introduced. It is shown that the resulting decentralised mean field control laws lead the system to mean-consensus asymptotically as time goes to infinity. The stability property of the mass behaviour and the almost sure asymptotic Nash equilibrium property of the optimal controls are also guaranteed, and the case with non-linear system dynamics is also discussed. In addition, the influence of inaccurate mean field information on individual agent is analysed. Finally, they investigate the socially cooperative formulation where the objective is to minimise the social cost as the sum of all individual LTA costs containing the PSA. In this case, they show that the decentralised mean field social controls are the same as the mean field Nash controls for infinite population systems.

[1]  Minyi Huang,et al.  Nash Certainty Equivalence in Large Population Stochastic Dynamic Games: Connections with the Physics of Interacting Particle Systems , 2006, Proceedings of the 45th IEEE Conference on Decision and Control.

[2]  Xiao Fan Wang,et al.  Flocking of Multi-Agents With a Virtual Leader , 2009, IEEE Trans. Autom. Control..

[3]  P. Caines,et al.  Individual and mass behaviour in large population stochastic wireless power control problems: centralized and Nash equilibrium solutions , 2003, 42nd IEEE International Conference on Decision and Control (IEEE Cat. No.03CH37475).

[4]  Minyi Huang,et al.  Nash Strategies and Adaptation for Decentralized Games Involving Weakly-coupled Agents , 2005, Proceedings of the 44th IEEE Conference on Decision and Control.

[5]  G. Erickson Differential game models of advertising competition , 1995 .

[6]  Peter E. Caines,et al.  Social Optima in Mean Field LQG Control: Centralized and Decentralized Strategies , 2012, IEEE Transactions on Automatic Control.

[7]  Val E. Lambson Self-enforcing collusion in large dynamic markets , 1984 .

[8]  E. Davison,et al.  On the stabilization of decentralized control systems , 1973 .

[9]  Werner Ebeling,et al.  Noise-induced transition from translational to rotational motion of swarms. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[10]  Long Wang,et al.  Swarming behavior of multi-agent systems , 2004, math/0405405.

[11]  Peter E. Caines,et al.  Nash, Social and Centralized Solutions to Consensus Problems via Mean Field Control Theory , 2013, IEEE Transactions on Automatic Control.

[12]  Minyi Huang,et al.  Large-Population Cost-Coupled LQG Problems With Nonuniform Agents: Individual-Mass Behavior and Decentralized $\varepsilon$-Nash Equilibria , 2007, IEEE Transactions on Automatic Control.

[13]  Tao Li,et al.  Asymptotically Optimal Decentralized Control for Large Population Stochastic Multiagent Systems , 2008, IEEE Transactions on Automatic Control.

[14]  Tao Li,et al.  Decentralized tracking-type games for multi-agent systems with coupled ARX models: Asymptotic Nash equilibria , 2008, Autom..

[15]  Sean P. Meyn,et al.  Learning in Mean-Field Games , 2014, IEEE Transactions on Automatic Control.

[16]  Donald D. Dudenhoeffer,et al.  A formation behavior for large-scale micro-robot force deployment , 2000, 2000 Winter Simulation Conference Proceedings (Cat. No.00CH37165).

[17]  Reza Olfati-Saber,et al.  Consensus and Cooperation in Networked Multi-Agent Systems , 2007, Proceedings of the IEEE.

[18]  Tao Li,et al.  Adaptive Mean Field Games for Large Population Coupled ARX Systems with Unknown Coupling Strength , 2013, Dynamic Games and Applications.

[19]  P. Caines,et al.  Large-population cost-coupled LQG problems: generalizations to non-uniform individuals , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).