Discrete-time mean field games in multi-agent systems

In this paper, we investigate the behavior of agents in mean field games where each agent evolves according to a dynamic equation containing the input average and seeks to minimize its long time average (LTA) cost encompassing a population state average (PSA), which is also known as the mean field term. Due to the informational burden resulting from the PSA coupling to the states of all agents, our idea is to find a deterministic function φ to approximate it. It is shown that φ is an approximation of the PSA as the population size N goes to infinity. The resulting decentralized mean field control laws lead the system to achieve mean-consensus asymptotically as time goes to infinity. Furthermore, the optimal controls generate an almost sure asymptotic Nash equilibrium, which implies that the LTA cost of each agent can reach its minimal value as the number of agents increases to infinity. Finally, we consider the socially optimal case where the basic objective is to minimize the social cost as the sum of the individual LTA cost containing the PSA. In this case, it is shown that the decentralized mean field social control strategies are the same as the mean field Nash controls for infinite population systems.

[1]  Peter E. Caines,et al.  Social Optima in Mean Field LQG Control: Centralized and Decentralized Strategies , 2012, IEEE Transactions on Automatic Control.

[2]  Donald D. Dudenhoeffer,et al.  A formation behavior for large-scale micro-robot force deployment , 2000, 2000 Winter Simulation Conference Proceedings (Cat. No.00CH37165).

[3]  Hiroaki Mukaidani,et al.  A numerical analysis of the Nash strategy for weakly coupled large-scale systems , 2006, IEEE Transactions on Automatic Control.

[4]  Mohammad Aldeen,et al.  Stabilization of decentralized control systems , 1997 .

[5]  G. Erickson Differential game models of advertising competition , 1995 .

[6]  Tao Li,et al.  Adaptive Mean Field Games for Large Population Coupled ARX Systems with Unknown Coupling Strength , 2013, Dynamic Games and Applications.

[7]  Minyi Huang,et al.  Large-Population Cost-Coupled LQG Problems With Nonuniform Agents: Individual-Mass Behavior and Decentralized $\varepsilon$-Nash Equilibria , 2007, IEEE Transactions on Automatic Control.

[8]  P K U Swarm,et al.  Swarming Behavior of Multi-agent Systems * , 2004 .

[9]  Val E. Lambson Self-enforcing collusion in large dynamic markets , 1984 .

[10]  Minyi Huang,et al.  Nash Certainty Equivalence in Large Population Stochastic Dynamic Games: Connections with the Physics of Interacting Particle Systems , 2006, Proceedings of the 45th IEEE Conference on Decision and Control.

[11]  P. Lions,et al.  Mean field games , 2007 .

[12]  Han-Fu Chen,et al.  Identification and Stochastic Adaptive Control , 1991 .

[13]  S. Hakimi Optimum Distribution of Switching Centers in a Communication Network and Some Related Graph Theoretic Problems , 1965 .

[14]  Tao Li,et al.  Asymptotically Optimal Decentralized Control for Large Population Stochastic Multiagent Systems , 2008, IEEE Transactions on Automatic Control.

[15]  Olivier Guéant,et al.  Mean Field Games and Applications , 2011 .

[16]  T. Başar,et al.  Dynamic Noncooperative Game Theory , 1982 .

[17]  Tao Li,et al.  Decentralized tracking-type games for multi-agent systems with coupled ARX models: Asymptotic Nash equilibria , 2008, Autom..

[18]  P. Caines,et al.  A Solution to the Consensus Problem via Stochastic Mean Field Control , 2010 .

[19]  Michael Patriksson,et al.  The Traffic Assignment Problem: Models and Methods , 2015 .

[20]  Peter E. Caines,et al.  Nash, Social and Centralized Solutions to Consensus Problems via Mean Field Control Theory , 2013, IEEE Transactions on Automatic Control.