Distributed Fictitious Play in Potential Games with Time Varying Communication Networks

We propose a distributed algorithm for multiagent systems that aim to optimize a common objective when agents differ in their estimates of the objective-relevant state of the environment. Each agent keeps an estimate of the environment and a model of the behavior of other agents. The model of other agents’ behavior assumes agents choose their actions randomly based on a stationary distribution determined by the empirical frequencies of past actions. At each step, each agent takes the action that maximizes its expectation of the common objective computed with respect to its estimate of the environment and its model of others. We propose a weighted averaging rule with non-doubly stochastic weights for agents to estimate the empirical frequency of past actions of all other agents by exchanging their estimates with their neighbors over a time-varying communication network. Under this averaging rule, we show agents’ estimates converge to the actual empirical frequencies fast enough. This implies convergence of actions to a Nash equilibrium of the game with identical payoffs given by the expectation of the common objective with respect to an asymptotically agreed estimate of the state of the environment.

[1]  Farzad Salehisadaghiani,et al.  Distributed Nash equilibrium seeking: A gossip-based algorithm , 2016, Autom..

[2]  L. Shapley,et al.  Fictitious Play Property for Games with Identical Interests , 1996 .

[3]  Angelia Nedic,et al.  Distributed Algorithms for Aggregative Games on Graphs , 2016, Oper. Res..

[4]  John N. Tsitsiklis,et al.  Problems in decentralized decision making and computation , 1984 .

[5]  L. Shapley,et al.  Potential Games , 1994 .

[6]  Jeff S. Shamma,et al.  Dynamic fictitious play, dynamic gradient play, and distributed convergence to Nash equilibria , 2005, IEEE Transactions on Automatic Control.

[7]  Tamer Basar,et al.  Distributed algorithms for the computation of noncooperative equilibria , 1987, Autom..

[8]  Vijay Kumar,et al.  Robust Control of Mobility and Communications in Autonomous Robot Teams , 2013, IEEE Access.

[9]  Asuman E. Ozdaglar,et al.  Near-Optimal Power Control in Wireless Networks: A Potential Game Approach , 2010, 2010 Proceedings IEEE INFOCOM.

[10]  Alejandro Ribeiro,et al.  Distributed Fictitious Play for Multiagent Systems in Uncertain Environments , 2018, IEEE Transactions on Automatic Control.

[11]  Asuman E. Ozdaglar,et al.  Distributed Subgradient Methods for Multi-Agent Optimization , 2009, IEEE Transactions on Automatic Control.

[12]  Soummya Kar,et al.  Empirical Centroid Fictitious Play: An Approach for Distributed Learning in Multi-Agent Games , 2013, IEEE Transactions on Signal Processing.

[13]  Ali H. Sayed,et al.  Diffusion Adaptation Strategies for Distributed Optimization and Learning Over Networks , 2011, IEEE Transactions on Signal Processing.