Shaping Large Population Agent Behaviors Through Entropy-Regularized Mean-Field Games

Mean-field games (MFG) were introduced to efficiently analyze approximate Nash equilibria in large population settings. In this work, we consider entropy-regularized meanfield games with a finite state-action space in a discrete time setting. We show that entropy regularization provides the necessary regularity conditions, that are lacking in the standard finite mean field games. Such regularity conditions enable us to design fixed-point iteration algorithms to find the unique meanfield equilibrium (MFE). Furthermore, the reference policy used in the regularization provides an extra means, through which one can control the behavior of the population. We first formulate the problem as a stochastic game with a large population of N homogeneous agents. We establish conditions for the existence of a Nash equilibrium in the limiting case as N tends to infinity, and we demonstrate that the Nash equilibrium for the infinite population case is also an -Nash equilibrium for the N -agent regularized game, where the sub-optimality is of order O ( 1/ √ N ) . Finally, we verify the theoretical guarantees through a resource allocation example and demonstrate the efficacy of using a reference policy to control the behavior of a large population of agents.

[1]  Yves Achdou,et al.  Mean Field Games: Numerical Methods , 2010, SIAM J. Numer. Anal..

[2]  Naci Saldi,et al.  Q-Learning in Regularized Mean-field Games , 2020, ArXiv.

[3]  Charafeddine Mouzouni,et al.  A Mean Field Game Of Portfolio Trading And Its Consequences On Perceived Correlations , 2019, 1902.09606.

[4]  Ali Esmaili,et al.  Probability and Random Processes , 2005, Technometrics.

[5]  Lei Liu,et al.  A Survey of Swarm Robotics System , 2012, ICSI.

[6]  Zhu Han,et al.  Distributed Interference and Energy-Aware Power Control for Ultra-Dense D2D Networks: A Mean Field Game , 2017, IEEE Transactions on Wireless Communications.

[7]  Roy Fox,et al.  Taming the Noise in Reinforcement Learning via Soft Updates , 2015, UAI.

[8]  Heinz Koeppl,et al.  Approximately Solving Mean Field Games via Entropy-Regularized Deep Reinforcement Learning , 2021, AISTATS.

[9]  Ming Zhou,et al.  Mean Field Multi-Agent Reinforcement Learning , 2018, ICML.

[10]  Alagan Anpalagan,et al.  Mean Field Game-Theoretic Framework for Interference and Energy-Aware Control in 5G Ultra-Dense Networks , 2018, IEEE Wireless Communications.

[11]  Pierre-Louis Lions,et al.  Efficiency of the price formation process in presence of high frequency participants: a mean field game analysis , 2013, 1305.6323.

[12]  Sean P. Meyn,et al.  Learning in Mean-Field Games , 2014, IEEE Transactions on Automatic Control.

[13]  Yasemin Altun,et al.  Relative Entropy Policy Search , 2010 .

[14]  Anthony Stentz,et al.  A comprehensive taxonomy for multi-robot task allocation , 2013, Int. J. Robotics Res..

[15]  M. Dufwenberg Game theory. , 2011, Wiley interdisciplinary reviews. Cognitive science.

[16]  Spring Berman,et al.  Optimized Stochastic Policies for Task Allocation in Swarms of Robots , 2009, IEEE Transactions on Robotics.

[17]  Spring Berman,et al.  Mean-field models in swarm robotics: a survey , 2019, Bioinspiration & biomimetics.

[18]  Aditya Mahajan,et al.  Team optimal control of coupled subsystems with mean-field sharing , 2014, 53rd IEEE Conference on Decision and Control.

[19]  Peter E. Caines,et al.  A Mean Field Game Computational Methodology for Decentralized Cellular Network Optimization , 2017, IEEE Transactions on Control Systems Technology.

[20]  Tamer Basar,et al.  Markov-Nash equilibria in mean-field games with discounted cost , 2016, 2017 American Control Conference (ACC).

[21]  P. Lions,et al.  Jeux à champ moyen. II – Horizon fini et contrôle optimal , 2006 .

[22]  Seiichi Shin,et al.  Decentralized Control of Autonomous Swarm Systems Using Artificial Potential Functions: Analytical Design Guidelines , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).

[23]  P. Lions,et al.  Mean field games , 2007 .

[24]  Vijay Kumar,et al.  The Impact of Diversity on Optimal Control Policies for Heterogeneous Robot Swarms , 2017, IEEE Transactions on Robotics.

[25]  Ulrich Horst,et al.  A Mean Field Game of Optimal Portfolio Liquidation , 2018, Math. Oper. Res..

[26]  Peter E. Caines,et al.  Large population stochastic dynamic games: closed-loop McKean-Vlasov systems and the Nash certainty equivalence principle , 2006, Commun. Inf. Syst..

[27]  Panagiotis Tsiotras,et al.  Learning Nash Equilibria in Zero-Sum Stochastic Games via Entropy-Regularized Policy Approximation , 2021, IJCAI.