Master Equation for Discrete-Time Stackelberg Mean Field Games with single leader

In this paper, we consider a discrete-time Stackelberg mean field game with a leader and an infinite number of followers. The leader and the followers each observe types privately that evolve as conditionally independent controlled Markov processes. The leader commits to a dynamic policy and the followers best respond to that policy and each other. Knowing that the followers would play a mean field game based on her policy, the leader chooses a policy that maximizes her reward. We refer to the resulting outcome as a Stackelberg mean field equilibrium (SMFE). In this paper, we provide a master equation of this game that allows one to compute all SMFE. Based on our framework, we consider two numerical examples. First, we consider an epidemic model where the followers get infected based on the mean field population. The leader chooses subsidies for a vaccine to maximize social welfare and minimize vaccination costs. In the second example, we consider a technology adoption game where the followers decide to adopt a technology or a product and the leader decides the cost of one product that maximizes his returns, which are proportional to the people adopting that technology.

[1]  Andrea J. Goldsmith,et al.  Competition in Wireless Systems via Bayesian Interference Games , 2007, ArXiv.

[2]  Yan Ma,et al.  Mean field stochastic games: Monotone costs and threshold policies , 2016, 2016 IEEE 55th Conference on Decision and Control (CDC).

[3]  Tamer Basar,et al.  Dynamic Games in Cyber-Physical Security: An Overview , 2019, Dyn. Games Appl..

[4]  Eric Maskin,et al.  Markov Perfect Equilibrium: I. Observable Actions , 2001, J. Econ. Theory.

[5]  Jean C. Walrand,et al.  How Bad Are Selfish Investments in Network Security? , 2011, IEEE/ACM Transactions on Networking.

[6]  Randall Berry,et al.  The impact of observation and action errors on informational cascades , 2014, 53rd IEEE Conference on Decision and Control.

[7]  Ashutosh Nayyar,et al.  Decentralized Stochastic Control with Partial History Sharing: A Common Information Approach , 2012, IEEE Transactions on Automatic Control.

[8]  Yan Ma,et al.  Mean field stochastic games with binary actions: Stationary threshold policies , 2017, 2017 IEEE 56th Annual Conference on Decision and Control (CDC).

[9]  Asuman Ozdaglar,et al.  Optimal dynamic information provision in traffic routing , 2020, ArXiv.

[10]  P. Lions,et al.  The Master Equation and the Convergence Problem in Mean Field Games , 2015, 1509.02505.

[11]  José R. Correa,et al.  Sloan School of Management Working Paper 4319-03 June 2003 Selfish Routing in Capacitated Networks , 2022 .

[12]  Achilleas Anastasopoulos,et al.  Signaling equilibria for dynamic LQG games with asymmetric information , 2016, 2016 IEEE 55th Conference on Decision and Control (CDC).

[13]  Olivier Guéant,et al.  Mean Field Games and Applications , 2011 .

[14]  Deepanshu Vasal Stochastic Stackelberg games , 2020, ArXiv.

[15]  Achilleas Anastasopoulos,et al.  Decentralized Bayesian learning in dynamic games , 2016, 2016 54th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[16]  L. Shapley,et al.  Stochastic Games* , 1953, Proceedings of the National Academy of Sciences.

[17]  Minyi Huang,et al.  Mean Field Stochastic Games with Binary Action Spaces and Monotone Costs , 2017, 1701.06661.

[18]  Mukund Sundararajan,et al.  Mean Field Equilibria of Dynamic Auctions with Learning , 2014, Manag. Sci..

[19]  Ramesh Johari,et al.  Equilibria of Dynamic Games with Many Players: Existence, Approximation, and Market Structure , 2010, J. Econ. Theory.

[20]  M. Ilić,et al.  Dynamic games-based modeling of electricity markets , 1999, IEEE Power Engineering Society. 1999 Winter Meeting (Cat. No.99CH36233).

[21]  Sriram Vishwanath,et al.  Sequential Decomposition of Graphon Mean Field Games , 2020, ArXiv.

[22]  Non-cooperative dynamic games for general insurance markets , 2018 .

[23]  J. Filar,et al.  Competitive Markov Decision Processes , 1996 .

[24]  Aditya Mahajan,et al.  Reinforcement Learning in Stationary Mean-field Games , 2019, AAMAS.

[25]  Peter E. Caines,et al.  Large population stochastic dynamic games: closed-loop McKean-Vlasov systems and the Nash certainty equivalence principle , 2006, Commun. Inf. Syst..

[26]  D. Bergemann,et al.  Dynamic Auctions: A Survey , 2010 .

[27]  P. Lions,et al.  Mean field games , 2007 .

[28]  Olivier Guéant,et al.  Application of Mean Field Games to Growth Theory , 2008 .

[29]  T. Başar,et al.  Dynamic Noncooperative Game Theory, 2nd Edition , 1998 .

[30]  Deepanshu Vasal,et al.  Sequential Decomposition of Mean-Field Games , 2020, 2020 American Control Conference (ACC).