Sequential decomposition of dynamic games with asymmetric information and dependent states

We consider a finite-horizon dynamic game with asymmetric information with $N$ selfish players, where there exists an underlying state of the system that is a controlled Markov process, controlled by players' actions. In each period, a player makes a common observation of the state together with all the other players, and a private observation, and gets an instantaneous reward which is a function of the state and everyone's' actions. The players' private observations are conditionally independent across time, conditioned on the system state and players' previous actions, however, they are potentially correlated among players in each period. This observation model includes the case when players observe their rewards at the end of each period. We first solve the team version of this problem where users are cooperative i.e. have the same objective. Using these results as motivation for the definition of information state, we present a sequential decomposition methodology to compute \emph{structured perfect Bayesian equilibria} (SPBE) of this game, introduced in~[1]. In general, these equilibria exhibit \textit{signaling} behavior, i.e. players' actions reveal part of their private information that is payoff relevant to other users.

[1]  Yi Ouyang,et al.  Dynamic Games With Asymmetric Information: Common Information Based Perfect Bayesian Equilibria and Sequential Decomposition , 2015, IEEE Transactions on Automatic Control.

[2]  Robert J. Weber,et al.  Distributional Strategies for Games with Incomplete Information , 1985, Math. Oper. Res..

[3]  Demosthenis Teneketzis,et al.  On the design of globally optimal communication strategies for real-time noisy communication systems with noisy feedback , 2008, IEEE Journal on Selected Areas in Communications.

[4]  H. J. Jacobsen,et al.  The One-Shot-Deviation Principle for Sequential Rationality , 1996 .

[5]  Eric Maskin,et al.  Markov Perfect Equilibrium: I. Observable Actions , 2001, J. Econ. Theory.

[6]  M. Spence Job Market Signaling , 1973 .

[7]  R. Bertram,et al.  Stochastic Systems , 2008, Control Theory for Physicists.

[8]  L. Shapley,et al.  Stochastic Games* , 1953, Proceedings of the National Academy of Sciences.

[9]  Hamidreza Tavafoghi Jahromi,et al.  On Design and Analysis of Cyber-Physical Systems with Strategic Agents , 2017 .

[10]  Abhinav Sinha,et al.  Structured perfect Bayesian equilibrium in infinite horizon dynamic games with asymmetric information , 2016, 2016 54th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[11]  Aditya Mahajan,et al.  Optimal Decentralized Control of Coupled Subsystems With Control Sharing , 2011, IEEE Transactions on Automatic Control.

[12]  Eric van Damme,et al.  Non-Cooperative Games , 2000 .

[13]  Tuomas Sandholm,et al.  Safe and Nested Subgame Solving for Imperfect-Information Games , 2017, NIPS.

[14]  T. Başar,et al.  Dynamic Noncooperative Game Theory, 2nd Edition , 1998 .

[15]  G. Mailath,et al.  Repeated Games and Reputations: Long-Run Relationships , 2006 .

[16]  J. Filar,et al.  Competitive Markov Decision Processes , 1996 .

[17]  Yu-Chi Ho Team decision theory and information structures , 1980, Proceedings of the IEEE.

[18]  Achilleas Anastasopoulos,et al.  Stochastic Control of Relay Channels With Cooperative and Strategic Users , 2014, IEEE Transactions on Communications.

[19]  J. Sobel,et al.  Equilibrium Selection in Signaling Games , 1987 .

[20]  Demis Hassabis,et al.  Mastering the game of Go without human knowledge , 2017, Nature.

[21]  Achilleas Anastasopoulos,et al.  Signaling equilibria for dynamic LQG games with asymmetric information , 2016, 2016 IEEE 55th Conference on Decision and Control (CDC).

[22]  Achilleas Anastasopoulos,et al.  Decentralized Bayesian learning in dynamic games , 2016, 2016 54th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[23]  Akira Okada Perfect Bayesian Equilibrium and Sequential Equilibrium , 2011 .

[24]  David M. Kreps,et al.  Signaling Games and Stable Equilibria , 1987 .

[25]  Yi Ouyang,et al.  On stochastic dynamic games with delayed sharing information structure , 2016, 2016 IEEE 55th Conference on Decision and Control (CDC).

[26]  Achilleas Anastasopoulos,et al.  A systematic process for evaluating structured perfect Bayesian equilibria in dynamic games with asymmetric information , 2015, 2016 American Control Conference (ACC).

[27]  Ashutosh Nayyar,et al.  Decentralized Stochastic Control with Partial History Sharing: A Common Information Approach , 2012, IEEE Transactions on Automatic Control.