Common Information Based Markov Perfect Equilibria for Stochastic Games With Asymmetric Information: Finite Games

A model of stochastic games where multiple controllers jointly control the evolution of the state of a dynamic system but have access to different information about the state and action processes is considered. The asymmetry of information among the controllers makes it difficult to compute or characterize Nash equilibria. Using the common information among the controllers, the game with asymmetric information is used to construct another game with symmetric information such that the equilibria of the new game can be transformed to equilibria of the original game. Further, under certain conditions, a Markov state is identified for the new symmetric information game and its Markov perfect equilibria are characterized. This characterization provides a backward induction algorithm to find Nash equilibria of the original game with asymmetric information in pure or behavioral strategies. Each step of this algorithm involves finding Bayesian Nash equilibria of a one-stage Bayesian game. The class of Nash equilibria of the original game that can be characterized in this backward manner are named common information based Markov perfect equilibria.

[1]  L. Shapley,et al.  Stochastic Games* , 1953, Proceedings of the National Academy of Sciences.

[2]  Y. Ho,et al.  On a class of linear stochastic differential games , 1968 .

[3]  W. Willman Formal solutions for a class of stochastic pursuit-evasion games , 1968 .

[4]  D. Luenberger,et al.  Differential games with imperfect state information , 1969 .

[5]  M. J. Sobel Noncooperative Stochastic Games , 1971 .

[6]  Y. Ho On the minimax principle and zero sum stochastic differential games , 1972, CDC 1972.

[7]  Basar Tamer,et al.  A multistage pursuit-evasion game that admits a gaussian random process as a maximin control policy , 1975 .

[8]  Y. Ho On the Minimax Principle andZero-Sum Stochastic Differential Games , 1976 .

[9]  Tamer Ba csar Two-criteria LQG decision problems with one-step delay observation sharing pattern , 1978 .

[10]  T. Başar Decentralized multicriteria optimization of linear stochastic systems , 1978 .

[11]  Tamer Basar,et al.  Two-Criteria LQG Decision Problems with One-Step Delay Observation Sharing Pattern , 1978, Inf. Control..

[12]  T. Başar On the saddle-point solution of a class of stochastic differential games , 1981, 1979 18th IEEE Conference on Decision and Control including the Symposium on Adaptive Processes.

[13]  Yu-Chi Ho,et al.  Team decision theory and information structures , 1980 .

[14]  Pravin Varaiya,et al.  Stochastic Systems: Estimation, Identification, and Adaptive Control , 1986 .

[15]  Drew Fudenberg,et al.  Game theory (3. pr.) , 1991 .

[16]  Roger B. Myerson,et al.  Game theory - Analysis of Conflict , 1991 .

[17]  Ariel Rubinstein,et al.  A Course in Game Theory , 1995 .

[18]  J. Filar,et al.  Competitive Markov Decision Processes , 1996 .

[19]  Harold L. Cole,et al.  Dynamic Games with Hidden Actions and Hidden States , 2001, J. Econ. Theory.

[20]  Eric Maskin,et al.  Markov Perfect Equilibrium: I. Observable Actions , 2001, J. Econ. Theory.

[21]  J. Hespanha,et al.  Nash equilibria in partial-information games on Markov chains , 2001, Proceedings of the 40th IEEE Conference on Decision and Control (Cat. No.01CH37228).

[22]  Eitan Altman,et al.  Stochastic games with one step delay sharing information pattern with application to power control , 2009, 2009 International Conference on Game Theory for Networks.

[23]  Ashutosh Nayyar,et al.  Optimal Control Strategies in Delayed Sharing Information Structures , 2010, IEEE Transactions on Automatic Control.

[24]  Tamer Basar,et al.  Dynamic stochastic games with asymmetric information , 2012, 2012 IEEE 51st IEEE Conference on Decision and Control (CDC).

[25]  Ashutosh Nayyar,et al.  Decentralized Stochastic Control with Partial History Sharing: A Common Information Approach , 2012, IEEE Transactions on Automatic Control.