The MaxMin value of stochastic games with imperfect monitoring

Abstract.We study finite zero-sum stochastic games in which players do not observe the actions of their opponent. Rather, at each stage, each player observes a stochastic signal that may depend on the current state and on the pair of actions chosen by the players. We assume that each player observes the state and his/her own action. We prove that the uniform max-min value always exists. Moreover, the uniform max-min value is independent of the information structure of player 2. Symmetric results hold for the uniform min-max value.

[1]  Jean-Michel Coulomb Absorbing Games with a Signalling Structure , 2001, Math. Oper. Res..

[2]  Ehud Lehrer,et al.  Nash equilibria of n-player repeated games with semi-standard information , 1990 .

[3]  Marie-Françoise Roy,et al.  Real algebraic geometry , 1992 .

[4]  Jean-Michel Coulomb Stochastic games without perfect monitoring , 2003, Int. J. Game Theory.

[5]  R. Radner Monitoring Cooperative Agreements in a Repeated Principal-Agent Relationship , 1981 .

[6]  O. J. Vrieze,et al.  On equilibria in repeated games with absorbing states , 1989 .

[7]  Rakesh V. Vohra,et al.  Correlated equilibrium payoffs and public signalling in absorbing games , 2002, Int. J. Game Theory.

[8]  Robert J. Aumann,et al.  Repeated Games with Incomplete Information , 1995 .

[9]  Jean-Michel Coulomb,et al.  Generalized "Big-Match" , 1999, Math. Oper. Res..

[10]  E. Lehrer Lower equilibrium payoffs in two-player repeated games with non-observable actions , 1989 .

[11]  L. Shapley,et al.  Stochastic Games* , 1953, Proceedings of the National Academy of Sciences.

[12]  R. Vohra,et al.  Correlated Equilibrium, Public Signaling and Absorbing Games , 1999 .

[13]  Eilon Solan Three-Player Absorbing Games , 1999, Math. Oper. Res..

[14]  Noga Alon,et al.  The Probabilistic Method , 2015, Fundamentals of Ramsey Theory.

[15]  Ariel Rubinstein,et al.  Repeated insurance contracts and moral hazard , 1983 .

[16]  N. Vieille,et al.  BLACKWELL OPTIMALITY IN MARKOV DECISION PROCESSES WITH PARTIAL OBSERVATION , 2002 .

[17]  J. Aubin,et al.  Existence of Solutions to Differential Inclusions , 1984 .

[18]  J. M. Coulomb Repeated games with absorbing states and no signals , 1992 .

[19]  Jean-Michelcoul Omb ABSORBING GAMES WITH A SIGNALLING STRUCTURE , 2001 .

[20]  Ehud Lehrer,et al.  On the equilibrium payoffs set of two player repeated games with imperfect monitoring , 1992 .

[21]  Ehud Lehrer,et al.  Correlated Equilibria in Two-Player Repeated Games with Nonobservable Actions , 1992, Math. Oper. Res..