Communication complexity of stochastic games

We derive upper and lower bounds on the communication complexity of determining the existence of pure strategy Nash equilibria for some classes of stochastic games. We prove that pure equilibria of single controller stochastic games and those of SER-SIT (Separable Reward - State Independent Transition) games correspond to those of bimatrix games that are constructed from these stochastic games. Hence we extend communication complexity upper bounds of bimatrix games to these stochastic games. For SER-SIT games, we prove an upper bound of O(n2) which is tight and which coincides with that for bimatrix games. Here n is the number of actions of each player in each state. Note that this bound is independent of the size of the actual payoffs. For single-controller games, we obtain an upper bound of min (O(n2|S|), O(|S| n2 log M)) where S is the set of states and M is the largest entry across all payoff matrices. Further, we reduce bimatrix games to stochastic games and hence, the lower bound extends from bimatrix games to stochastic games as well. We also establish the following results while proving upper bounds for SER-SIT games. To prove that pure equilibria of SER-SIT games correspond to those of auxiliary bimatrix games, we show that every SER-SIT game that has a pure equilibrium has a state-independent pure equilibrium too. We also show that we cannot relax the constraints of separable rewards or state independent transitions. We provide counter examples when the game is SER (but not SIT) and SIT (but not SER).

[1]  C. E. Lemke,et al.  Bimatrix Equilibrium Points and Mathematical Programming , 1965 .

[2]  Stef Tijs,et al.  Stochastic games with state independent transitions and separable rewards , 1984 .

[3]  T. Raghavan,et al.  Finite-Step Algorithms for Single-Controller and Perfect Information Stochastic Games , 2003 .

[4]  Katta G. Murty,et al.  Linear complementarity, linear and nonlinear programming , 1988 .

[5]  K. G. Murty,et al.  LINEAR COMPLEMENTARITY � LINEAR AND NONLINEAR PROGRAMMING Internet Edition , 2006 .

[6]  R. Sridhar,et al.  The generalized linear complementarity problem revisited , 1996, Math. Program..

[7]  Sanjeev Arora,et al.  Computational Complexity: A Modern Approach , 2009 .

[8]  T. E. S. Raghavan,et al.  Computing Stationary Nash Equilibria of Undiscounted Single-Controller Stochastic Games , 2002, Math. Oper. Res..

[9]  S. R. Mohan,et al.  Pivoting Algorithms for Some Classes of Stochastic Games: a Survey , 2001, IGTR.

[10]  Bernd Gärtner,et al.  Simple Stochastic Games and P-Matrix Generalized Linear Complementarity Problems , 2005, FCT.

[11]  Matthew J. Sobel,et al.  Myopic Solutions of Markov Decision Processes and Stochastic Games , 1981, Oper. Res..

[12]  T. E. S. Raghavan,et al.  A finite step algorithm via a bimatrix game to a single controller non-zero sum stochastic game , 1993, Math. Program..

[13]  Anne Condon,et al.  The Complexity of Stochastic Games , 1992, Inf. Comput..

[14]  Linear complementarity and discounted switching controller stochastic games , 1992 .

[15]  Jerzy A. Filar,et al.  Nonlinear programming and stationary equilibria in stochastic games , 1991, Math. Program..

[16]  T. Raghavan,et al.  Perfect information stochastic games and related classes , 1997 .

[17]  L. Shapley,et al.  Stochastic Games* , 1953, Proceedings of the National Academy of Sciences.

[18]  Eitan Altman,et al.  Zero-sum Markov games and worst-case optimal control of queueing systems , 1995, Queueing Syst. Theory Appl..

[19]  T. Parthasarathy,et al.  An orderfield property for stochastic games when one player controls transition probabilities , 1981 .

[20]  Jeannette M. Wing,et al.  Game strategies in network security , 2005, International Journal of Information Security.

[21]  Venkat Anantharam An approach to the design of high speed networks for bursty traffic , 1993, Proceedings of 32nd IEEE Conference on Decision and Control.