A mathematical programming based characterization of Nash equilibria of some constrained stochastic games

We consider two classes of constrained finite state-action stochas- tic games. First, we consider a two player nonzero sum single controller con- strained stochastic game with both average and discounted cost criterion. We consider the same type of constraints as in (1), i.e., player 1 has subscrip- tion based constraints and player 2, who controls the transition probabilities, has realization based constraints which can also depend on the strategies of player 1. Next, we consider a N-player nonzero sum constrained stochastic game with independent state processes where each player has average cost cri- terion as discussed in (2). We show that the stationary Nash equilibria of both classes of constrained games, which exists under strong Slater and irreducibil- ity conditions (3), (2), has one to one correspondence with global minima of certain mathematical programs. In the single controller game if the constraints of player 2 do not depend on the strategies of the player 1, then the mathe- matical program reduces to the non-convex quadratic program. In two player independent state processes stochastic game if the constraints of a player do not depend on the strategies of another player, then the mathematical pro- gram reduces to a non-convex quadratic program. Computational algorithms for finding global minima of non-convex quadratic program exist (4), (5) and hence, one can compute Nash equilibria of these constrained stochastic games. Our results generalize some existing results for zero sum games (1), (6), (7). ⋆ A portion of Section 3 (two player case) has been presented in the 8th International

[1]  T. Parthasarathy,et al.  An orderfield property for stochastic games when one player controls transition probabilities , 1981 .

[2]  Jing Hu,et al.  An LPCC approach to nonconvex quadratic programs , 2012, Math. Program..

[3]  Eitan Altman,et al.  Zero-sum constrained stochastic games with independent state processes , 2005, Math. Methods Oper. Res..

[4]  Eitan Altman,et al.  Constrained Markov games with transition probabilities controlled by a single player , 2007, ValueTools '07.

[6]  Mokhtar S. Bazaraa,et al.  Nonlinear Programming: Theory and Algorithms , 1993 .

[8]  O. J. Vrieze Linear programming and undiscounted stochastic games in which one player controls transitions , 1981 .

[9]  T. E. S. Raghavan,et al.  Algorithms for stochastic games — A survey , 1991, ZOR Methods Model. Oper. Res..

[10]  Rachid El Azouzi,et al.  Constrained Stochastic Games in Wireless Networks , 2007, IEEE GLOBECOM 2007 - IEEE Global Telecommunications Conference.

[11]  O. Mangasarian,et al.  Two-person nonzero-sum games and quadratic programming , 1964 .

[12]  Jerzy A. Filar,et al.  Nonlinear programming and stationary equilibria in stochastic games , 1991, Math. Program..

[13]  Michèle Breton,et al.  Algorithms for Stochastic Games , 1991 .

[14]  E. Altman Constrained Markov Decision Processes , 1999 .

[15]  L. Shapley,et al.  Stochastic Games* , 1953, Proceedings of the National Academy of Sciences.

[16]  Eitan Altman,et al.  Constrained cost-coupled stochastic games with independent state processes , 2007, Oper. Res. Lett..

[17]  Eitan Altman,et al.  Constrained Markov Games: Nash Equilibria , 2000 .

[18]  Jerzy A. Filar Quadratic programming and the single-controller stochastic game , 1986 .

[19]  J. Filar,et al.  Competitive Markov Decision Processes , 1996 .

[20]  Onésimo Hernández-Lerma,et al.  Existence of nash equilibria for constrained stochastic games , 2006, Math. Methods Oper. Res..

[21]  Franco Giannessi,et al.  Nonconvex Quadratic Programs, Linear Complementarity Problems, and Integer Linear Programs , 1973, Optimization Techniques.

[22]  John N. Tsitsiklis,et al.  Introduction to linear optimization , 1997, Athena scientific optimization and computation series.

[23]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.