Nonlinear programming and stationary equilibria in stochastic games

Stationary equilibria in discounted and limiting average finite state/action space stochastic games are shown to be equivalent to global optima of certain nonlinear programs. For zero sum limiting average games, this formulation reduces to a linear objective, nonlinear constraints program, which finds the “best” stationary strategies, even whenε-optimal stationary strategies do not exist, for arbitrarily smallε.

[1]  T. Parthasarathy,et al.  Nonzero-sum stochastic games , 1991 .

[2]  Cyrus Derman,et al.  Finite State Markovian Decision Processes , 1970 .

[3]  M. J. Sobel Noncooperative Stochastic Games , 1971 .

[4]  Philip D Rogers,et al.  NONZERO-SUM STOCHASTIC GAMES , 1969 .

[5]  C. E. Lemke,et al.  Equilibrium Points of Bimatrix Games , 1964 .

[6]  Dean Gillette,et al.  9. STOCHASTIC GAMES WITH ZERO STOP PROBABILITIES , 1958 .

[7]  Arie Hordijk,et al.  Semi-markov strategies in stochastic games , 1983 .

[8]  Elon Kohlberg,et al.  On Stochastic Games with Stationary Optimal Strategies , 1978, Math. Oper. Res..

[9]  D. Blackwell Discrete Dynamic Programming , 1962 .

[10]  O. Mangasarian,et al.  Two-person nonzero-sum games and quadratic programming , 1964 .

[11]  Robert Wilson,et al.  Computing Equilibria of N-Person Games , 1971 .

[12]  D. Blackwell,et al.  THE BIG MATCH , 1968, Classics in Game Theory.

[13]  L. C. Thomas,et al.  Stochastic Games with Finite State and Action Spaces , 1988 .

[14]  O. J. Vrieze,et al.  Stochastic Games with Finite State and Action Spaces. , 1988 .

[15]  H. Kunzi,et al.  Lectu re Notes in Economics and Mathematical Systems , 1975 .

[16]  J. Filar,et al.  On the computation of equilibria in discounted Stochastic games , 1986 .

[17]  Jerzy A. Filar,et al.  Nonlinear programming and stationary strategies in stochastic games , 1986, Math. Program..

[18]  A. M. Fink,et al.  Equilibrium in a stochastic $n$-person game , 1964 .

[19]  J. Rosenmüller On a Generalization of the Lemke–Howson Algorithm to Noncooperative N-Person Games , 1971 .

[20]  Masayuki Takahashi Equilibrium points of stochastic non-cooperative $n$-person games , 1964 .

[21]  A. Hordijk,et al.  Linear Programming and Markov Decision Chains , 1979 .