On Lemke processibility of LCP formulations for solving discounted switching control stochastic games

Schultz (J Optim Theory Appl 73(1):89–99, 1992) formulated 2-person, zero-sum, discounted switching control stochastic games as a linear complementarity problem (LCP) and discussed computational results. It remained open to prove or disprove Lemke-processibility of this LCP. We settle this question by providing a counter example to show that Lemke’s algorithm does not always successfully process this LCP.We propose a new LCP formulation with the aim of making the underlying matrix belong to the classes R $$_{0}$$ and E $$_{0}$$ , which would imply Lemke processibility. While the underlying matrix in the new formulation is not $$E_0$$ , we show that it is an R $$_{0}$$ -matrix. Successful processing of Lemke’s algorithm depends on the choice of the initial vector d. Because of the special structure of the LCP in our context, we may, in fact, be able to find a suitable d such that our LCPs are processible by Lemke’s algorithm. We leave this open.

[1]  T. Parthasarathy,et al.  An orderfield property for stochastic games when one player controls transition probabilities , 1981 .

[2]  T. E. S. Raghavan,et al.  Computing Stationary Nash Equilibria of Undiscounted Single-Controller Stochastic Games , 2002, Math. Oper. Res..

[3]  L. Shapley,et al.  Stochastic Games* , 1953, Proceedings of the National Academy of Sciences.

[4]  J. Filar Ordered field property for stochastic games when the player who controls transitions changes from state to state , 1981 .

[5]  Matthew J. Sobel,et al.  Myopic Solutions of Markov Decision Processes and Stochastic Games , 1981, Oper. Res..

[6]  T. E. S. Raghavan,et al.  A finite step algorithm via a bimatrix game to a single controller non-zero sum stochastic game , 1993, Math. Program..

[7]  S. R. Mohan,et al.  Pivoting Algorithms for Some Classes of Stochastic Games: a Survey , 2001, IGTR.

[8]  Stef Tijs,et al.  Stochastic games with state independent transitions and separable rewards , 1984 .

[9]  C. B. García,et al.  Some classes of matrices in linear complementarity theory , 1973, Math. Program..

[10]  Richard W. Cottle,et al.  Linear Complementarity Problem. , 1992 .

[11]  C. E. Lemke,et al.  Bimatrix Equilibrium Points and Mathematical Programming , 1965 .

[12]  T. Raghavan,et al.  Finite-Step Algorithms for Single-Controller and Perfect Information Stochastic Games , 2003 .

[13]  Linear complementarity and discounted switching controller stochastic games , 1992 .

[14]  Jerzy A. Filar,et al.  Nonlinear programming and stationary equilibria in stochastic games , 1991, Math. Program..

[15]  R. Sridhar,et al.  The generalized linear complementarity problem revisited , 1996, Math. Program..

[16]  Katta G. Murty,et al.  Linear complementarity, linear and nonlinear programming , 1988 .

[17]  G. Dantzig,et al.  COMPLEMENTARY PIVOT THEORY OF MATHEMATICAL PROGRAMMING , 1968 .