Subgame Maxmin Strategies in Zero-Sum Stochastic Games with Tolerance Levels

We study subgame $$\phi $$ ϕ -maxmin strategies in two-player zero-sum stochastic games with a countable state space, finite action spaces, and a bounded and universally measurable payoff function. Here, $$\phi $$ ϕ denotes the tolerance function that assigns a nonnegative tolerated error level to every subgame. Subgame $$\phi $$ ϕ -maxmin strategies are strategies of the maximizing player that guarantee the lower value in every subgame within the subgame-dependent tolerance level as given by $$\phi $$ ϕ . First, we provide necessary and sufficient conditions for a strategy to be a subgame $$\phi $$ ϕ -maxmin strategy. As a special case, we obtain a characterization for subgame maxmin strategies, i.e., strategies that exactly guarantee the lower value at every subgame. Secondly, we present sufficient conditions for the existence of a subgame $$\phi $$ ϕ -maxmin strategy. Finally, we show the possibly surprising result that each game admits a strictly positive tolerance function $$\phi ^*$$ ϕ ∗ with the following property: if a player has a subgame $$\phi ^*$$ ϕ ∗ -maxmin strategy, then he has a subgame maxmin strategy too. As a consequence, the existence of a subgame $$\phi $$ ϕ -maxmin strategy for every positive tolerance function $$\phi $$ ϕ is equivalent to the existence of a subgame maxmin strategy.

[1]  Alessandro Abate,et al.  On the effect of perturbation of conditional probabilities in total variation , 2014 .

[2]  J. Yeh,et al.  Martingales and stochastic analysis , 1995 .

[3]  János Flesch,et al.  On refinements of subgame perfect $$\epsilon $$ϵ-equilibrium , 2016, Int. J. Game Theory.

[4]  R. Radner Collusive behavior in noncooperative epsilon-equilibria of oligopolies with long but finite lives , 1980 .

[5]  A. Tversky,et al.  The framing of decisions and the psychology of choice. , 1981, Science.

[6]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[7]  Larry Samuelson,et al.  Contemporaneous perfect epsilon-equilibria , 2005, Games Econ. Behav..

[8]  Véronique Bruyère,et al.  Computer Aided Synthesis: A Game-Theoretic Approach , 2017, DLT.

[9]  János Flesch,et al.  Characterization and simplification of optimal strategies in positive stochastic games , 2018, J. Appl. Probab..

[10]  P. Herings,et al.  Subgame Maxmin Strategies in Zero-Sum Stochastic Games with Tolerance Levels , 2018, Dynamic Games and Applications.

[11]  Improving strategies in stochastic games , 1998, Proceedings of the 37th IEEE Conference on Decision and Control (Cat. No.98CH36171).

[12]  Ayala Mashiah-Yaakovi,et al.  Correlated Equilibria in Stochastic Games with Borel Measurable Payoffs , 2015, Dyn. Games Appl..

[13]  W. Sudderth,et al.  Discrete Gambling and Stochastic Games , 1996 .

[14]  N. Vieille,et al.  Stopping games with randomized strategies , 2001 .

[15]  Donald A. Martin,et al.  The determinacy of Blackwell games , 1998, Journal of Symbolic Logic.

[16]  William D. Sudderth,et al.  Finitely additive stochastic games with Borel measurable payoffs , 1998, Int. J. Game Theory.

[17]  Rida Laraki,et al.  Two-Person Zero-Sum Stochastic Games with Semicontinuous Payoff , 2013, Dyn. Games Appl..

[18]  N. Vieille,et al.  Uniform value in recursive games , 2002 .

[19]  D. Blackwell,et al.  THE BIG MATCH , 1968, Classics in Game Theory.