Finite- and Infinite-Horizon Shapley Games with Nonsymmetric Partial Observation

We consider asymmetric partially observed Shapley-type finite-horizon and infinite-horizon games where the state, a controlled Markov chain $\{X_t\}$, is not observable to one player (minimizer) who observes only a state-dependent signal $\{Y_t\}$. The maximizer observes both. The minimizer is informed of the maximizer's action after (before) choosing his control in the MINMAX (MAXMIN) game. A nontrivial open problem in such situations is how the minimizer can use this knowledge to update his belief about $\{X_t\}$. To address this, the maximizer uses off-line control functions which are known to the minimizer. Using these, novel control-parameterized nonlinear filters are constructed which are proved to characterize the conditional distribution of the full path of $\{X_t\}$. Using these filters, recursive algorithms are developed which show that saddle-points exist in both behavioral and Markov strategies for the finite-horizon case in both games. These algorithms are extended to prove saddle-points in M...

[1]  M. K rn,et al.  Stochastic Optimal Control , 1988 .

[2]  L. Stettner,et al.  Approximations of discrete time partially observed control problems , 1994 .

[3]  S. Sorin “Big Match” with lack of information on one side (part i) , 1984 .

[4]  Mrinal K. Ghosh,et al.  Partially observed semi-Markov zero-sum games with average payoff ✩ , 2008 .

[5]  Sylvain Sorin,et al.  A 2-Person Game with Lack of Information on 1½ Sides , 1985, Math. Oper. Res..

[6]  S. Zamir,et al.  "Big match" with lack of information on one side (Part II) , 1984 .

[7]  O. Kallenberg Foundations of Modern Probability , 2021, Probability Theory and Stochastic Modelling.

[8]  L. S. Zaremba,et al.  Existence of value in differential games with fixed time duration , 1982 .

[9]  L. Shapley,et al.  Stochastic Games* , 1953, Proceedings of the National Academy of Sciences.

[10]  M. K. Ghosh,et al.  Zero-Sum Stochastic Games with Partial Information , 2004 .

[11]  Microeconomics-Charles W. Upton Repeated games , 2020, Game Theory.

[12]  S. Sorin “Big match” with lack of information on one side (Part II) , 1985 .

[13]  Dinah Rosenberg,et al.  Zero Sum Absorbing Games with Incomplete Information on One Side: Asymptotic Analysis , 2000, SIAM J. Control. Optim..

[14]  Jean-François Mertens,et al.  The value of two-person zero-sum repeated games with lack of information on both sides , 1971 .

[15]  S. Sorin A First Course on Zero Sum Repeated Games , 2002 .

[16]  Mrinal K. Ghosh,et al.  Partially Observable Semi-Markov Games with Discounted Payoff , 2006 .

[17]  Subhamay Saha Zero-Sum Stochastic Games with Partial Information and Average Payoff , 2014, J. Optim. Theory Appl..

[18]  Onésimo Hernández-Lerma,et al.  Controlled Markov Processes , 1965 .

[19]  Shmuel Zamir,et al.  Repeated games of incomplete information: Zero-sum , 1992 .

[20]  Robert J. Aumann,et al.  Repeated Games with Incomplete Information , 1995 .

[21]  Phil Howlett,et al.  Stochastic Optimal Control of a Solar Car , 2001 .