A Class of Tractable Partially Observed Discrete Stochastic Games

Stochastic games under partial information are typically computationally intractable even in the discrete-time/discrete-state case considered here. We consider a problem where one player has perfect information. A chief problem is that the information state for the player with imperfect information is a function over the space of probability distributions (a function over a simplex), and so infinite-dimensional. However, in the problem form here, the payoff is only a function of the terminal state of the system, and the initial information state is either linear or a sum of max-plus delta functions. In this case, the information state and state-feedback value functions belong to finite-dimensional sets. Thus computational tractability is greatly enhanced.

[1]  M. James,et al.  Extending H-infinity Control to Nonlinear Systems: Control of Nonlinear Systems to Achieve Performance Objectives , 1987 .

[2]  K. Ito,et al.  Stochastic games and inverse Lyapunov methods in air operations , 2000, Proceedings of the 39th IEEE Conference on Decision and Control (Cat. No.00CH37187).

[3]  R. Elliott,et al.  The Existence Of Value In Differential Games , 1972 .

[4]  Huihui Jiang,et al.  Modeling and control of military operations against adversarial control , 2000, Proceedings of the 39th IEEE Conference on Decision and Control (Cat. No.00CH37187).

[5]  H. S. Morse,et al.  The DARPA JFACC program: modeling and control of military operations , 2000, Proceedings of the 39th IEEE Conference on Decision and Control (Cat. No.00CH37187).

[6]  Milton B. Adams,et al.  Closed-Loop Hierarchical Control of Military Air Operations , 2002 .

[7]  T. Basar,et al.  H∞-0ptimal Control and Related Minimax Design Problems: A Dynamic Game Approach , 1996, IEEE Trans. Autom. Control..

[8]  Debasish Ghose,et al.  Game theoretic campaign modeling and analysis , 2000, Proceedings of the 39th IEEE Conference on Decision and Control (Cat. No.00CH37187).

[9]  J. Filar,et al.  Competitive Markov Decision Processes , 1996 .