N-Person Discrete-Time Dynamic Games of Asymmetric Information

This paper considers a class of N-person discrete-time dynamic games with an asymmetric information structure. Each player has private information revealed only to himself, which is modeled as a random variable called the type. Each player aims to find an optimal feedback control policy to reach the desired state while minimizing the control cost without exact knowledge of the system dynamics. Players can form a belief on the unknowns based on the observation of the state trajectory and update it via the Bayesian rule to learn the type value of other players. To deal with the uncertainty caused by the private information, each player forms his control policy under the expectation of the type belief, which forms the perfect Bayesian Nash equilibrium (PBNE). The strong coupling of the type estimation and the control policy establishes no separation principle in our non-classical information structure. In particular, we investigate the linear-quadratic setting, and we obtain generalized Riccati equations and an affine state-feedback PBNE policy. Moreover, we show that the PBNE policy is unique if it exists and is strongly time consistent. Finally, we numerically illustrate the proposed framework with a case study.

[1]  Roger B. Myerson,et al.  Optimal Auction Design , 1981, Math. Oper. Res..

[2]  Michael N. Katehakis,et al.  The Multi-Armed Bandit Problem: Decomposition and Computation , 1987, Math. Oper. Res..

[3]  Quanyan Zhu,et al.  A Game-theoretic Taxonomy and Survey of Defensive Deception for Cybersecurity and Privacy , 2017, ACM Comput. Surv..

[4]  Rabih Salhab,et al.  A Dynamic Game Model of Collective Choice in Multiagent Systems , 2015, IEEE Transactions on Automatic Control.

[5]  Quanyan Zhu,et al.  Game theory meets network security and privacy , 2013, CSUR.

[6]  John C. Harsanyi,et al.  Games with Incomplete Information Played by "Bayesian" Players, I-III: Part I. The Basic Model& , 2004, Manag. Sci..

[7]  T. Başar,et al.  Stochastic Teams with Nonclassical Information Revisited: When is an Affine Law Optimal? , 1986, 1986 American Control Conference.

[8]  Tamer Basar,et al.  Variations on the theme of the Witsenhausen counterexample , 2008, 2008 47th IEEE Conference on Decision and Control.

[9]  Georges Zaccour,et al.  Dynamic Games in the Economics and Management of Pollution , 2009 .

[10]  Quanyan Zhu,et al.  A Large-Scale Markov Game Approach to Dynamic Protection of Interdependent Infrastructure Networks , 2017, GameSec.

[11]  H. Chizeck,et al.  Controllability, stabilizability, and continuous-time Markovian jump linear quadratic control , 1990 .

[12]  Quanyan Zhu,et al.  Analysis and Computation of Adaptive Defense Strategies Against Advanced Persistent Threats for Cyber-Physical Systems , 2018, GameSec.