Paper Information - MASAGE: Model-Agnostic Sequential and Adaptive Game Estimation

MASAGE: Model-Agnostic Sequential and Adaptive Game Estimation

Zero-sum games have been used to model cybersecurity scenarios between an attacker and a defender. However, unknown and uncertain environments have made it difficult to rely on a prescribed zero-sum game to capture the interactions between the players. In this work, we aim to estimate and recover an unknown matrix game that encodes the uncertainties of nature and the opponent, based on the knowledge of historical games and the current observations of game outcomes. The proposed approach transfers past experiences, encoded as expert games, to estimate and inform future game plays. We formulate the game knowledge transfer and estimation problem as a sequential least-squares problem. We characterize the structural properties of the problem and show that the non-convex problem has a well-behaved gradient and Hessian under mild assumptions. We propose gradient-based methods to enable dynamic and adaptive estimation of the unknown game. A case study is used to corroborate the results and illustrate the behavior of the proposed algorithm.
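
The idea of fitting an unknown game from expert games by sequential least squares can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the unknown payoff matrix is modeled as a weighted sum of expert matrices, each observation is the expected payoff x^T A y under known mixed strategies, and the function names (`bilinear`, `combine`, `sgd_step`, `estimate`) are hypothetical. This simplified observation model is linear in the weights, so it does not reproduce the non-convex problem the abstract refers to; it only shows the shape of a gradient-based sequential update.

```python
# Hypothetical sketch: the weighted-expert model and the linear observation
# model below are assumptions for illustration, not the paper's formulation.

def bilinear(x, A, y):
    """Expected payoff x^T A y for a row strategy x and a column strategy y."""
    return sum(x[i] * A[i][j] * y[j]
               for i in range(len(x)) for j in range(len(y)))

def combine(weights, experts):
    """Weighted sum of expert payoff matrices (assumed model of the unknown game)."""
    rows, cols = len(experts[0]), len(experts[0][0])
    return [[sum(w * E[i][j] for w, E in zip(weights, experts))
             for j in range(cols)] for i in range(rows)]

def sgd_step(weights, experts, x, y, observed, lr):
    """One gradient step on the squared error 0.5 * (x^T A(w) y - observed)^2."""
    features = [bilinear(x, E, y) for E in experts]  # derivative of x^T A(w) y w.r.t. each weight
    residual = sum(w * f for w, f in zip(weights, features)) - observed
    return [w - lr * residual * f for w, f in zip(weights, features)]

def estimate(experts, observations, lr=0.1, epochs=200):
    """Process observations one at a time, repeatedly, starting from uniform weights."""
    weights = [1.0 / len(experts)] * len(experts)
    for _ in range(epochs):
        for x, y, observed in observations:
            weights = sgd_step(weights, experts, x, y, observed, lr)
    return weights

# Usage: two made-up 2x2 expert games and noiseless observations of a mixture.
experts = [[[1.0, -1.0], [-1.0, 1.0]], [[0.0, 2.0], [1.0, 0.0]]]
hidden_game = combine([0.7, 0.3], experts)
pure = [[1.0, 0.0], [0.0, 1.0]]
observations = [(x, y, bilinear(x, hidden_game, y)) for x in pure for y in pure]
print([round(w, 4) for w in estimate(experts, observations)])
```

The step size and epoch count are arbitrary choices that happen to suit this small, noiseless example; noisy observations or a non-convex observation model would need more careful tuning.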

Quanyan Zhu | Juntao Chen | Guanze Peng | Yunian Pan

[1] Quanyan Zhu, et al. Dynamic policy-based IDS configuration, 2009, Proceedings of the 48th IEEE Conference on Decision and Control (CDC) held jointly with 2009 28th Chinese Control Conference.

[2] H. Kuk. On equilibrium points in bimatrix games, 1996.

[3] V. J. Hotz, et al. Conditional Choice Probabilities and the Estimation of Dynamic Models, 1993.

[4] Boyan Jovanovic, et al. Observable Implications of Models with Multiple Equilibria, 1989.

[5] Han Hong, et al. Identification and Estimation of a Discrete Game of Complete Information, 2010.

[6] John C. Harsanyi, et al. Games with Incomplete Information Played by "Bayesian" Players, I-III: Part I. The Basic Model, 2004, Manag. Sci.

[7] John N. Tsitsiklis, et al. Neuro-Dynamic Programming, 1996, Encyclopedia of Machine Learning.

[8] Joel E. Cohen, et al. Perturbation Theory of Completely Mixed Matrix Games, 1986.

[9] Changbao Wu, et al. Asymptotic Theory of Nonlinear Least Squares Estimation, 1981.

[10] Martin Pesendorfer, et al. Identification and Estimation of Dynamic Games, 2003.

[11] Stefan Rass, et al. On Game-Theoretic Network Security Provisioning, 2012, Journal of Network and Systems Management.

[12] E. Malinvaud. The Consistency of Nonlinear Regressions, 1970.

[13] S. Zamir, et al. Formulation of Bayesian analysis for games with incomplete information, 1985.