MASAGE: Model-Agnostic Sequential and Adaptive Game Estimation

Zero-sum games have been used to model cybersecurity scenarios between an attacker and a defender. However, unknown and uncertain environments have made it difficult to rely on a prescribed zero-sum game to capture the interactions between the players. In this work, we aim to estimate and recover an unknown matrix game that encodes the uncertainties of nature and opponent based on the knowledge of historical games and the current observations of game outcomes. The proposed approach effectively transfers the past experiences that are encoded as expert games to estimate and inform future game plays. We formulate the game knowledge transfer and estimation problem as a sequential least-square problem. We characterize the structural properties of the problem and show that the non-convex problem has well-behaved gradient and Hessian under mild assumptions. We propose gradient-based methods to enable dynamic and adaptive estimation of the unknown game. A case study is used to corroborate the results and illustrate the behavior of the proposed algorithm.

[1]  Quanyan Zhu,et al.  Dynamic policy-based IDS configuration , 2009, Proceedings of the 48h IEEE Conference on Decision and Control (CDC) held jointly with 2009 28th Chinese Control Conference.

[2]  H. Kuk On equilibrium points in bimatrix games , 1996 .

[3]  V. J. Hotz,et al.  Conditional Choice Probabilities and the Estimation of Dynamic Models , 1993 .

[4]  Boyan Jovanovic,et al.  Observable Implications of Models with Multiple Equilibria , 1989 .

[5]  Han Hong,et al.  Identification and Estimation of a Discrete Game of Complete Information , 2010 .

[6]  John C. Harsanyi,et al.  Games with Incomplete Information Played by "Bayesian" Players, I-III: Part I. The Basic Model& , 2004, Manag. Sci..

[7]  John N. Tsitsiklis,et al.  Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.

[8]  Joel E. Cohen,et al.  Perturbation Theory of Completely Mixed Matrix Games , 1986 .

[9]  Changbao Wu,et al.  Asymptotic Theory of Nonlinear Least Squares Estimation , 1981 .

[10]  Martin Pesendorfer,et al.  Identification and Estimation of Dynamic Games , 2003 .

[11]  Stefan Rass,et al.  On Game-Theoretic Network Security Provisioning , 2012, Journal of Network and Systems Management.

[12]  E. Malinvaud The Consistency of Nonlinear Regressions , 1970 .

[13]  S. Zamir,et al.  Formulation of Bayesian analysis for games with incomplete information , 1985 .

[14]  D K Smith,et al.  Numerical Optimization , 2001, J. Oper. Res. Soc..

[15]  H. F. Bohnenblust,et al.  Solutions of Discrete, Two-Person Games , 1949 .

[16]  R. Jennrich Asymptotic Properties of Non-Linear Least Squares Estimators , 1969 .

[17]  Martin Pesendorfer,et al.  Asymptotic Least Squares Estimators for Dynamic Games , 2008 .

[18]  Dimitri P. Bertsekas,et al.  Incremental Least Squares Methods and the Extended Kalman Filter , 1996, SIAM J. Optim..

[19]  Quanyan Zhu,et al.  A Stochastic Game Model for Jamming in Multi-Channel Cognitive Radio Systems , 2010, 2010 IEEE International Conference on Communications.

[20]  Ruli Xiao Identification and Estimation of Incomplete Information Games with Multiple Equilibria , 2015 .

[21]  Masao Fukushima,et al.  The Incremental Gauss-Newton Algorithm with Adaptive Stepsize Rule , 2003, Comput. Optim. Appl..

[22]  Perturbation Theory of Completely Mixed Bimatrix Games , 2001 .