Paper information - Dinkelbach-Type Algorithm for Computing Quantal Stackelberg Equilibrium

Stackelberg security games (SSGs) have been deployed in many real-world settings to optimally allocate scarce resources to protect targets against attackers. However, actual human attackers are not perfectly rational, and several behavioral models attempt to predict their subrational behavior. Quantal response is among the most commonly used of these models, and the Quantal Stackelberg Equilibrium (QSE) describes the optimal strategy to commit to when facing a subrational opponent. Nonconcavity makes computing a QSE challenging, and while algorithms exist for computing QSE in SSGs, they cannot be directly applied to an arbitrary normal-form game. We (1) present a transformation of the primal problem for computing QSE using Dinkelbach’s method for any general-sum normal-form game, (2) provide a gradient-based and a MILP-based algorithm, give their convergence criteria, and bound their error, and (3) experimentally demonstrate that, using our novel transformation, a QSE can be closely approximated several orders of magnitude faster.
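To make the role of Dinkelbach's method concrete, the following is a minimal sketch and not the authors' algorithm. It assumes a logit quantal response follower with rationality parameter `lam`, under which the leader's expected utility is a ratio N(x)/D(x), and it applies the classical Dinkelbach iteration to that ratio. The payoff matrices `A` and `B`, the value of `lam`, and the helper names are hypothetical. The inner maximization is done by brute-force search over a grid of leader strategies, standing in for the gradient-based and MILP-based solvers the abstract mentions.

```python
import numpy as np

# Hypothetical 2x3 game: rows are leader actions, columns are follower actions.
A = np.array([[2.0, -1.0, 0.5],
              [-0.5, 1.5, 1.0]])   # leader payoffs (assumed values)
B = np.array([[-1.0, 1.0, 0.0],
              [1.0, -0.5, 0.2]])   # follower payoffs (assumed values)
lam = 2.0                          # assumed rationality parameter of the logit model


def ratio_terms(X, A, B, lam):
    """Return numerator N and denominator D for each leader strategy in X.

    Under a logit quantal response, the follower plays action j with
    probability proportional to exp(lam * (x @ B)[j]), so the leader's
    expected utility is N(x) / D(x) with D(x) > 0.
    """
    weights = np.exp(lam * (X @ B))        # unnormalized follower probabilities
    N = (weights * (X @ A)).sum(axis=1)    # weighted leader payoffs
    D = weights.sum(axis=1)                # normalizing constant
    return N, D


def dinkelbach_on_grid(N, D, tol=1e-12, max_iter=100):
    """Classical Dinkelbach iteration for max N/D over a finite candidate set."""
    idx = 0
    p = N[idx] / D[idx]                    # current ratio estimate
    for _ in range(max_iter):
        values = N - p * D                 # parametric subproblem objective
        cand = int(np.argmax(values))      # inner solve by exhaustive search
        if values[cand] <= tol:            # no candidate improves on p
            break
        idx = cand
        p = N[idx] / D[idx]                # update ratio to the improving point
    return p, idx


# Leader mixed strategies x = (t, 1 - t) on a uniform grid over the simplex.
ts = np.linspace(0.0, 1.0, 1001)
X = np.stack([ts, 1.0 - ts], axis=1)

N, D = ratio_terms(X, A, B, lam)
p_star, idx_star = dinkelbach_on_grid(N, D)
print("approximate leader strategy:", X[idx_star])
print("approximate leader utility:", p_star)
```

Each iteration replaces the fractional objective with the parametric problem max N(x) - p D(x), then resets p to the ratio at the maximizer; the loop stops once no candidate strategy makes that quantity positive. The grid search only scales to very small games and is used here purely for illustration.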
