Dinkelbach-Type Algorithm for Computing Quantal Stackelberg Equilibrium

Stackelberg security games (SSGs) have been deployed in many real-world situations to optimally allocate scarce resource to protect targets against attackers. However, actual human attackers are not perfectly rational and there are several behavior models that attempt to predict subrational behavior. Quantal response is among the most commonly used such models and Quantal Stackelberg Equilibrium (QSE) describes the optimal strategy to commit to when facing a subrational opponent. Nonconcavity makes computing QSE computationally challenging and while there exist algorithms for computing QSE for SSGs, they cannot be directly used for solving an arbitrary game in the normal form. We (1) present a transformation of the primal problem for computing QSE using a Dinkelbach’s method for any general-sum normal-form game, (2) provide a gradient-based and a MILPbased algorithm, give the convergence criteria, and bound their error, and finally (3) we experimentally demonstrate that using our novel transformation, a QSE can be closely approximated several orders of magnitude faster.

[1]  Sarit Kraus,et al.  Deployed ARMOR protection: the application of a game theoretic model for security at the Los Angeles International Airport , 2008, AAMAS.

[2]  G. Winskel What Is Discrete Mathematics , 2007 .

[3]  Yoav Shoham,et al.  Run the GAMUT: a comprehensive approach to evaluating game-theoretic algorithms , 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, 2004. AAMAS 2004..

[4]  Garth P. McCormick,et al.  Computability of global solutions to factorable nonconvex programs: Part I — Convex underestimating problems , 1976, Math. Program..

[5]  Rong Yang,et al.  Computing optimal strategy against quantal response in security games , 2012, AAMAS.

[6]  Colin Camerer Behavioral Game Theory: Experiments in Strategic Interaction , 2003 .

[7]  Kevin Waugh,et al.  DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker , 2017, ArXiv.

[8]  William J. E. Potts,et al.  Generalized additive neural networks , 1999, KDD '99.

[9]  George L. Nemhauser,et al.  Mixed-Integer Models for Nonseparable Piecewise-Linear Optimization: Unifying Framework and Extensions , 2010, Oper. Res..

[10]  R. McKelvey,et al.  Quantal Response Equilibria for Normal Form Games , 1995 .

[11]  M. D. Wilkinson,et al.  Management science , 1989, British Dental Journal.

[12]  R. Tibshirani,et al.  Generalized Additive Models , 1986 .

[13]  A. Tversky,et al.  Prospect theory: an analysis of decision under risk — Source link , 2007 .

[14]  Milind Tambe,et al.  CAPTURE: A New Predictive Anti-Poaching Tool for Wildlife Protection , 2016, AAMAS.

[15]  E. Yechiam,et al.  Losses as modulators of attention: review and analysis of the unique effects of losses over gains. , 2013, Psychological bulletin.

[16]  Pedro M. Castro,et al.  Global optimization of bilinear programs with a multiparametric disaggregation technique , 2013, J. Glob. Optim..

[17]  Toshimde Ibaraki Integer programming formulation of combinatorial optimization problems , 1976, Discret. Math..

[18]  Jaime Simão Sichman,et al.  Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2 , 2009, AAMAS 2009.

[19]  Sarit Kraus,et al.  Game-Theoretic Patrolling with Dynamic Execution Uncertainty and a Case Study on a Real Transit System , 2014, J. Artif. Intell. Res..

[20]  Noam Brown,et al.  Superhuman AI for heads-up no-limit poker: Libratus beats top professionals , 2018, Science.

[21]  I. Stancu-Minasian Nonlinear Fractional Programming , 1997 .

[22]  Oriol Carbonell-Nicolau Games and Economic Behavior , 2011 .

[23]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[24]  Bo An,et al.  PAWS - A Deployed Game-Theoretic Application to Combat Poaching , 2017, AI Mag..