Computing Team-Maxmin Equilibria in Zero-Sum Multiplayer Extensive-Form Games

Finding equilibria in multiplayer games is challenging. This paper focuses on computing Team-Maxmin Equilibria (TMEs) in zero-sum multiplayer Extensive-Form Games (EFGs). A TME describes the optimal strategies for a team of players who share the same goal but take actions independently against an adversary. TMEs capture many realistic scenarios, including: 1) a team of players playing against a target player in poker games; and 2) defense resources scheduling and patrolling independently in security games. However, efficiently finding TMEs within any given accuracy in EFGs is almost completely unexplored. To fill this gap, we first study the inefficiency caused by computing an equilibrium in which team players correlate their strategies and then transforming it into a mixed strategy profile of the team, and we show that this inefficiency can be arbitrarily large. Second, to efficiently solve the non-convex program for finding TMEs directly, we develop the Associated Recursive Asynchronous Multiparametric Disaggregation Technique (ARAMDT), which approximates the multilinear terms in the program using two novel techniques: 1) an asynchronous precision method, which reduces the number of constraints and variables needed for the approximation by using different precision levels for different terms; and 2) an associated constraint method, which reduces the feasible solution space of the mixed-integer linear program resulting from ARAMDT by exploiting the relations between these terms. Third, we develop a novel iterative algorithm, based on ARAMDT, that efficiently computes TMEs within any given accuracy. In the experimental evaluation, our algorithm is orders of magnitude faster than the baselines.
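To make the distinction between correlated and independent team strategies concrete, here is a minimal Python sketch on a toy one-shot game. The game, the function names, and the grid search are all assumptions made for illustration: they do not come from the paper, and the brute-force grid is not ARAMDT. It only shows why the team's objective contains a product of the two members' probabilities (a multilinear, non-convex term) and how the independent value can fall below the correlated one.

```python
from itertools import product

ACTIONS = (0, 1)


def team_payoff(a1, a2, b):
    """Hypothetical toy game: the team scores 1 only if both members pick
    the same action and the adversary guessed the other action."""
    return 1.0 if a1 == a2 and a1 != b else 0.0


def value_independent(p, q):
    """Worst-case expected team payoff when the members mix independently.

    p and q are the probabilities that member 1 and member 2 play action 0.
    The expectation contains products such as p * q, which is what makes
    the team's optimization problem non-convex.
    """
    probs1 = {0: p, 1: 1.0 - p}
    probs2 = {0: q, 1: 1.0 - q}
    return min(
        sum(probs1[a1] * probs2[a2] * team_payoff(a1, a2, b)
            for a1, a2 in product(ACTIONS, repeat=2))
        for b in ACTIONS
    )


def value_correlated(joint):
    """Worst-case expected team payoff for a joint (correlated) distribution
    given as a dict mapping (a1, a2) to its probability."""
    return min(
        sum(pr * team_payoff(a1, a2, b) for (a1, a2), pr in joint.items())
        for b in ACTIONS
    )


def grid_search(steps):
    """Brute-force the independent strategies on a uniform grid.

    Returns (best_value, p, q). This is a crude stand-in for a real solver.
    """
    grid = [i / steps for i in range(steps + 1)]
    return max((value_independent(p, q), p, q) for p in grid for q in grid)


best_value, best_p, best_q = grid_search(steps=10)
correlated_value = value_correlated({(0, 0): 0.5, (1, 1): 0.5})
print("independent (team-maxmin style) value:", best_value)
print("correlated value:", correlated_value)
```

In this toy game the correlated team guarantees 0.5 by flipping one shared coin, while independent mixing guarantees at most 0.25, reached at p = q = 0.5. The sketch illustrates only this gap, not the paper's inefficiency analysis or its algorithm.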

Bo An | Youzhi Zhang
