A New Algorithm for Generating Equilibria in Massive Zero-Sum Games

In normal scenarios, computer scientists often consider the number of states in a game to capture the difficulty of learning an equilibrium. However, players do not see games in the same light: most consider Go or Chess to be more complex than Monopoly. In this paper, we discuss a new measure of game complexity that links existing state-of-the-art algorithms for computing approximate equilibria to a more human measure. In particular, we consider the range of skill in a game, i.e. how many different skill levels exist. We then modify existing techniques to design a new algorithm to compute approximate equilibria whose performance can be captured by this new measure. We use it to develop the first near Nash equilibrium for a four round abstraction of poker, and show that it would have been able to win handily the bankroll competition from last year's AAAI poker competition.

[1]  Arthur L. Samuel,et al.  Some Studies in Machine Learning Using the Game of Checkers , 1967, IBM J. Res. Dev..

[2]  George B. Dantzig,et al.  Decomposition Principle for Linear Programs , 1960 .

[3]  R. Gomory,et al.  A Linear Programming Approach to the Cutting-Stock Problem , 1961 .

[4]  J. G. Pierce,et al.  Geometric Algorithms and Combinatorial Optimization , 2016 .

[5]  L. Méro,et al.  Ways of Thinking: The Limits of Rational Thought and Artificial Intelligence , 1990 .

[6]  D. Koller,et al.  The complexity of two-person zero-sum games in extensive form , 1992 .

[7]  Bernhard von Stengel,et al.  Fast algorithms for finding randomized strategies in game trees , 1994, STOC '94.

[8]  H. Kuk On equilibrium points in bimatrix games , 1996 .

[9]  Feng-Hsiung Hsu,et al.  Behind Deep Blue: Building the Computer that Defeated the World Chess Champion , 2002 .

[10]  Jonathan Schaeffer,et al.  Approximating Game-Theoretic Optimal Strategies for Full-scale Poker , 2003, IJCAI.

[11]  Avrim Blum,et al.  Planning in the Presence of Cost Functions Controlled by an Adversary , 2003, ICML.

[12]  Jonathan Schaeffer,et al.  Building the Checkers 10-piece Endgame Databases , 2003, ACG.

[13]  Jonathan Schaeffer,et al.  Game-Tree Search with Adaptation in Stochastic Imperfect-Information Games , 2004, Computers and Games.

[14]  Tuomas Sandholm,et al.  A Competitive Texas Hold'em Poker Player via Automated Abstraction and Real-Time Equilibrium Computation , 2006, AAAI.

[15]  Tuomas Sandholm,et al.  A Texas Hold'em poker player based on automated abstraction and real-time equilibrium computation , 2006, AAMAS '06.

[16]  Michael L. Littman,et al.  The 2006 AAAI Computer Poker Competition , 2006 .