论文信息 - A New Algorithm for Generating Equilibria in Massive Zero-Sum Games

A New Algorithm for Generating Equilibria in Massive Zero-Sum Games

In normal scenarios, computer scientists often consider the number of states in a game to capture the difficulty of learning an equilibrium. However, players do not see games in the same light: most consider Go or Chess to be more complex than Monopoly. In this paper, we discuss a new measure of game complexity that links existing state-of-the-art algorithms for computing approximate equilibria to a more human measure. In particular, we consider the range of skill in a game, i.e. how many different skill levels exist. We then modify existing techniques to design a new algorithm to compute approximate equilibria whose performance can be captured by this new measure. We use it to develop the first near Nash equilibrium for a four round abstraction of poker, and show that it would have been able to win handily the bankroll competition from last year's AAAI poker competition.

[1] Arthur L. Samuel,et al. Some Studies in Machine Learning Using the Game of Checkers , 1967, IBM J. Res. Dev..

[2] George B. Dantzig,et al. Decomposition Principle for Linear Programs , 1960 .

[3] R. Gomory,et al. A Linear Programming Approach to the Cutting-Stock Problem , 1961 .

[4] J. G. Pierce,et al. Geometric Algorithms and Combinatorial Optimization , 2016 .

[5] L. Méro,et al. Ways of Thinking: The Limits of Rational Thought and Artificial Intelligence , 1990 .

[6] D. Koller,et al. The complexity of two-person zero-sum games in extensive form , 1992 .

[7] Bernhard von Stengel,et al. Fast algorithms for finding randomized strategies in game trees , 1994, STOC '94.

[8] H. Kuk. On equilibrium points in bimatrix games , 1996 .

[9] Feng-Hsiung Hsu,et al. Behind Deep Blue: Building the Computer that Defeated the World Chess Champion , 2002 .

[10] Jonathan Schaeffer,et al. Approximating Game-Theoretic Optimal Strategies for Full-scale Poker , 2003, IJCAI.

[11] Avrim Blum,et al. Planning in the Presence of Cost Functions Controlled by an Adversary , 2003, ICML.

[12] Jonathan Schaeffer,et al. Building the Checkers 10-piece Endgame Databases , 2003, ACG.

[13] Jonathan Schaeffer,et al. Game-Tree Search with Adaptation in Stochastic Imperfect-Information Games , 2004, Computers and Games.

[14] Tuomas Sandholm,et al. A Competitive Texas Hold'em Poker Player via Automated Abstraction and Real-Time Equilibrium Computation , 2006, AAAI.

[15] Tuomas Sandholm,et al. A Texas Hold'em poker player based on automated abstraction and real-time equilibrium computation , 2006, AAMAS '06.

[16] Michael L. Littman,et al. The 2006 AAAI Computer Poker Competition , 2006 .