论文信息 - Using counterfactual regret minimization to create competitive multiplayer poker agents

Using counterfactual regret minimization to create competitive multiplayer poker agents

Games are used to evaluate and advance Multiagent and Artificial Intelligence techniques. Most of these games are deterministic with perfect information (e.g. Chess and Checkers). A deterministic game has no chance element and in a perfect information game, all information is visible to all players. However, many real-world scenarios with competing agents are stochastic (non-deterministic) with imperfect information. For two-player zero-sum perfect recall games, a recent technique called Counterfactual Regret Minimization (CFR) computes strategies that are provably convergent to an e-Nash equilibrium. A Nash equilibrium strategy is useful in two-player games since it maximizes its utility against a worst-case opponent. However, for multiplayer (three or more player) games, we lose all theoretical guarantees for CFR. However, we believe that CFR-generated agents may perform well in multiplayer games. To test this hypothesis, we used this technique to create several 3-player limit Texas Hold'em poker agents and two of them placed first and second in the 3-player event of the 2009 AAAI/IJCAI Computer Poker Competition. We also demonstrate that good strategies can be obtained by grafting sets of two-player subgame strategies to a 3-player base strategy after one of the players is eliminated.

Duane Szafron | Nick Abou Risk

[1] Ariel Rubinstein,et al. A Course in Game Theory , 1995 .

[2] Feng-Hsiung Hsu,et al. Behind Deep Blue: Building the Computer that Defeated the World Chess Champion , 2002 .

[3] Brian Sheppard,et al. World-championship-caliber Scrabble , 2002, Artif. Intell..

[4] Darse Billings. Algorithms and assessment in computer poker , 2006 .

[5] Darse Billings,et al. A Tool for the Direct Assessment of Poker Decisions , 2006, J. Int. Comput. Games Assoc..

[6] Javier Peña,et al. Gradient-Based Algorithms for Finding Nash Equilibria in Extensive Form Games , 2007, WINE.

[7] Andrew Hodges,et al. Alan Turing: The Enigma , 1983 .

[8] Jonathan Schaeffer,et al. Checkers Is Solved , 2007, Science.

[9] Michael H. Bowling,et al. Bayes' Bluff: Opponent Modelling in Poker , 2005, UAI 2005.

[10] Jonathan Schaeffer,et al. The challenge of poker , 2002, Artif. Intell..

[11] Jonathan Schaeffer,et al. CHINOOK: The World Man-Machine Checkers Champion , 1996, AI Mag..