Paper Information - Regret Minimization in Games with Incomplete Information

Extensive games are a powerful model of multiagent decision-making scenarios with incomplete information. Finding a Nash equilibrium for very large instances of these games has received a great deal of recent attention. In this paper, we describe a new technique for solving large games based on regret minimization. In particular, we introduce the notion of counterfactual regret, which exploits the degree of incomplete information in an extensive game. We show how minimizing counterfactual regret minimizes overall regret, and therefore in self-play can be used to compute a Nash equilibrium. We demonstrate this technique in the domain of poker, showing we can solve abstractions of limit Texas Hold'em with as many as 10^12 states, two orders of magnitude larger than previous methods.
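
To illustrate the abstract's claim that regret minimization in self-play can be used to compute a Nash equilibrium, here is a minimal sketch. It is not the paper's counterfactual regret algorithm: it applies plain regret matching to a single-shot game (rock-paper-scissors), which has no information sets or counterfactual values. The game, the function names (`regret_matching`, `self_play`, `exploitability`), and the fallback strategies are all assumptions chosen for illustration.

```python
# Illustrative only: regret matching in self-play on rock-paper-scissors.
# PAYOFF[a][b] is the payoff to player 0 when player 0 plays a and player 1
# plays b; the game is zero-sum, so player 1 receives the negation.
PAYOFF = [
    [0, -1, 1],   # rock
    [1, 0, -1],   # paper
    [-1, 1, 0],   # scissors
]
N = 3


def regret_matching(cum_regret, fallback):
    """Play each action in proportion to its positive cumulative regret."""
    positive = [max(r, 0.0) for r in cum_regret]
    total = sum(positive)
    if total > 0:
        return [p / total for p in positive]
    # With no positive regret, any strategy is allowed; use the fallback.
    return list(fallback)


def self_play(iterations, fallbacks):
    """Run both players against each other; return their average strategies."""
    regrets = [[0.0] * N, [0.0] * N]
    strategy_sums = [[0.0] * N, [0.0] * N]
    for _ in range(iterations):
        s0 = regret_matching(regrets[0], fallbacks[0])
        s1 = regret_matching(regrets[1], fallbacks[1])
        # Expected utility of each pure action against the opponent's mix.
        u0 = [sum(PAYOFF[a][b] * s1[b] for b in range(N)) for a in range(N)]
        u1 = [sum(-PAYOFF[a][b] * s0[a] for a in range(N)) for b in range(N)]
        for player, (s, u) in enumerate(((s0, u0), (s1, u1))):
            expected = sum(p * v for p, v in zip(s, u))
            for a in range(N):
                regrets[player][a] += u[a] - expected
                strategy_sums[player][a] += s[a]
    return [[x / iterations for x in sums] for sums in strategy_sums]


def exploitability(strategy, player):
    """Best-response value the opponent can achieve against `strategy`."""
    if player == 0:
        return max(
            sum(-PAYOFF[a][b] * strategy[a] for a in range(N)) for b in range(N)
        )
    return max(
        sum(PAYOFF[a][b] * strategy[b] for b in range(N)) for a in range(N)
    )


if __name__ == "__main__":
    # Player 0 starts from "always rock" so the dynamics are not trivial.
    avg = self_play(10000, fallbacks=([1.0, 0.0, 0.0], [1 / 3, 1 / 3, 1 / 3]))
    print("average strategies:", avg)
    print("exploitability:", [exploitability(avg[p], p) for p in (0, 1)])
```

The current strategies need not settle down; it is the average strategies that approach the equilibrium (uniform play in this game), and their exploitability shrinks as the number of iterations grows.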

[1] D. Blackwell. An analog of the minimax theorem for vector payoffs, 1956.

[2] D. Koller, et al. The complexity of two-person zero-sum games in extensive form, 1992.

[3] Ariel Rubinstein, et al. A Course in Game Theory, 1995.

[4] Jonathan Schaeffer, et al. Approximating Game-Theoretic Optimal Strategies for Full-scale Poker, 2003, IJCAI.

[5] Tuomas Sandholm, et al. A Competitive Texas Hold'em Poker Player via Automated Abstraction and Real-Time Equilibrium Computation, 2006, AAAI.

[6] Michael L. Littman, et al. The 2006 AAAI Computer Poker Competition, 2006.

[7] Geoffrey J. Gordon. No-regret Algorithms for Online Convex Programs, 2006, NIPS.

[8] Tuomas Sandholm, et al. Finding equilibria in large sequential games of imperfect information, 2006, EC '06.

[9] Javier Peña, et al. Gradient-Based Algorithms for Finding Nash Equilibria in Extensive Form Games, 2007, WINE.

[10] Michael H. Bowling, et al. A New Algorithm for Generating Equilibria in Massive Zero-Sum Games, 2007, AAAI.