Paper information - Lossless abstraction of imperfect information games


Finding an equilibrium of an extensive form game of imperfect information is a fundamental problem in computational game theory, but current techniques do not scale to large games. To address this, we introduce the ordered game isomorphism and the related ordered game isomorphic abstraction transformation. For a multi-player sequential game of imperfect information with observable actions and an ordered signal space, we prove that any Nash equilibrium in an abstracted smaller game, obtained by one or more applications of the transformation, can be easily converted into a Nash equilibrium in the original game. We present an algorithm, GameShrink, for abstracting the game using our isomorphism exhaustively. Its complexity is Õ(n²), where n is the number of nodes in a structure we call the signal tree. The signal tree is no larger than the game tree, and on nontrivial games it is drastically smaller, so GameShrink has time and space complexity sublinear in the size of the game tree. Using GameShrink, we find an equilibrium of a poker game with 3.1 billion nodes, over four orders of magnitude more than in the largest poker game solved previously. To address even larger games, we introduce approximation methods that do not preserve equilibrium, but nevertheless yield (ex post) provably close-to-optimal strategies.
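The abstract does not spell out how GameShrink works, so the following is only a toy illustration of the general idea of shrinking a signal tree without losing information. It is not the paper's algorithm. It assumes a much stricter merging rule than the ordered game isomorphism: two sibling signals are merged only when the subtrees below them are identical in payoffs and probabilities. The names `SignalNode`, `shrink`, and `count_nodes`, and the four-card example, are hypothetical.

```python
class SignalNode:
    """Hypothetical toy node: one chance outcome (signal) with a probability
    weight, an optional leaf payoff, and the outcomes that can follow it."""

    def __init__(self, label, weight=1.0, payoff=None, children=None):
        self.label = label
        self.weight = weight
        self.payoff = payoff
        self.children = children if children is not None else []


def shrink(node):
    """Merge sibling subtrees that are identical, working bottom-up.

    Returns a hashable signature of the (already shrunk) subtree below
    `node`. The signature ignores labels and child order, so signals that
    differ only in name (for example, in suit) compare equal.
    """
    merged = {}
    for child in node.children:
        sig = shrink(child)
        if sig in merged:
            # Same continuation: pool the probability mass into one node.
            merged[sig].weight += child.weight
            merged[sig].label += "|" + child.label
        else:
            merged[sig] = child
    node.children = list(merged.values())
    # Sort by repr so mixed None/number payoffs never get compared directly.
    child_part = sorted(((sig, c.weight) for sig, c in merged.items()), key=repr)
    return (node.payoff, tuple(child_part))


def count_nodes(node):
    """Number of nodes in the tree rooted at `node`."""
    return 1 + sum(count_nodes(c) for c in node.children)


def king(label):
    # Assumed toy payoffs: a king leads to one of two equally likely outcomes.
    return SignalNode(label, 0.25, children=[
        SignalNode("high", 0.5, payoff=2),
        SignalNode("low", 0.5, payoff=1),
    ])


def queen(label):
    # Assumed toy payoffs: a queen always leads to the same outcome.
    return SignalNode(label, 0.25, children=[SignalNode("any", 1.0, payoff=-1)])


root = SignalNode("deal", children=[king("Ks"), king("Kh"), queen("Qs"), queen("Qh")])
before = count_nodes(root)
shrink(root)
after = count_nodes(root)
merged_signals = [(c.label, c.weight) for c in root.children]
print(before, after, merged_signals)
```

In this toy deal the two kings and the two queens differ only by suit, so each pair collapses into a single signal carrying the combined probability. Because the merged subtrees are identical, a strategy computed on the smaller tree applies unchanged to every original signal in the merged group, which is the sense in which such a reduction loses nothing.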

Tuomas Sandholm | Andrew Gilpin
