Computing sequential equilibria for two-player games

Koller, Megiddo and von Stengel showed how to efficiently compute minimax strategies for two-player extensive-form zero-sum games with imperfect information but perfect recall using linear programming and avoiding conversion to normal form. Koller and Pfeffer pointed out that the strategies obtained by the algorithm are not necessarily sequentially rational and that this deficiency is often problematic for the practical applications. We show how to remove this deficiency by modifying the linear programs constructed by Koller, Megiddo and von Stengel so that pairs of strategies forming a sequential equilibrium are computed. In particular, we show that a sequential equilibrium for a two-player zero-sum game with imperfect information but perfect recall can be found in polynomial time. In addition, the equilibrium we find is normal-form perfect. Our technique generalizes to general-sum games, yielding an algorithm for such games which is likely to be prove practical, even though it is not polynomial-time.

[1]  Tuomas Sandholm,et al.  Finding equilibria in large sequential games of imperfect information , 2006, EC '06.

[2]  Jean-François Mertens Two examples of strategic equilibrium , 1995 .

[3]  E. Vandamme Stability and perfection of nash equilibria , 1987 .

[4]  J. Neumann,et al.  Theory of games and economic behavior , 1945, 100 Years of Math Milestones.

[5]  B. Stengel,et al.  Efficient Computation of Behavior Strategies , 1996 .

[6]  B. Eaves The Linear Complementarity Problem , 1971 .

[7]  L. Khachiyan Polynomial algorithms in linear programming , 1980 .

[8]  John C. Harsanyi,et al.  Общая теория выбора равновесия в играх / A General Theory of Equilibrium Selection in Games , 1989 .

[9]  L. G. H. Cijan A polynomial algorithm in linear programming , 1979 .

[10]  Robert Wilson Computing Simply Stable Equilibria , 1992 .

[11]  J. Nash Equilibrium Points in N-Person Games. , 1950, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Bernhard von Stengel,et al.  Exponentially many steps for finding a Nash equilibrium in a bimatrix game , 2004, 45th Annual IEEE Symposium on Foundations of Computer Science.

[13]  Christos H. Papadimitriou,et al.  On graph-theoretic lemmata and complexity classes , 1990, Proceedings [1990] 31st Annual Symposium on Foundations of Computer Science.

[14]  Narendra Karmarkar,et al.  A new polynomial-time algorithm for linear programming , 1984, STOC '84.

[15]  R. Selten Reexamination of the perfectness concept for equilibrium points in extensive games , 1975, Classics in Game Theory.

[16]  Andrew McLennan,et al.  Gambit: Software Tools for Game Theory , 2006 .

[17]  A. Charnes Optimality and Degeneracy in Linear Programming , 1952 .

[18]  E. Damme A relation between perfect equilibria in extensive form games and proper equilibria in normal form games , 1984 .

[19]  A. U.S.,et al.  Computation of Equilibria in Noncooperative Games , 2005 .

[20]  R. McKelvey,et al.  Computation of equilibria in finite games , 1996 .

[21]  R. Tourky,et al.  From Imitation Games to Kakutani , 2005 .

[22]  D. Koller,et al.  Efficient Computation of Equilibria for Extensive Two-Person Games , 1996 .

[23]  R. McKelvey,et al.  Quantal Response Equilibria for Extensive Form Games , 1998 .

[24]  D. Koller,et al.  The complexity of two-person zero-sum games in extensive form , 1992 .

[25]  Peter Bro Miltersen,et al.  Computing Proper Equilibria of Zero-Sum Games , 2006, Computers and Games.

[26]  J. Neumann,et al.  Theory of Games and Economic Behavior. , 1945 .

[27]  E. Damme Refinements of the Nash Equilibrium Concept , 1983 .

[28]  John H. Reif,et al.  Computation of equilibriain noncooperative games , 2005 .

[29]  H. Kuhn 9. A SIMPLIFIED TWO-PERSON POKER , 1951 .

[30]  Mihalis Yannakakis,et al.  How easy is local search? , 1985, 26th Annual Symposium on Foundations of Computer Science (sfcs 1985).

[31]  Yuval Rabani,et al.  Linear Programming , 2007, Handbook of Approximation Algorithms and Metaheuristics.

[32]  Avi Pfeffer,et al.  Representations and Solutions for Game-Theoretic Problems , 1997, Artif. Intell..

[33]  Vincent Conitzer,et al.  Complexity Results about Nash Equilibria , 2002, IJCAI.

[34]  Bernhard von Stengel,et al.  Computing Normal Form Perfect Equilibria for Extensive Two-Person Games , 2002 .

[35]  E. Damme Stability and perfection of Nash equilibria , 1987 .

[36]  Philip J. Reny,et al.  Independence on Relative Probability Spaces and Consistent Assessments in Game Trees , 1997 .

[37]  B. Stengel,et al.  COMPUTING EQUILIBRIA FOR TWO-PERSON GAMES , 1996 .

[38]  Jonathan Schaeffer,et al.  Approximating Game-Theoretic Optimal Strategies for Full-scale Poker , 2003, IJCAI.

[39]  Bernhard von Stengel,et al.  Fast algorithms for finding randomized strategies in game trees , 1994, STOC '94.