论文信息 - Fast algorithms for finding randomized strategies in game trees - 字舞流文

Fast algorithms for finding randomized strategies in game trees

Interactions among agents can be conveniently described by game trees. In order to analyze a game, it is important to derive optimal (or equilibrium) strategies for the different players. The standard approach to finding such strategies in games with imperfect information is, in general, computationally intractable. The approach is to generate the normal form of the game (the matrix containing the payoff for each strategy combination), and then solve a linear program (LP) or a linear complementarity problem (LCP). The size of the normal form, however, is typically exponential in the size of the game tree, thus making this method impractical in all but the simplest cases. This paper describes a new representation of strategies which results in a practical linear formulation of the problem of two-player games with perfect recall (i.e., games where players never forget anything, which is a standard assumption). Standard LP or LCP solvers can then be applied to find optimal randomized strategies. The resulting algorithms are, in general, exponentially better than the standard ones, both in terms of time and in terms of space. ∗Computer Science Division, University of California, Berkeley, CA 94720; and IBM Almaden Research Center, 650 Harry Road, San Jose, CA 95120 †IBM Almaden Research Center, 650 Harry Road, San Jose, CA 95120; and School of Mathematical Sciences, Tel Aviv University, Tel Aviv, Israel. ‡Informatik 5, University of the Federal Armed Forces at Munich, 85577 Neubiberg, Germany. Research supported in part by ONR Contract N00014-91-C-0026, by the Air Force Office of Scientific Research (AFSC) under Contract F49620-91-C-0080, and by the Volkswagen Foundation. Some of the work was performed while the first author was at Stanford University. The United States Government is authorized to reproduce and distribute reprints for governmental purposes. In: Proceedings of the 26th ACM Symposium on the Theory of Computing, 1994, 750–759

Bernhard von Stengel | Nimrod Megiddo | Daphne Koller | D. Koller | N. Megiddo | B. Stengel | B. V. Stengel

[1] J. Neumann,et al. Theory of Games and Economic Behavior. , 1945 .

[2] E. Rowland. Theory of Games and Economic Behavior , 1946, Nature.

[3] N. Dalkey. EQUIVALENCE OF INFORMATION PATTERNS AND ESSENTIALLY DETERMINATE GAMES , 1952 .

[4] H. W. Kuhn,et al. 11. Extensive Games and the Problem of Information , 1953 .

[5] N. Vorob’ev. Equilibrium Points in Bimatrix Games , 1958 .

[6] C. E. Lemke,et al. Equilibrium Points of Bimatrix Games , 1964 .

[7] George B. Dantzig,et al. Linear programming and extensions , 1965 .

[8] C. E. Lemke,et al. Bimatrix Equilibrium Points and Mathematical Programming , 1965 .

[9] Robert Wilson. Computing Equilibria of Two-Person Games from the Extensive Form , 1972 .

[10] William F. Lucas,et al. An Overview of the Mathematical Theory of Games , 1972 .

[11] L. Shapley. A note on the Lemke-Howson algorithm , 1974 .

[12] Robert E. Tarjan,et al. A Combinatorial Problem Which Is Complete in Polynomial Space , 1976, JACM.

[13] A. K. Chandra,et al. Alternation , 1976, 17th Annual Symposium on Foundations of Computer Science (sfcs 1976).

[14] Andrew Chi-Chih Yao,et al. Probabilistic computations: Toward a unified measure of complexity , 1977, 18th Annual Symposium on Foundations of Computer Science (sfcs 1977).

[15] Thomas J. Schaefer,et al. On the Complexity of Some Two-Person Perfect-Information Games , 1978, J. Comput. Syst. Sci..

[16] David Lichtenstein,et al. GO Is Polynomial-Space Hard , 1980, JACM.

[17] Aviezri S. Fraenkel,et al. Computing a Perfect Strategy for n*n Chess Requires Time Exponential in N , 1981, ICALP.

[18] Aviezri S. Fraenkel,et al. Computing a Perfect Strategy for n x n Chess Requires Time Exponential in n , 1981, J. Comb. Theory, Ser. A.

[19] L. Lovász,et al. Geometric Algorithms and Combinatorial Optimization , 1981 .

[20] John H. Reif,et al. The Complexity of Two-Player Games of Incomplete Information , 1984, J. Comput. Syst. Sci..

[21] John Michael Robson,et al. N by N Checkers is Exptime Complete , 1984, SIAM J. Comput..

[22] Nathan Linial,et al. Collective coin flipping, robust voting schemes and minima of Banzhaf values , 1985, 26th Annual Symposium on Foundations of Computer Science (sfcs 1985).

[23] Editors , 1986, Brain Research Bulletin.

[24] Rudolf Avenhaus. Safeguards systems analysis , 1986 .

[25] Josef Hofbauer,et al. The theory of evolution and dynamical systems , 1988 .

[26] Katta G. Murty,et al. Linear complementarity, linear and nonlinear programming , 1988 .

[27] D. Knuth,et al. A note on strategy elimination in bimatrix games , 1988 .

[28] J. G. Pierce,et al. Geometric Algorithms and Combinatorial Optimization , 2016 .

[29] Charles Blair. Linear Complementarily, Linear and Nonlinear Programming (K. G. Murty) , 1989, SIAM Rev..

[30] Eitan Zemel,et al. Nash and correlated equilibria: Some complexity considerations , 1989 .

[31] S. J. Chung. NP-Completeness of the linear complementarity problem , 1989 .

[32] Nathan Linial,et al. Collective Coin Flipping , 1989, Advances in Computational Research.

[33] Allan Borodin,et al. On the power of randomization in online algorithms , 1990, STOC '90.

[34] Shirley Dex,et al. JR 旅客販売総合システム（マルス）における運用及び管理について , 1991 .

[35] Richard W. Cottle,et al. Linear Complementarity Problem. , 1992 .

[36] Adi Shamir,et al. IP = PSPACE , 1992, JACM.

[37] D. Koller,et al. The complexity of two-person zero-sum games in extensive form , 1992 .

[38] Jeffrey S. Rosenschein,et al. Consenting Agents: Negotiation Mechanisms for Multi-Agent Systems , 1993, IJCAI.

[39] Matthew K. Franklin,et al. Eavesdropping games: a graph-theoretic approach to privacy in distributed systems , 1993, Proceedings of 1993 IEEE 34th Annual Foundations of Computer Science.

[40] Joan Feigenbaum,et al. Probabilistically checkable debate systems and approximation algorithms for PSPACE-hard functions , 1993, STOC.

[41] Eitan Zemel,et al. The Complexity of Eliminating Dominated Strategies , 1993, Math. Oper. Res..

[42] Nimrod Megiddo,et al. Constructing Small Sample Spaces Satisfying Given Constraints , 1994, SIAM J. Discret. Math..

[43] H. Kuk. On equilibrium points in bimatrix games , 1996 .