Further developments of extensive-form replicator dynamics using the sequence-form representation

Replicator dynamics model the interactions and change in large populations of agents. Standard replicator dynamics, however, only allow single action interactions between agents. Complex interactions, as modeled by general extensive games, have received comparatively little attention in this setting. Recently, replicator dynamics have been adapted to extensive-form games represented in sequence form, leading to a large reduction in computational resource requirements. In this paper, we first show that sequence-form constraints and realization equivalence to standard replicator dynamics are maintained in general n-player games. We show that sequence-form replicator dynamics can minimize regret, leading to equilibrium convergence guarantees in two-player zero-sum games. We provide the first empirical evaluation of sequence-form replicator dynamics, applied to n-player Kuhn poker with two, three, and four players. Our results show that the average strategies generated by sequence-form replicator dynamics produce approximate equilibrium strategies with increasing accuracy over time.

[1]  Matthias Rauterberg,et al.  State-coupled replicator dynamics , 2009, AAMAS.

[2]  Y. Mansour,et al.  Algorithmic Game Theory: Learning, Regret Minimization, and Equilibria , 2007 .

[3]  Richard Gibson,et al.  Regret Minimization in Non-Zero-Sum Games with Applications to Building Champion Multiplayer Computer Poker Agents , 2013, ArXiv.

[4]  Nathan R. Sturtevant,et al.  A parameterized family of equilibrium profiles for three-player kuhn poker , 2013, AAMAS.

[5]  D. Fudenberg,et al.  The Theory of Learning in Games , 1998 .

[6]  H. W. Kuhn,et al.  11. Extensive Games and the Problem of Information , 1953 .

[7]  Ariel Rubinstein,et al.  A Course in Game Theory , 1995 .

[8]  Marcello Restelli,et al.  Efficient Evolutionary Dynamics with Extensive-Form Games , 2013, AAAI.

[9]  Jan Ramon,et al.  An evolutionary game-theoretic analysis of poker strategies , 2009, Entertain. Comput..

[10]  János Flesch,et al.  Evolutionary Stochastic Games , 2013, Dyn. Games Appl..

[11]  Yoav Shoham,et al.  Multiagent Systems - Algorithmic, Game-Theoretic, and Logical Foundations , 2009 .

[12]  J. M. Smith,et al.  The Logic of Animal Conflict , 1973, Nature.

[13]  Philip Wolfe,et al.  Contributions to the theory of games , 1953 .

[14]  P. Taylor,et al.  Evolutionarily Stable Strategies and Game Dynamics , 1978 .

[15]  Herbert Gintis,et al.  Game Theory Evolving: A Problem-Centered Introduction to Modeling Strategic Interaction - Second Edition , 2009 .

[16]  Michael H. Bowling,et al.  Regret Minimization in Games with Incomplete Information , 2007, NIPS.

[17]  Manfred K. Warmuth,et al.  The weighted majority algorithm , 1989, 30th Annual Symposium on Foundations of Computer Science.

[18]  Y. Mansour,et al.  4 Learning , Regret minimization , and Equilibria , 2006 .

[19]  Karl Tuyls,et al.  Evolutionary Dynamics of Regret Minimization , 2010, ECML/PKDD.

[20]  Josef Hofbauer,et al.  Evolutionary Games and Population Dynamics , 1998 .

[21]  Duane Szafron,et al.  Using counterfactual regret minimization to create competitive multiplayer poker agents , 2010, AAMAS 2010.

[22]  D. E. Matthews Evolution and the Theory of Games , 1977 .

[23]  Bernhard von Stengel,et al.  Fast algorithms for finding randomized strategies in game trees , 1994, STOC '94.

[24]  Simon Parsons,et al.  What evolutionary game theory tells us about multiagent learning , 2007, Artif. Intell..

[25]  R. Cressman Evolutionary Dynamics and Extensive Form Games , 2003 .

[26]  Bret Hoehn,et al.  Effective short-term opponent exploitation in simplified poker , 2005, Machine Learning.