Towards a Taxonomy of Learning Dynamics in 2 × 2 Games

Learning would be a convincing method to achieve coordination on an equilibrium. But does learning converge, and to what? We answer this question in generic 2-player, 2-strategy games, using Experience-Weighted Attraction (EWA), which encompasses many extensively studied learning algorithms. We exhaustively characterize the parameter space of EWA learning, for any payoff matrix, and we understand the generic properties that imply convergent or non-convergent behaviour in 2 x 2 games. Irrational choice and lack of incentives imply convergence to a mixed strategy in the centre of the strategy simplex, possibly far from the Nash Equilibrium (NE). In the opposite limit, in which the players quickly modify their strategies, the behaviour depends on the payoff matrix: (i) a strong discrepancy between the pure strategies describes dominance-solvable games, which show convergence to a unique fixed point close to the NE; (ii) a preference towards profiles of strategies along the main diagonal describes coordination games, with multiple stable fixed points corresponding to the NE; (iii) a cycle of best responses defines discoordination games, which commonly yield limit cycles or low-dimensional chaos. While it is well known that mixed strategy equilibria may be unstable, our approach is novel from several perspectives: we fully analyse EWA and provide explicit thresholds that define the onset of instability; we find an emerging taxonomy of the learning dynamics, without focusing on specific classes of games ex-ante; we show that chaos can occur even in the simplest games; we make a precise theoretical prediction that can be tested against data on experimental learning of discoordination games.

[1]  J. Robinson AN ITERATIVE METHOD OF SOLVING A GAME , 1951, Classics in Game Theory.

[2]  Frederick Mosteller,et al.  Stochastic Models for Learning , 1956 .

[3]  K. Miyasawa ON THE CONVERGENCE OF THE LEARNING PROCESS IN A 2 X 2 NON-ZERO-SUM TWO-PERSON GAME , 1961 .

[4]  J. Muth Rational Expectations and the Theory of Price Movements , 1961 .

[5]  L. Shapley SOME TOPICS IN TWO-PERSON GAMES , 1963 .

[6]  V. Crawford Learning the Optimal Strategy in a Zero-Sum Game , 1974 .

[7]  G. Benettin,et al.  Lyapunov Characteristic Exponents for smooth dynamical systems and for hamiltonian systems; a method for computing all of them. Part 1: Theory , 1980 .

[8]  Y. Pomeau,et al.  Intermittent transition to turbulence in dissipative dynamical systems , 1980 .

[9]  B. Huberman,et al.  Fluctuations and simple chaotic dynamics , 1982 .

[10]  Mw Hirsch,et al.  Chaos In Dynamical Systems , 2016 .

[11]  Dale O. Stahl,et al.  On the instability of mixed-strategy Nash equilibria , 1988 .

[12]  John Nachbar “Evolutionary” selection dynamics in games: Convergence and limit properties , 1990 .

[13]  Robert W. Rosenthal,et al.  Testing the Minimax Hypothesis: A Re-examination of O'Neill's Game Experiment , 1990 .

[14]  Paul R. Milgrom,et al.  Adaptive and sophisticated learning in normal form games , 1991 .

[15]  Reinhard Selten,et al.  Anticipatory Learning in Two-Person Games , 1991 .

[16]  M. Macy Learning to Cooperate: Stochastic and Tacit Collusion in Social Exchange , 1991, American Journal of Sociology.

[17]  Opper,et al.  Phase transition and 1/f noise in a game dynamical model. , 1992, Physical review letters.

[18]  John Conlisk,et al.  Adaptation in games: Two solutions to the Crawford puzzle , 1993 .

[19]  H. Young,et al.  The Evolution of Conventions , 1993 .

[20]  David M. Kreps,et al.  Learning Mixed Equilibria , 1993 .

[21]  John Conlisk Adaptive tactics in games: Further solutions to the Crawford puzzle , 1993 .

[22]  R. Bloomfield Learning a mixed strategy equilibrium in the laboratory , 1994 .

[23]  Dilip Mookherjee,et al.  Learning behavior in an experimental matching pennies game , 1994 .

[24]  Ariel Rubinstein,et al.  A Course in Game Theory , 1995 .

[25]  Doron Sonsino Learning to learn, pattern recognition, and Nash equilibrium , 1995 .

[26]  R. Nagel Unraveling in Guessing Games: An Experimental Study , 1995 .

[27]  R. McKelvey,et al.  Quantal Response Equilibria for Normal Form Games , 1995 .

[28]  L. Shapley,et al.  Fictitious Play Property for Games with Identical Interests , 1996 .

[29]  Dean P. Foster,et al.  On the Nonconvergence of Fictitious Play in Coordination Games , 1998 .

[30]  D. Monderer,et al.  A 2£ 2 Game without the Fictitious Play Property ⁄ , 1996 .

[31]  D. Stahl Boundedly rational rule learning in a guessing game , 1996 .

[32]  Daniel Friedman,et al.  Individual Learning in Normal Form Games: Some Laboratory Results☆☆☆ , 1997 .

[33]  Tilman Börgers,et al.  Learning Through Reinforcement and Replicator Dynamics , 1997 .

[34]  William A. Brock,et al.  A rational route to randomness , 1997 .

[35]  A. Roth,et al.  Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria , 1998 .

[36]  Josef Hofbauer,et al.  Evolutionary Games and Population Dynamics , 1998 .

[37]  D. Fudenberg,et al.  The Theory of Learning in Games , 1998 .

[38]  Martin Weigt,et al.  Entropy and typical properties of Nash equilibria in two-player games , 1999 .

[39]  Colin Camerer,et al.  Experience‐weighted Attraction Learning in Normal Form Games , 1999 .

[40]  E. Hopkins Two Competing Models of How People Learn in Games (first version) , 1999 .

[41]  M. Hirsch,et al.  Mixed Equilibria and Dynamical Systems Arising from Fictitious Play in Perturbed Games , 1999 .

[42]  S. Hahn The convergence of fictitious play in 3×3 games with strategic complementarities , 1999 .

[43]  Josef Hofbauer,et al.  Sophisticated imitation in cyclic games , 2000 .

[44]  Herbert Gintis,et al.  Game Theory Evolving: A Problem-Centered Introduction to Modeling Strategic Interaction - Second Edition , 2009 .

[45]  H P Young,et al.  On the impossibility of predicting the behavior of rational agents , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[46]  M. Macy,et al.  Learning dynamics in social dilemmas , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[47]  J. Crutchfield,et al.  Coupled replicator equations for the dynamics of learning in multiagent systems. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[48]  Teck-Hua Ho,et al.  Self-tuning experience weighted attraction learning in games , 2007, J. Econ. Theory.

[49]  Ulrich Berger Learning in games with strategic complementarities revisited , 2008, J. Econ. Theory.

[50]  Josef Hofbauer,et al.  Learning in games with unstable equilibria , 2005, J. Econ. Theory.

[51]  Tobias Galla,et al.  Intrinsic noise in game dynamical learning. , 2009, Physical review letters.

[52]  William H. Sandholm,et al.  Population Games And Evolutionary Dynamics , 2010, Economic learning and social evolution.

[53]  Tobias Galla,et al.  Cycles of cooperation and defection in imperfect learning , 2011, ArXiv.

[54]  H. Young,et al.  Stochastic Learning Dynamics and Speed of Convergence in Population Games , 2011 .

[55]  Daniele Vilone,et al.  Chaos and unpredictability in evolutionary dynamics in discrete time. , 2011, Physical review letters.

[56]  Tobias Galla,et al.  Complex dynamics in learning complicated games , 2011, Proceedings of the National Academy of Sciences.

[57]  Timothy N. Cason,et al.  Cycles and Instability in a Rock-Paper-Scissors Population Game: a Continuous Time Experiment ∗ , 2014 .

[58]  Jan Tuinstra,et al.  UvA-DARE ( Digital Academic Repository ) Evolutionary Competition Between Adjustment Processes in Cournot Oligopoly : Instability and Complex Dynamics , 2018 .

[59]  Augustin M. Cournot Cournot, Antoine Augustin: Recherches sur les principes mathématiques de la théorie des richesses , 2019, Die 100 wichtigsten Werke der Ökonomie.

[60]  Marco Pangallo,et al.  Best reply structure and equilibrium convergence in generic games , 2017, Science Advances.