Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria

The authors examine learning in a new experiment and in all the experiments they could locate that involve one hundred or more periods of play of games with a unique equilibrium in mixed strategies. They study both the ex post ('best fit') descriptive power of learning models and their ex ante predictive power, the latter by simulating each experiment using parameters estimated from the other experiments. Even a one-parameter reinforcement learning model robustly outperforms the equilibrium predictions. Predictive power is improved by adding 'forgetting' and 'experimentation,' or by allowing greater rationality as in probabilistic fictitious play. Implications for developing a low-rationality, cognitive game theory are discussed. Copyright 1998 by American Economic Association.
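
The abstract does not spell out the one-parameter model, so the following is only a minimal sketch of a basic cumulative-reinforcement rule, under stated assumptions: each player holds a propensity for each action, chooses actions with probability proportional to those propensities, and adds the payoff received (measured above the player's minimum possible payoff) to the propensity of the action just played. The single parameter is assumed here to be the initial propensity strength, `initial_strength`. The function names and the matching-pennies payoff table are hypothetical and serve only as illustration; they are not taken from the paper's experiments.

```python
import random


def sample_action(propensities, rng):
    """Pick an action with probability proportional to its propensity."""
    threshold = rng.random() * sum(propensities)
    cumulative = 0.0
    for action, q in enumerate(propensities):
        cumulative += q
        if threshold < cumulative:
            return action
    return len(propensities) - 1  # guard against floating-point round-off


def simulate(payoffs, periods=100, initial_strength=1.0, seed=0):
    """Simulate two reinforcement learners in a bimatrix game.

    payoffs[i][j] is a (row payoff, column payoff) pair.
    Assumption: initial_strength is the model's only free parameter,
    spread evenly over each player's actions.
    """
    rng = random.Random(seed)
    n_row, n_col = len(payoffs), len(payoffs[0])
    # Reinforcements are payoffs measured above each player's minimum payoff,
    # so propensities never decrease.
    min_row = min(cell[0] for row in payoffs for cell in row)
    min_col = min(cell[1] for row in payoffs for cell in row)
    q_row = [initial_strength / n_row] * n_row
    q_col = [initial_strength / n_col] * n_col
    row_counts = [0] * n_row
    for _ in range(periods):
        i = sample_action(q_row, rng)
        j = sample_action(q_col, rng)
        q_row[i] += payoffs[i][j][0] - min_row
        q_col[j] += payoffs[i][j][1] - min_col
        row_counts[i] += 1
    row_frequencies = [count / periods for count in row_counts]
    return row_frequencies, q_row, q_col


# Hypothetical example game: matching pennies, whose unique equilibrium is
# in mixed strategies (each player plays each action with probability 1/2).
MATCHING_PENNIES = [
    [(1, 0), (0, 1)],
    [(0, 1), (1, 0)],
]

if __name__ == "__main__":
    frequencies, q_row, q_col = simulate(MATCHING_PENNIES, periods=100)
    print("Row player's empirical action frequencies:", frequencies)
```

Comparing the simulated frequencies with the equilibrium mixture, for parameter values estimated on other games, is the kind of ex ante exercise the abstract describes. The 'forgetting' and 'experimentation' extensions are not included in this sketch.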
