Functional EWA: A One-parameter Theory of Learning in Games

Functional experience weighted attraction (fEWA) is a one-parameter theory of learning in games. It approximates the free parameters in an earlier model (EWA) with functions of experience. The theory was originally tested on seven different games and compared to four other learning and equilibrium theories, then three more games were added. Generally fEWA or parameterized EWA predict best out-of-sample, but one kind of reinforcement learning predicts well in games with mixed-strategy equilibrium. Of the learning models, belief learning models fit worst but fit better than noisy (quantal response) equilibrium models. The economic value of a theory is measured by how much more subjects would have earned if they followed the theory's recommendations. Most learning theories add value (though equilibrium theories often subtract value) and fEWA and EWA usually add the most value.

[1]  Dilip Mookherjee,et al.  Learning and Decision Costs in Experimental Constant Sum Games , 1997 .

[2]  E. Hopkins Two Competing Models of How People Learn in Games (first version) , 1999 .

[3]  Charles A. Holt,et al.  Anomalous Behavior in a Traveler's Dilemma? , 1999 .

[4]  A. Roth,et al.  Jumping the Gun: Imperfections and Institutions Related to the Timing of Market Transactions , 1994 .

[5]  J. Bather,et al.  Multi‐Armed Bandit Allocation Indices , 1990 .

[6]  A. Roth,et al.  Learning in Extensive-Form Games: Experimental Data and Simple Dynamic Models in the Intermediate Term* , 1995 .

[7]  V. Crawford Adaptive dynamics in coordination games , 1995 .

[8]  R. McKelvey,et al.  Quantal Response Equilibria for Normal Form Games , 1995 .

[9]  D. Stahl Boundedly rational rule learning in a guessing game , 1996 .

[10]  Timothy C. Salmon Evidence for Learning to Learn Behavior in Normal Form Games , 2004 .

[11]  Colin Camerer,et al.  Experience‐weighted Attraction Learning in Normal Form Games , 1999 .

[12]  Alvin E. Roth,et al.  The effect of adding a constant to all payoffs: experimental investigation, and implications for reinforcement learning models , 1999 .

[13]  A. Rapoport,et al.  Mixed strategies and iterative elimination of strongly dominated strategies: an experimental investi , 2000 .

[14]  Christian M. Ernst,et al.  Multi-armed Bandit Allocation Indices , 1989 .

[15]  Colin Camerer,et al.  Iterated Dominance and Iterated Best-Response in Experimental P-Beauty Contests , 1998 .

[16]  William Samuelson,et al.  Status quo bias in decision making , 1988 .

[17]  Philip W. Anderson,et al.  WHAT MAD PURSUIT. A Personal View of Scientific Discovery. , 1989 .

[18]  J. Robinson AN ITERATIVE METHOD OF SOLVING A GAME , 1951, Classics in Game Theory.

[19]  O. H. Brownlee,et al.  ACTIVITY ANALYSIS OF PRODUCTION AND ALLOCATION , 1952 .

[20]  J. Huyck,et al.  Tacit Coordination Games, Strategic Uncertainty, and Coordination Failure , 1990 .

[21]  Dana Heller Parametric Adaptive Learning , 2000 .

[22]  Alvin E. Roth,et al.  Modelling Predicting How People Play Games: Reinforcement learning in experimental games with unique , 1998 .

[23]  Teck-Hua Ho,et al.  A Functional Theory of Learning in Games , 2001 .

[24]  David A. Wise,et al.  Price Differences in almost Competitive Markets , 1979 .

[25]  Colin Camerer,et al.  Sophisticated EWA Learning and Strategic Teaching in Repeated Games , 2000 .

[26]  Daniel Kahneman,et al.  Anomalies: The Endowment Effect, Loss Aversion, and Status Quo Bias , 1991 .

[27]  A. Cabrales,et al.  Equilibrium selection through incomplete information in coordination games: an experimental study , 2002 .

[28]  W. Arthur,et al.  INCREASING RETURNS AND LOCK-IN BY HISTORICAL EVENTS , 1989 .

[29]  D. Kahneman,et al.  Anomalies: The Endowment Effect, Loss Aversion, and Status Quo Bias , 1991 .

[30]  Colin Camerer,et al.  Strategic teaching and equilibrium models of repeated trust and entry games , 2003 .

[31]  S. Paradiso The Emotional Brain: The Mysterious Underpinnings of Emotional Life , 1998 .

[32]  John B. Van Huyck,et al.  Adaptive behavior and coordination failure , 1997 .

[33]  Kaushik Basu,et al.  The traveler's dilemma: Paradoxes of rationality in game theory , 1994 .

[34]  A. Rapoport,et al.  Coordination, “Magic,” and Reinforcement Learning in a Market Entry Game , 1998 .

[35]  Dale O. Stahl,et al.  Sophisticated Learning and Learning Sophistication , 2003 .

[36]  Dilip Mookherjee,et al.  Learning behavior in an experimental matching pennies game , 1994 .

[37]  M. Marchesi,et al.  Journal of economic behavior and organization: special issue on heterogeneous interacting agents in financial markets , 2002 .

[38]  D. Fudenberg,et al.  The Theory of Learning in Games , 1998 .

[39]  Teck-Hua Ho,et al.  Experience-Weighted Attraction Learning in Coordination Games: Probability Rules, Heterogeneity, and Time-Variation. , 1998, Journal of mathematical psychology.

[40]  Colin Camerer,et al.  Economic Value of EWA Lite: A Functional Theory of Learning in Games , 2001 .

[41]  Alvin E. Roth,et al.  The Effect of Adding a Constant to All Payoffs: Experimental Investigation, and a Reinforcement Learning Model with Self-Adjusting Speed of Learning , 1999 .

[42]  Ali Hortaçsu,et al.  Winner's Curse, Reserve Prices and Endogenous Entry: Empirical Insights from Ebay Auctions , 2003 .

[43]  Roberto A. Weber,et al.  The effects of payoff magnitude and heterogeneity on behavior in 2 x 2 games with unique mixed strategy equilibria , 2000 .

[44]  Teck-Hua Ho,et al.  EWA learning in bilateral call markets , 2002 .

[45]  Yan Chen,et al.  Learning under limited information , 2003, Games Econ. Behav..

[46]  Colin Camerer Behavioral Game Theory: Experiments in Strategic Interaction , 2003 .

[47]  Frederick Mosteller,et al.  Stochastic Models for Learning , 1956 .

[48]  R. Selten,et al.  End behavior in sequences of finite prisoner's dilemma supergames , 1986 .

[49]  A. Roth,et al.  Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria , 1998 .

[50]  Patrick H. McAllister,et al.  Adaptive approaches to stochastic programming , 1991, Ann. Oper. Res..

[51]  Jason M. Shachat,et al.  Mixed Strategy Play and the Minimax Hypothesis , 2002, J. Econ. Theory.

[52]  Individual Di ® erences in EWA Learning with Partial Payo ® Information , 2001 .

[53]  Allan Timmermann,et al.  How Learning in Financial Markets Generates Excess Volatility and Predictability in Stock Prices , 1993 .

[54]  Wagner A. Kamakura,et al.  A Parsimonious Model of SKU Choice: Familiarity-based Reinforcement and Response Sensitivity , 1999 .

[55]  W. Arthur Designing Economic Agents that Act Like Human Agents: A Behavioral Approach to Bounded Rationality , 1991 .

[56]  Daniel Friedman,et al.  Individual Learning in Normal Form Games: Some Laboratory Results☆☆☆ , 1997 .