Individual Differences in EWA Learning with Partial Payoff Information

We extend experience-weighted attraction (EWA) learning to games in which only the set of possible foregone payoffs from unchosen strategies is known, and we estimate parameters separately for each player to study heterogeneity. We assume players estimate the unknown foregone payoff from a strategy in one of three ways: by substituting the last payoff actually received from that strategy, by clairvoyantly guessing the actual foregone payoff, or by averaging the set of possible foregone payoffs conditional on the actual outcomes. All three assumptions improve the predictive accuracy of EWA. Individual parameter estimates suggest that players cluster into two separate subgroups (which differ from traditional reinforcement and belief learning).
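The three estimation rules can be illustrated with a minimal sketch. It assumes the standard EWA updating rule with decay parameter phi, foregone-payoff weight delta, and experience decay rho; the function names, the rule labels, and the numbers in the usage lines are hypothetical and are not taken from the paper. The sketch also assumes the strategy has been chosen at least once, so that a last received payoff exists.

```python
from statistics import mean

def estimate_foregone(rule, last_received, possible_set, actual_foregone):
    """Return a player's estimate of an unknown foregone payoff.

    rule is a hypothetical label for one of the three assumptions:
      "last_payoff": reuse the last payoff actually received from the strategy
      "clairvoyant": use the true foregone payoff (unknown to the player)
      "average":     average the possible foregone payoffs given the outcome
    """
    if rule == "last_payoff":
        return last_received
    if rule == "clairvoyant":
        return actual_foregone
    if rule == "average":
        return mean(possible_set)
    raise ValueError(f"unknown rule: {rule}")

def ewa_update(attractions, n_prev, chosen, payoffs, phi, delta, rho):
    """One round of the standard EWA attraction update (assumed form).

    payoffs[j] is the realized payoff for the chosen strategy and the
    estimated foregone payoff for every unchosen strategy.
    """
    n_new = rho * n_prev + 1.0
    updated = []
    for j, (attraction, payoff) in enumerate(zip(attractions, payoffs)):
        # The chosen strategy gets full weight; unchosen ones get weight delta.
        weight = 1.0 if j == chosen else delta
        updated.append((phi * n_prev * attraction + weight * payoff) / n_new)
    return updated, n_new

# Usage with made-up numbers: strategy 0 was chosen and paid 4; the foregone
# payoff of strategy 1 is only known to lie in the set {2, 6}.
estimate = estimate_foregone("average", last_received=3,
                             possible_set=[2, 6], actual_foregone=6)
new_attractions, new_n = ewa_update([0.0, 0.0], 1.0, chosen=0,
                                    payoffs=[4, estimate],
                                    phi=0.9, delta=0.5, rho=0.9)
```

Keeping the imputation step separate from the attraction update means the same update can be fitted under each of the three assumptions and the resulting predictive accuracy compared.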
