Recency, records and recaps: learning and non-equilibrium behavior in a simple decision problem

Nash equilibrium takes optimization as a primitive, but suboptimal behavior can persist in simple stochastic decision problems. This has motivated the development of other equilibrium concepts such as cursed equilibrium and behavioral equilibrium. We experimentally study a simple adverse selection (or “lemons”) problem and find that learning models that heavily discount past information (i.e. display recency bias) explain patterns of behavior better than Nash, cursed or behavioral equilibrium. Providing counterfactual information or a record of past outcomes does little to aid convergence to optimal strategies, but providing sample averages (“recaps”) gets individuals most of the way to optimality. Thus recency effects are not solely due to limited memory but stem from some other form of cognitive constraints. Our results show the importance of going beyond static optimization and incorporating features of human learning into economic models.

[1]  D. Fudenberg,et al.  Measuring Players' Losses in Experimental Games , 1997 .

[2]  Sheryl B. Ball,et al.  An evaluation of learning in the bilateral winner's curse , 1991 .

[3]  Daniel Friedman,et al.  Monty Hall's Three Doors: Construction and Deconstruction of a Choice Anomaly , 1998 .

[4]  Timothy J. Pleskac,et al.  Decisions from experience: Why small samples? , 2010, Cognition.

[5]  D. Fudenberg,et al.  Self-confirming equilibrium , 1993 .

[6]  I. Erev,et al.  Small feedback‐based decisions and their limited correspondence to description‐based decisions , 2003 .

[7]  David G. Rand,et al.  Economic Games on the Internet: The Effect of $1 Stakes , 2011, PloS one.

[8]  David G. Rand,et al.  The online laboratory: conducting experiments in a real labor market , 2010, ArXiv.

[9]  D. Posada,et al.  Model selection and model averaging in phylogenetics: advantages of akaike information criterion and bayesian approaches over likelihood ratio tests. , 2004, Systematic biology.

[10]  C. Chabris,et al.  Common (Mis)Beliefs about Memory: A Replication and Comparison of Telephone and Mechanical Turk Survey Methods , 2012, PloS one.

[11]  I. Erev,et al.  On Surprise, Change, and the Effect of Recent Outcomes , 2011, Front. Psychology.

[12]  Colin Camerer Behavioral Game Theory: Experiments in Strategic Interaction , 2003 .

[13]  Josef Hofbauer,et al.  Learning in games with unstable equilibria , 2005, J. Econ. Theory.

[14]  John E. Laird,et al.  Learning to play , 2009 .

[15]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[16]  David G. Rand,et al.  Social heuristics shape intuitive cooperation , 2014, Nature Communications.

[17]  Markku Kaustia,et al.  Do Investors Overweight Personal Experience? Evidence from IPO Subscriptions , 2007 .

[18]  Ehud Lehrer,et al.  Partially-Specified Probabilities: Decisions and Games , 2006 .

[19]  Gary E. Bolton,et al.  Learning and the Economics of Small Decisions , 2013 .

[20]  Daniel Friedman,et al.  Individual Learning in Normal Form Games: Some Laboratory Results☆☆☆ , 1997 .

[21]  Drew Fudenberg,et al.  Learning to Play Bayesian Games , 2001, Games Econ. Behav..

[22]  Sendhil Mullainathan,et al.  Learning Through Noticing: Theory and Experimental Evidence in Farming , 2012 .

[23]  Ulrike Malmendier,et al.  Depression Babies: Do Macroeconomic Experiences Affect Risk-Taking? , 2009 .

[24]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[25]  J. Kagel,et al.  Behavioral Equilibrium in Economies with Adverse Selection , 2007 .

[26]  S. Krauss,et al.  The psychology of the Monty Hall problem: discovering psychological mechanisms for solving a tenacious brain teaser. , 2003, Journal of experimental psychology. General.

[27]  Eld,et al.  Cursed Equilibrium , 2000 .

[28]  D. Fudenberg,et al.  Steady state learning and Nash equilibrium , 1993 .

[29]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[30]  Brit Grosskopf,et al.  Overcoming the Winner's Curse: An Adaptive Learning Perspective , 2002 .

[31]  K Fiedler Beware of samples! A cognitive-ecological sampling approach to judgment biases. , 2000, Psychological review.

[32]  Colin Camerer,et al.  Experience‐weighted Attraction Learning in Normal Form Games , 1999 .

[33]  David I. Laibson,et al.  Learning in the Credit Card Market , 2013 .

[34]  M. Bazerman,et al.  Focusing Failures in Competitive Environments: Explaining Decision Errors in the Monty Hall Game, the Acquiring a Company Problem, and Multiparty Ultimatums , 2003 .

[35]  George A. Akerlof,et al.  The Market for “Lemons”: Quality Uncertainty and the Market Mechanism , 1970 .

[36]  David M. Kreps,et al.  Learning in Extensive Games, I: Self-Confirming Equilibrium , 2010 .

[37]  J. Kagel,et al.  The Winner's Curse and Public Information in Common Value Auctions: Comment , 1999 .

[38]  D. Fudenberg,et al.  The Theory of Learning in Games , 1998 .

[39]  A. Tversky,et al.  Naive Strategies in Zero-Sum Games , 1993 .

[40]  udenberg Rationalizable partition-confirmed equilibrium D rew , 2015 .

[41]  Ignacio Palacios-Huerta Learning to Open Monty Hall's Doors , 2002 .

[42]  Pierpaolo Battigalli,et al.  Conjectural Equilibria and Rationalizability in a Game with Incomplete Information , 1997 .

[43]  G. A. Miller THE PSYCHOLOGICAL REVIEW THE MAGICAL NUMBER SEVEN, PLUS OR MINUS TWO: SOME LIMITS ON OUR CAPACITY FOR PROCESSING INFORMATION 1 , 1956 .

[44]  L. Samuelson Evolutionary Games and Equilibrium Selection , 1997 .

[45]  Dan Levin,et al.  The Origin of the Winner's Curse: A Laboratory Study , 2007 .

[46]  V. Crawford,et al.  Level-k Auctions: Can a Non-Equilibrium Model of Strategic Thinking Explain the Winner's Curse and Overbidding in Private-Value Auctions? , 2007 .

[47]  A. Peysakhovich,et al.  When Punishment Doesn't Pay: 'Cold Glow' and Decisions to Punish , 2015 .

[48]  D. Fudenberg,et al.  Payoff Information and Self-Confirming Equilibrium , 1999 .

[49]  D. Levine Is Behavioral Economics Doomed? The Ordinary versus the Extraordinary , 2012 .

[50]  A. Roth,et al.  Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria , 1998 .

[51]  David G. Rand,et al.  Spontaneous giving and calculated greed , 2012, Nature.

[52]  A. Cooper,et al.  Predictive Reward Signal of Dopamine Neurons , 2011 .

[53]  Colin Camerer,et al.  Neuroeconomics: decision making and the brain , 2008 .

[54]  A. Roth,et al.  Learning in Extensive-Form Games: Experimental Data and Simple Dynamic Models in the Intermediate Term* , 1995 .