Sophisticated EWA Learning and Strategic Teaching in Repeated Games

Most learning models assume players are adaptive (i.e., they respond only to their own previous experience and ignore others' payoff information) and that behavior is not sensitive to the way in which players are matched. Empirical evidence suggests otherwise. In this paper, we extend our adaptive experience-weighted attraction (EWA) learning model to capture sophisticated learning and strategic teaching in repeated games. The generalized model assumes that there is a mixture of adaptive learners and sophisticated players. Like before, an adaptive learner adjusts his behavior the EWA way. A sophisticated player however does not learn and rationally best-responds to her forecasts of all other behaviors. A sophisticated player can be either myopic or foresighted. A foresighted player develops multiple-period rather than single-period forecasts of others' behaviors and chooses to 'teach' the other players by choosing a strategy scenario that gives her the highest discounted net present value. Consequently a foresighted player can develop a reputation for herself by strategic teaching if she is matched with an adaptive player repeatedly. We estimate the model using data fromp-beauty contests and repeated entry-deterrence (chain-store) games. Overall, the results show that the generalized model is better than the adaptive EWA model in describing and predicting behavior. Including teaching also allows an empirical learning-based approach to reputation formation which is at least as plausible as the now-standard type-based approach, and is superior in predictive performance in four di®erent ways.

[1]  O. H. Brownlee,et al.  ACTIVITY ANALYSIS OF PRODUCTION AND ALLOCATION , 1952 .

[2]  R. Rosenthal Games of perfect information, predatory pricing and the chain-store paradox , 1981 .

[3]  David M. Kreps,et al.  Reputation and imperfect information , 1982 .

[4]  R. Selten,et al.  End behavior in sequences of finite prisoner's dilemma supergames , 1986 .

[5]  J. Mertens,et al.  ON THE STRATEGIC STABILITY OF EQUILIBRIA , 1986 .

[6]  H. Rosenthal,et al.  Private incentives in social dilemmas: The effects of incomplete information and altruism , 1988 .

[7]  Colin Camerer,et al.  EXPERIMENTAL TESTS OF A SEQUENTIAL EQUILIBRIUM REPUTATION MODEL , 1988 .

[8]  D. Fudenberg,et al.  Reputation and Equilibrium Selection in Games with a Patient Player , 1989 .

[9]  J. Huyck,et al.  Tacit Coordination Games, Strategic Uncertainty, and Coordination Failure , 1990 .

[10]  R. Selten Evolution, learning, and economic behavior , 1991 .

[11]  Patrick H. McAllister,et al.  Adaptive approaches to stochastic programming , 1991, Ann. Oper. Res..

[12]  A. Tversky,et al.  Loss Aversion in Riskless Choice: A Reference-Dependent Model , 1991 .

[13]  永福 智志 The Organization of Learning , 2005, Journal of Cognitive Neuroscience.

[14]  R. McKelvey,et al.  An experimental study of the centipede game , 1992 .

[15]  J. Ochs,et al.  The Sequential Equilibrium Theory of Reputation Building: A Further Test , 1992 .

[16]  John H. Miller,et al.  Rational Cooperation in the Finitely Repeated Prisoner's Dilemma: Experimental Evidence , 1993 .

[17]  Andrew Schotter,et al.  Does Game Theory Predict Well for the Wrong Reasons: An Experimental Investigation , 1993 .

[18]  Rosemarie Nagel,et al.  Experimental Results on Interactive Competitive Guessing , 1993 .

[19]  Colin Camerer,et al.  Cognition and framing in sequential bargaining for gains and losses , 1993 .

[20]  Joel Watson,et al.  A ‘Reputation’ Refinement without Equilibrium , 1993 .

[21]  Dilip Mookherjee,et al.  Learning behavior in an experimental matching pennies game , 1994 .

[22]  J. Kagel,et al.  On the Existence of Predatory Pricing: An Experimental Study of Reputation and Entry Deterrence in the Chain-Store Game , 1994 .

[23]  Ken Binmore,et al.  Learning to be imperfect: The ultimatum game , 1995 .

[24]  D. Stahl,et al.  On Players' Models of Other Players: Theory and Experimental Evidence , 1995 .

[25]  A. Roth,et al.  Learning in Extensive-Form Games: Experimental Data and Simple Dynamic Models in the Intermediate Term* , 1995 .

[26]  R. McKelvey,et al.  Quantal Response Equilibria for Normal Form Games , 1995 .

[27]  V. Crawford Adaptive dynamics in coordination games , 1995 .

[28]  C. A. Holt,et al.  5. Industrial Organization: A Survey of Laboratory Research , 1995 .

[29]  William W. Sharkey,et al.  Cost allocation , 1996 .

[30]  Gérard P. Cachon,et al.  Loss-Avoidance and Forward Induction in Experimental Coordination Games , 1996 .

[31]  D. Stahl Boundedly rational rule learning in a guessing game , 1996 .

[32]  Glenn Ellison,et al.  Learning from Personal Experience: One Rational Guy and the Justification of Myopia , 1997 .

[33]  John Duffy,et al.  On the Robustness of Behaviour in Experimental ‘Beauty Contest’ Games , 1997 .

[34]  Dilip Mookherjee,et al.  Learning and Decision Costs in Experimental Constant Sum Games , 1997 .

[35]  Lutz Kilian,et al.  Asynchronicity and Learning in Cost Sharing Mechanisms I Am Indebted to Scott Shenker for His Many Helpful Comments and Suggestions. I Thank , 1997 .

[36]  V. Crawford,et al.  What Price Coordination? The Efficiency-Enhancing Effect of Auctioning the Right to Play , 1997 .

[37]  Colin Camerer Progress in Behavioral Game Theory , 1997 .

[38]  J. Friedman,et al.  Boundedly Rational Nash Equilibrium: A Probabilistic Choice Approach☆☆☆ , 1997 .

[39]  Pierpaolo Battigalli,et al.  On 'Reputation' Refinements with Heterogeneous Beliefs , 1997 .

[40]  A. Roth,et al.  Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria , 1998 .

[41]  A. Roth,et al.  Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria , 1998 .

[42]  Wilfred Amaldoss,et al.  Collaborating to compete: A game-theoretical model and experimental investigation of competition among alliances , 1998 .

[43]  Teck-Hua Ho,et al.  Experience-Weighted Attraction Learning in Coordination Games: Probability Rules, Heterogeneity, and Time-Variation. , 1998, Journal of mathematical psychology.

[44]  R. McKelvey,et al.  Quantal Response Equilibria for Extensive Form Games , 1998 .

[45]  Andreas Blume,et al.  Learning in Sender-Receiver Games , 1998 .

[46]  Dale O. Stahl,et al.  Population Rule Learning in Symmetric Normal-Form Games , 2001 .

[47]  Miguel A. Costa-Gomes,et al.  Cognition and Behavior in Normal-Form Games: An Experimental Study , 1998 .

[48]  L. Samuelson,et al.  Risk dominance, payoff dominance and probabilistic choice learning , 1998 .

[49]  D. Fudenberg,et al.  The Theory of Learning in Games , 1998 .

[50]  Colin Camerer,et al.  Iterated Dominance and Iterated Best-Response in Experimental P-Beauty Contests , 1998 .

[51]  John Duffy,et al.  Does observation of others affect learning in strategic environments? An experimental study , 1999, Int. J. Game Theory.

[52]  I. Erev,et al.  Games and Human Behavior , 1999 .

[53]  Wagner A. Kamakura,et al.  A Parsimonious Model of SKU Choice: Familiarity-based Reinforcement and Response Sensitivity , 1999 .

[54]  Dirk Engelmann,et al.  The False Consensus Effect Disappears if Representative Information and Monetary Incentives Are Given , 2000 .

[55]  Colin Camerer,et al.  Experience‐weighted Attraction Learning in Normal Form Games , 1999 .

[56]  Colin Camerer,et al.  Experience-Weighted Attraction Learning in Games: EstimatesFrom Weak-Link Games , 1999 .

[57]  J K Goeree,et al.  Stochastic game theory: for playing games, not just for doing theory. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[58]  E. Hopkins Two Competing Models of How People Learn in Games (first version) , 1999 .

[59]  Richard D. McKelvey,et al.  The hard sciences. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[60]  Dale O. Stahl,et al.  Rule Learning in Symmetric Normal-Form Games: Theory and Evidence , 2000, Games Econ. Behav..

[61]  Dana Heller Parametric Adaptive Learning , 2000 .

[62]  Colin Camerer,et al.  Increasing Cooperation in Prisoner's Dilemmas by Establishing a Precedent of Efficiency in Coordination Games. , 2000, Organizational behavior and human decision processes.

[63]  Bruno Broseta,et al.  Adaptive Learning and Equilibrium Selection in Experimental Coordination Games: An ARCH(1) Approach , 2000, Games Econ. Behav..

[64]  A. Rapoport,et al.  Mixed strategies and iterative elimination of strongly dominated strategies: an experimental investi , 2000 .

[65]  Amnon Rapoport,et al.  Choice of Prizes Allocated by Multiple Lotteries with Endogenously Determined Probabilities , 2001 .

[66]  Colin F. Camerer,et al.  Experience-Weighted Attraction Learning in Sender-Receiver Signaling Games , 2001 .

[67]  Fang-Fang Tang,et al.  Anticipatory learning in two-person games: some experimental results , 2001 .

[68]  Georg Weizsäcker Ignoring the Rationality of Others: Evidence from Experimental Normal-form Games , 2001 .

[69]  Farshid Vahid,et al.  Predicting How People Play Games: A Simple Dynamic Model of Choice , 2001, Games Econ. Behav..

[70]  Timothy C. Salmon An Evaluation of Econometric Models of Adaptive Learning , 2001 .

[71]  Colin Camerer,et al.  Economic Value of EWA Lite: A Functional Theory of Learning in Games , 2001 .

[72]  Teck-Hua Ho,et al.  EWA learning in bilateral call markets , 2002 .

[73]  A. Schotter,et al.  An Experimental Study of Belief Learning Using Elicited Beliefs , 2002 .