Overcoming the Winner's Curse: An Adaptive Learning Perspective

Even though experience was found to improve decision-making in several tasks, there are instances in which learning is ineffective. The current paper studies one example of the last category, namely the persisting tendency of negotiators in bilateral bargaining under asymmetric information to ignore the decisions of their opponent(s), which can result in negative profits also known as the "winner's curse". Bazerman and his colleagues studied this phenomenon extensively using a task known as the "Acquiring a Company" task. One surprising finding is that even experienced participants showed no adjustment to avoid the winner's curse. The current study suggests that the observed persistence of sub-optimal behavior is largely due to the variability in the environment that leads to an ambiguous feedback. Since participants adjust their behavior adaptively, i.e., they condition their behavior on outcomes of the previous rounds, which have a high variance, it is difficult for them to overcome the winner's curse in this situation. In a series of experiments using the "Acquiring a Company" task we decreased the variance in the payoff. We find that decreasing the variance improves performance, but it does not completely eliminate the winner's curse. However, even when participants were given explicit information about the expected profit of each bid, they still overbid, suggesting that the difficulty to learn to avoid the winner's curse can be partially attributed to the utility participants have from gambling. The results show the importance of giving negotiators noise-free feedback in order to improve negotiation skills.

[1]  A. Roth,et al.  Learning in Extensive-Form Games: Experimental Data and Simple Dynamic Models in the Intermediate Term* , 1995 .

[2]  D. Cross,et al.  Prior Experience and Patterning in a Prisoner's Dilemma Game , 1998 .

[3]  R. Hogarth Judgement and choice: The psychology of decision , 1982 .

[4]  William Samuelson,et al.  I Won the Auction But Don't Want the Prize , 1983 .

[5]  V. Smith,et al.  Research in Experimental Economics , 1979 .

[6]  Charles A. Holt,et al.  The Loser's Curse , 1994 .

[7]  A. Tversky,et al.  Rational choice and the framing of decisions , 1990 .

[8]  J. Kagel,et al.  Handbook of Experimental Economics , 1997 .

[9]  Timothy T. Baldwin,et al.  TRANSFER OF TRAINING: A REVIEW AND DIRECTIONS FOR FUTURE RESEARCH , 1988 .

[10]  J. Kagel,et al.  Learning in common value auctions: Some initial observations☆ , 1994 .

[11]  Bereby-Meyer,et al.  On Learning To Become a Successful Loser: A Comparison of Alternative Abstractions of Learning Processes in the Loss Domain. , 1998, Journal of mathematical psychology.

[12]  J. L. Myers,et al.  Range of Payoffs and Feedback in Risk Taking , 1962 .

[13]  J. Potters,et al.  An experiment on risk taking and evaluation periods , 1997 .

[14]  M. Allais Le comportement de l'homme rationnel devant le risque : critique des postulats et axiomes de l'ecole americaine , 1953 .

[15]  W. O. Jenkins,et al.  Partial reinforcement: a review and critique. , 1950, Psychological bulletin.

[16]  M. Kaplan,et al.  Human judgment and decision processes , 1975 .

[17]  J. Baron Thinking and Deciding , 2023 .

[18]  Alvin E. Roth,et al.  Laboratory experimentation in economics: List of contributors , 1987 .

[19]  J. Keith Murnighan,et al.  Learning to avoid the winner's curse , 1996 .

[20]  A. Roth,et al.  Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria , 1998 .

[21]  R. Thaler,et al.  Myopic Loss Aversion and the Equity Premium Puzzle , 1993 .

[22]  Paul Slovic,et al.  Multidimensional functional learning (MFL) and some new conceptions of feedback , 1981 .

[23]  S. Weinstock Acquisition and extinction of a partially reinforced running response at a 24-hour intertrial interval. , 1958, Journal of experimental psychology.

[24]  A. Kluger,et al.  The effects of feedback interventions on performance: A historical review, a meta-analysis, and a preliminary feedback intervention theory. , 1996 .

[25]  B. Brehmer In one word: Not from experience. , 1980 .

[26]  R. Thaler,et al.  Laboratory experimentation in economics: The psychology of choice and the assumptions of economics , 1987 .

[27]  C. Plott,et al.  The Winner's Curse: Experiments with Buyers and with Sellers , 1989 .

[28]  J. Baron Thinking and deciding, 3rd ed. , 2000 .

[29]  M. Doherty,et al.  Effects of cognitive feedback on performance. , 1989 .

[30]  J. L. Myers,et al.  Effects of range of payoffs as a variable in risk taking. , 1960, Journal of experimental psychology.

[31]  J. Kagel,et al.  The Winner's Curse and Public Information in Common Value Auctions: Comment , 1999 .

[32]  A. Tversky,et al.  The Effect of Myopia and Loss Aversion on Risk Taking: An Experimental Test , 1997 .

[33]  Sheryl B. Ball,et al.  An evaluation of learning in the bilateral winner's curse , 1991 .

[34]  R. Selten,et al.  Experimental Sealed Bid First Price Auctions with Directly Observed Bid Functions , 1994 .

[35]  Brit Grosskopf Reinforcement and Directional Learning in the Ultimatum Game with Responder Competition , 2003 .

[36]  T. R. Stewart,et al.  10 – SOCIAL JUDGMENT THEORY , 1975 .

[37]  R. Hogarth,et al.  Confidence in judgment: Persistence of the illusion of validity. , 1978 .

[38]  J. Kagel,et al.  Common Value Auctions and the Winner's Curse , 2002 .

[39]  M. Bazerman,et al.  Overcoming Focusing Failures in Competitive Environments , 2004 .

[40]  D. Robbins Partial reinforcement: A selective review of the alleyway literature since 1960. , 1971 .

[41]  Kenneth R. Hammond,et al.  Negative effects of outcome-feedback in multiple-cue probability learning. , 1973 .

[42]  R. Selten,et al.  End behavior in sequences of finite prisoner's dilemma supergames , 1986 .

[43]  A. Tversky,et al.  Judgment under Uncertainty: Heuristics and Biases , 1974, Science.

[44]  Alvin E. Roth,et al.  The effect of adding a constant to all payoffs: experimental investigation, and implications for reinforcement learning models , 1999 .