Strategic Experimentation with Private Payoffs

We consider a game of strategic experimentation in which players face identical discrete-time bandit problems with a safe and a risky arm. In any period, the risky arm yields either a success or a failure, and the first success reveals the risky arm to dominate the safe one. When payoffs are public information, the ensuing free-rider problem is so severe that equilibrium experimentation ceases at the same threshold belief at which a single agent would stop, even if players can coordinate their actions through mediated communication. When payoffs are private information and the success probability on the risky arm is not too high, however, the socially optimal symmetric experimentation profile can be supported as a perfect Bayesian equilibrium for sufficiently optimistic prior beliefs, even if players can only communicate via binary cheap-talk messages.

[1]  D. Bergemann,et al.  The Financing of Innovation: Learning and Stopping , 2004 .

[2]  Johannes Horner,et al.  Collaborating , 2009 .

[3]  Godfrey Keller,et al.  Breakdowns , 2012 .

[4]  David A. Malueg,et al.  Dynamic R&D Competition with Learning , 1997 .

[5]  Michihiro Kandori,et al.  Private Observation, Communication and Collusion , 1998 .

[6]  Navin Kartik,et al.  Optimal Contracts for Experimentation , 2016 .

[7]  D. Bergemann,et al.  Venture capital financing, moral hazard, and learning , 1998 .

[8]  Olivier Compte Communication in repeated games with imperfect private monitoring , 1998 .

[9]  A. Pope Learning faster or more precisely ? Strategic experimentation in networks , 2015 .

[10]  David Besanko,et al.  The Impact of Market Structure and Learning on the Tradeoff between R&D Competition and Cooperation , 2013 .

[11]  M. Cripps,et al.  Strategic Experimentation with Exponential Bandits , 2003 .

[12]  R. Myerson MULTISTAGE GAMES WITH COMMUNICATION , 1984 .

[13]  Yair Tauman,et al.  On the value of information in a strategic conflict , 1990 .

[14]  Nicolas Vieille,et al.  Social Learning in One-Arm Bandit Problems , 2007 .

[15]  Strongly Symmetric Equilibria in Bandit Games , 2014 .

[16]  S. Athey,et al.  Collusion With Persistent Cost Shocks , 2008 .

[17]  Philipp Strack,et al.  Strategic Experimentation with Private Payoffs , 2015 .

[18]  Francesco Squintani,et al.  Preemption Games with Private Information , 2011 .

[19]  M. Rothschild A two-armed bandit theory of market pricing , 1974 .

[20]  Godfrey Keller,et al.  Strategic Experimentation with Poisson Bandits , 2009 .

[21]  Caroline D. Thomas Strategic Experimentation With Congestion , 2018, American Economic Journal: Microeconomics.

[22]  Jay Pil Choi,et al.  Dynamic R&D Competition under "Hazard Rate" Uncertainty , 1991 .

[23]  F. Forges Published by: The , 2022 .

[24]  Juuso Välimäki,et al.  Learning and Information Aggregation in an Exit Game , 2011 .

[25]  L. Samuelson,et al.  Incentives for Experimenting Agents , 2009 .

[26]  Kostas Bimpikis,et al.  Designing Dynamic Contests , 2015, EC.

[27]  Strategic experimentation : The undiscounted case , 2000 .

[28]  Takuo Sugaya,et al.  Perfect Versus Imperfect Monitoring in Repeated Games , 2014 .

[29]  Eric van Damme,et al.  Non-Cooperative Games , 2000 .

[30]  Jean Guillaume Forand Keeping Your Options Open , 2010, New Electronics.

[31]  Sven Rady,et al.  Negatively Correlated Bandits , 2008 .

[32]  Andreas Blume,et al.  Private Monitoring in Auctions , 2003, J. Econ. Theory.

[33]  Navin Kartik,et al.  Contests for Experimentation , 2017, Journal of Political Economy.

[34]  K. Arrow,et al.  The New Palgrave Dictionary of Economics , 2020 .

[35]  Jean-Paul Décamps,et al.  Investment timing and learning externalities , 2004, J. Econ. Theory.

[36]  D. Fudenberg,et al.  Subgame-perfect equilibria of finite- and infinite-horizon games , 1981 .

[37]  Asuman Ozdaglar,et al.  Experimentation, Patents, and Innovation , 2008 .