Learning and Type Compatibility in Signaling Games

Which equilibria will arise in signaling games depends on how the receiver interprets deviations from the path of play. We develop a micro-foundation for these off-path beliefs, and an associated equilibrium refinement, in a model where equilibrium arises through non-equilibrium learning by populations of patient and long-lived senders and receivers. In our model, young senders are uncertain about the prevailing distribution of play, so they rationally send out-of-equilibrium signals as experiments to learn about the behavior of the population of receivers. Differences in the payoff functions of the types of senders generate different incentives for these experiments. Using the Gittins index (Gittins, 1979), we characterize which sender types use each signal more often, leading to a constraint on the receiver's off-path beliefs based on "type compatibility" and hence a learning-based equilibrium selection.

[1]  Drew Fudenberg,et al.  Rationalizable partition-confirmed equilibrium with heterogeneous beliefs , 2018, Games Econ. Behav..

[2]  Learning and Equilibrium Refinements in Signalling Games , 2017 .

[3]  D. Fudenberg,et al.  Bayesian posteriors for arbitrarily rare events , 2016, Proceedings of the National Academy of Sciences.

[4]  Jean-François Laslier,et al.  Stubborn learning , 2014, Theory and Decision.

[5]  Ignacio Esponda,et al.  Berk-Nash Equilibrium: A Framework for Modeling Agents with Misspecified Models , 2014, 1411.1152.

[6]  J. Norris Appendix: probability and measure , 1997 .

[7]  José Niño-Mora,et al.  Computing a Classic Index for Finite-Horizon Bandits , 2011, INFORMS J. Comput..

[8]  José Niòo-Mora Computing a Classic Index for Finite-Horizon Bandits , 2011 .

[9]  David M. Kreps,et al.  Learning in Extensive Games, II: Experimentation and Nash Equilibrium , 2010 .

[10]  Drew Fudenberg,et al.  Learning, Experimentation and Equilibrium in Games , 2010 .

[11]  D. Fudenberg,et al.  Superstition and Rational Learning , 2006 .

[12]  Dov Samet,et al.  Learning to play games in extensive form by valuation , 2001, J. Econ. Theory.

[13]  Drew Fudenberg,et al.  Learning to Play Bayesian Games , 2001, Games Econ. Behav..

[14]  D. Fudenberg,et al.  Payoff Information and Self-Confirming Equilibrium , 1999 .

[15]  Drew Fudenberg,et al.  Learning in extensive-form games I. Self-confirming equilibria , 1995 .

[16]  E. Kalai,et al.  Rational Learning Leads to Nash Equilibrium , 1993 .

[17]  David M. Kreps,et al.  Learning Mixed Equilibria , 1993 .

[18]  D. Fudenberg,et al.  Steady state learning and Nash equilibrium , 1993 .

[19]  D. Freedman,et al.  On the uniform consistency of Bayes estimates for multinomial probabilities , 1990 .

[20]  Lars Stole,et al.  Fixed-equilibrium rationalizability in signaling games , 1990 .

[21]  J. Sobel,et al.  Equilibrium Selection in Signaling Games , 1987 .

[22]  David M. Kreps,et al.  Signaling Games and Stable Equilibria , 1987 .

[23]  J. Gittins Bandit processes and dynamic allocation indices , 1979 .

[24]  M. Spence Job Market Signaling , 1973 .

[25]  R. Bellman A PROBLEM IN THE SEQUENTIAL DESIGN OF EXPERIMENTS , 1954 .