An experiment on learning with limited information: nonconvergence, experimentation cascades, and the advantage of being slow

We present the results of an experiment on learning in a continuous-time low-information setting. For a dominance solvable version of a Cournot oligopoly with differentiated products, we find little evidence of convergence to the Nash equilibrium. In an asynchronous setting, characterized by players updating their strategies at different frequencies, play tends toward the Stackelberg outcome which favors the slower player. Convergence is significantly more robust for a serial cost sharing game, which satisfies a stronger solution concept of overwhelmed solvability. As the number of players grows, this improved convergence tends to diminish, seemingly driven by frequent and highly structured experimentation by players leading to a cascading effect in which experimentation by one player induces experimentation by others. These results have implications both for traditional oligopoly competition and for a wide variety of strategic situations arising on the Internet.

[1]  A. Roth,et al.  Learning in Extensive-Form Games: Experimental Data and Simple Dynamic Models in the Intermediate Term* , 1995 .

[2]  Drew Fudenberg,et al.  Reputation and Equilibrium Selection in Games with a Single Long-Run Player , 1989 .

[3]  C. Plott,et al.  Rational Expectations and the Aggregation of Diverse Information in Laboratory Security Markets , 1988 .

[4]  Colin Camerer,et al.  Experience‐weighted Attraction Learning in Normal Form Games , 1999 .

[5]  Eric J. Friedman,et al.  Learning and Implementation on the Internet , 1997 .

[6]  C. Granger Investigating Causal Relations by Econometric Models and Cross-Spectral Methods , 1969 .

[7]  H. Young,et al.  The Evolution of Conventions , 1993 .

[8]  Mikhael Shor,et al.  Learning to Respond: The Use of Heuristics in Dynamic Games , 2003 .

[9]  Paul R. Milgrom,et al.  Adaptive and sophisticated learning in normal form games , 1991 .

[10]  T. Palfrey,et al.  Asset Valuation in an Experimental Market , 1982 .

[11]  Teck-Hua Ho,et al.  Sophisticated Experience-Weighted Attraction Learning and Strategic Teaching in Repeated Games , 2002, J. Econ. Theory.

[12]  Scott Shenker,et al.  Making greed work in networks: a game-theoretic analysis of switch service disciplines , 1994 .

[13]  J. Robinson AN ITERATIVE METHOD OF SOLVING A GAME , 1951, Classics in Game Theory.

[14]  Aurel A. Lazar,et al.  A Game Theoretic Approach to Decentralized Flow Control of Markovian Queueing Networks , 1987, Performance.

[15]  D. Fudenberg,et al.  Reputation and Equilibrium Selection in Games with a Patient Player , 1989 .

[16]  A. Roth,et al.  THE ROLE OF INFORMATION IN BARGAINING: AN EXPERIMENTAL STUDY' , 1982 .

[17]  M. Deutsch,et al.  Sociology's Flexner Report. (Economics and the Social Sciences: The Education of Sociologists in the United States) , 1964 .

[18]  K. Binmore,et al.  Does Minimax Work? An Experimental Study , 2001 .

[19]  Dilip Mookherjee,et al.  Learning behavior in an experimental matching pennies game , 1994 .

[20]  R. Vohra,et al.  Calibrated Learning and Correlated Equilibrium , 1996 .

[21]  Ariel Orda,et al.  The designer's perspective to noncooperative networks , 1995, Proceedings of INFOCOM'95.

[22]  Andrew Whinston,et al.  A Stochastic Equilibrium Model of Internet Pricing , 1997 .

[23]  Joel Watson,et al.  A ‘Reputation’ Refinement without Equilibrium , 1993 .

[24]  Jeffrey O. Kephart,et al.  Shopbots and Pricebots , 1999, IJCAI.

[25]  Deborah Estrin,et al.  Pricing in computer networks: motivation, formulation, and example , 1993, TNET.

[26]  John Nagle,et al.  On Packet Switches with Infinite Storage , 1985, IEEE Trans. Commun..

[27]  Tad Hogg,et al.  Distributed Computation as an Economic System , 1995 .

[28]  Scott Shenker,et al.  Analysis and simulation of a fair queueing algorithm , 1989, SIGCOMM '89.

[29]  Yan Chen An experimental study of serial and average cost pricing mechanisms , 2003 .

[30]  R. Porter,et al.  NONCOOPERATIVE COLLUSION UNDER IMPERFECT PRICE INFORMATION , 1984 .

[31]  Scott Shenker,et al.  Learning in Network Contexts: Experimental Results from Simulations , 2001, Games Econ. Behav..

[32]  G. Harrison Theory and Misbehavior of First-Price Auctions , 1989 .

[33]  F. Tang,et al.  Learning and incentive‐Compatible Mechanisms for Public Goods Provision: An Experimental Study , 1998, Journal of Political Economy.

[34]  Jeffrey O. Kephart,et al.  Dynamic pricing by software agents , 2000, Comput. Networks.

[35]  Elena Yanovskaya,et al.  Serial cost sharing , 2006 .

[36]  C. Granger Investigating causal relations by econometric models and cross-spectral methods , 1969 .

[37]  E. Kalai,et al.  Rational Learning Leads to Nash Equilibrium , 1993 .

[38]  R. Rob,et al.  Learning, Mutation, and Long Run Equilibria in Games , 1993 .

[39]  R. Sarin,et al.  Payoff Assessments without Probabilities: A Simple Dynamic Model of Choice , 1999 .