Learning in Experimental 2 X 2 Games

In this paper, we introduce two new learning models: impulse-matching learning and action-sampling learning. These two models together with the models of self-tuning EWA and reinforcement learning are applied to 12 different 2 x 2 games and their results are compared with the results from experimental data. We test whether the models are capable of replicating the aggregate distribution of behavior, as well as correctly predicting individuals' round-by-round behavior. Our results are two-fold: while the simulations with impulse-matching and action-sampling learning successfully replicate the experimental data on the aggregate level, individual behavior is best described by self-tuning EWA. Nevertheless, impulse-matching learning has the second highest score for the individual data. In addition, only self-tuning EWA and impulse-matching learning lead to better round-by-round predictions than the aggregate frequencies, which means they adjust their predictions correctly over time.

[1]  Wei Chen,et al.  Bounded Memory, Inertia, Sampling and Weighting Model for Market Entry Games , 2011, Games.

[2]  Sebastian J. Goerg,et al.  Experimental investigation of stationary concepts in cyclic duopoly games , 2009 .

[3]  A. Roth,et al.  Learning in Extensive-Form Games: Experimental Data and Simple Dynamic Models in the Intermediate Term* , 1995 .

[4]  Timothy C. Salmon An Evaluation of Econometric Models of Adaptive Learning , 2001 .

[5]  G. Brier VERIFICATION OF FORECASTS EXPRESSED IN TERMS OF PROBABILITY , 1950 .

[6]  Ben Polak,et al.  Fictitious play in 2×2 games: A geometric proof of convergence , 1994 .

[7]  R. Selten,et al.  End behavior in sequences of finite prisoner's dilemma supergames , 1986 .

[8]  Colin Camerer,et al.  Experience‐weighted Attraction Learning in Normal Form Games , 1999 .

[9]  R. Selten,et al.  Stationary Concepts for Experimental 2x2 Games , 2008 .

[10]  E. Kalai,et al.  Rational Learning Leads to Nash Equilibrium , 1993 .

[11]  A. Roth,et al.  Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria , 1998 .

[12]  Yan Chen,et al.  When Does Learning in Games Generate Convergence to Nash Equilibria? The Role of Supermodularity in an Experimental Setting ⁄ , 2004 .

[13]  Robert W. Rosenthal,et al.  Testing the Minimax Hypothesis: A Re-examination of O'Neill's Game Experiment , 1990 .

[14]  Colin Camerer,et al.  Stationary Concepts for Experimental 2 X 2 Games: Comment , 2011 .

[15]  Sebastian J. Goerg,et al.  Stationary Concepts for Experimental 2 X 2 Games: Reply , 2011 .

[16]  A. Tversky,et al.  Prospect theory: an analysis of decision under risk — Source link , 2007 .

[17]  R. McKelvey,et al.  Quantal Response Equilibria for Normal Form Games , 1995 .

[18]  Alvin E. Roth,et al.  A Choice Prediction Competition for Market Entry Games: An Introduction , 2010, Games.

[19]  Klaus Abbink,et al.  RatImage - research Assistance Toolbox for Computer-Aided Human Behavior Experiments , 1995 .

[20]  R. Selten Axiomatic Characterization of the Quadratic Scoring Rule , 1998 .

[21]  Daniel Friedman,et al.  Individual Learning in Normal Form Games: Some Laboratory Results☆☆☆ , 1997 .

[22]  Ido Erev,et al.  Generality, repetition, and the role of descriptive learning models , 2005 .

[23]  Alan W. Beggs,et al.  On the convergence of reinforcement learning , 2005, J. Econ. Theory.

[24]  Teck-Hua Ho,et al.  Self-tuning experience weighted attraction learning in games , 2007, J. Econ. Theory.

[25]  D. Marchiori,et al.  Predicting Human Interactive Learning by Regret-Driven Neural Networks , 2008, Science.

[26]  E. Hopkins Two Competing Models of How People Learn in Games (first version) , 1999 .

[27]  R. Selten,et al.  Experimental Sealed Bid First Price Auctions with Directly Observed Bid Functions , 1994 .

[28]  A. Rubinstein,et al.  Games with Procedurally Rational Players , 1997 .

[29]  C. Harley Learning the evolutionarily stable strategy. , 1981, Journal of theoretical biology.