Strategic Control of Myopic Best Reply in Repeated Games

How can a rational player strategically control a myopic best reply player in a repeated two-player game? We show that in games with strategic substitutes or strategic complements the optimal control strategy is monotone in the initial action of the opponent, in time periods, and in the discount rate. As an interesting example outside this class of games we present a repeated ``textbook-like'' Cournot duopoly with non-negative prices and show that the optimal control strategy involves a cycle.

[1]  Burkhard C. Schipper,et al.  Rage Against the Machines: How Subjects Learn to Play Against Computers , 2005 .

[2]  Wei-Torng Juang,et al.  Rule Evolution and Equilibrium Selection , 2002, Games Econ. Behav..

[3]  Peter Duersch,et al.  Unbeatable Imitation , 2010, Games Econ. Behav..

[4]  X. Vives Oligopoly Pricing: Old Ideas and New Tools , 1999 .

[5]  Donald M. Topkis,et al.  Minimizing a Submodular Function on a Lattice , 1978, Oper. Res..

[6]  Robert W. Rosenthal,et al.  Experimentation, Imitation, and Stochastic Stability , 1999 .

[7]  R. Amir Cournot Oligopoly and the Theory of Supermodular Games , 1996 .

[8]  W. Pyle A Theory of Learning. , 1924 .

[9]  Teck-Hua Ho,et al.  A learning-based model of repeated games with incomplete information , 2005, Games Econ. Behav..

[10]  Burkhard Hehenkamp,et al.  Imitators and Optimizers in a Changing Environment , 2008 .

[11]  Masaki Aoyagi,et al.  Evolution of Beliefs and the Nash Equilibrium of Normal Form Games , 1996 .

[12]  Burkhard C. Schipper Imitators and Optimizers in Cournot Oligopoly , 2007 .

[13]  Jan Tuinstra,et al.  Endogenous fluctuations under evolutionary pressure in Cournot competition , 2002, Games Econ. Behav..

[14]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[15]  Glenn Ellison,et al.  Learning from Personal Experience: One Rational Guy and the Justification of Myopia , 1997 .

[16]  Rabah Amir,et al.  Sensitivity analysis of multisector optimal economic dynamics , 1991 .

[17]  D. Fudenberg,et al.  Efficiency and Observability with Long-Run and Short-Run Players , 1994 .

[18]  D. Fudenberg,et al.  The Theory of Learning in Games , 1998 .

[19]  D. Fudenberg,et al.  Reputation and Equilibrium Selection in Games with a Patient Player , 1989 .

[20]  Teck-Hua Ho,et al.  Sophisticated Experience-Weighted Attraction Learning and Strategic Teaching in Repeated Games , 2002, J. Econ. Theory.

[21]  Paul R. Milgrom,et al.  Rationalizability, Learning, and Equilibrium in Games with Strategic Complementarities , 1990 .

[22]  Paul R. Milgrom,et al.  Monotone Comparative Statics , 1994 .

[23]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[24]  Drew Fudenberg,et al.  Repeated Games with Long-run and Short-run Players , 1990 .