Author's Personal Copy Games and Economic Behavior a Payoff-based Learning Procedure and Its Application to Traffic Games

A stochastic process that describes a payoff-based learning procedure and the associated adaptive behavior of players in a repeated game is considered. The process is shown to converge almost surely towards a stationary state which is characterized as an equilibrium for a related game. The analysis is based on techniques borrowed from the theory of stochastic algorithms and proceeds by studying an associated continuous dynamical system which represents the evolution of the players' evaluations. An application to the case of finitely many users in a congested traffic network with parallel links is considered. Alternative descriptions for the dynamics and the corresponding rest points are discussed, including a Lagrangian representation.

[1]  W. Brian Arthur,et al.  On designing economic agents that behave like human agents , 1993 .

[2]  Thomas Pitz,et al.  Experiments and Simulations on Day-to-Day Route Choice-Behaviour , 2003, SSRN Electronic Journal.

[3]  Michael J. Smith,et al.  The Stability of a Dynamic Model of Traffic Assignment - An Application of a Method of Lyapunov , 1984, Transp. Sci..

[4]  A. Roth,et al.  Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria , 1998 .

[5]  John Duffy,et al.  Learning, information, and sorting in market entry games: theory and evidence , 2005, Games Econ. Behav..

[6]  Carlos F. Daganzo,et al.  On Stochastic Models of Traffic Assignment , 1977 .

[7]  Robert B. Dial,et al.  A PROBABILISTIC MULTIPATH TRAFFIC ASSIGNMENT MODEL WHICH OBVIATES PATH ENUMERATION. IN: THE AUTOMOBILE , 1971 .

[8]  Philip Wolfe,et al.  Contributions to the theory of games , 1953 .

[9]  J. G. Wardrop,et al.  Some Theoretical Aspects of Road Traffic Research , 1952 .

[10]  Joseph N. Prashker,et al.  The Impact of Travel Time Information on Travelers’ Learning under Uncertainty , 2006 .

[11]  William H. Sandholm,et al.  Potential Games with Continuous Player Sets , 2001, J. Econ. Theory.

[12]  Werner Güth,et al.  Equilibrium Point Selection in a Class of Market Entry Games , 1982 .

[13]  Thomas Pitz,et al.  Commuters route choice behaviour , 2007, Games Econ. Behav..

[14]  E. Cascetta A stochastic process approach to the analysis of temporal dynamics in transportation networks , 1989 .

[15]  William H. Sandholm,et al.  ON THE GLOBAL CONVERGENCE OF STOCHASTIC FICTITIOUS PLAY , 2002 .

[16]  William H. Sandholm,et al.  Evolutionary Implementation and Congestion Pricing , 2002 .

[17]  Harold J. Kushner,et al.  Stochastic Approximation Algorithms and Applications , 1997, Applications of Mathematics.

[18]  Martin Posch,et al.  Cycling in a stochastic learning algorithm for normal form games , 1997 .

[19]  O. H. Brownlee,et al.  ACTIVITY ANALYSIS OF PRODUCTION AND ALLOCATION , 1952 .

[20]  M. Benaïm Dynamics of stochastic approximation algorithms , 1999 .

[21]  Terry L. Friesz,et al.  Day-To-Day Dynamic Network Disequilibria and Idealized Traveler Information Systems , 1994, Oper. Res..

[22]  Gary A. Davis,et al.  Large Population Approximations of a General Stochastic Traffic Assignment Model , 1993, Oper. Res..

[23]  J. Horowitz The stability of stochastic equilibrium in a two-link transportation network , 1984 .

[24]  Alan W. Beggs,et al.  On the convergence of reinforcement learning , 2005, J. Econ. Theory.

[25]  Tilman Börgers,et al.  Learning Through Reinforcement and Replicator Dynamics , 1997 .

[26]  Y. Freund,et al.  The non-stochastic multi-armed bandit problem , 2001 .

[27]  Josef Hofbauer,et al.  Stochastic Approximations and Differential Inclusions , 2005, SIAM J. Control. Optim..

[28]  D. Fudenberg,et al.  The Theory of Learning in Games , 1998 .

[29]  L. Shapley,et al.  REGULAR ARTICLEPotential Games , 1996 .

[30]  R. Vohra,et al.  Calibrated Learning and Correlated Equilibrium , 1996 .

[31]  Y. Freund,et al.  Adaptive game playing using multiplicative weights , 1999 .

[32]  Josef Hofbauer,et al.  Stochastic Approximations and Differential Inclusions, Part II: Applications , 2006, Math. Oper. Res..

[33]  S. Hart,et al.  A Reinforcement Procedure Leading to Correlated Equilibrium , 2001 .

[34]  H. Peyton Young,et al.  Strategic Learning and Its Limits , 2004 .

[35]  Jean-François Laslier,et al.  A Behavioral Learning Process in Games , 2001, Games Econ. Behav..

[36]  M. Hirsch,et al.  Mixed Equilibria and Dynamical Systems Arising from Fictitious Play in Perturbed Games , 1999 .

[37]  J. Robinson AN ITERATIVE METHOD OF SOLVING A GAME , 1951, Classics in Game Theory.

[38]  L. Shapley,et al.  Potential Games , 1994 .

[39]  Giulio Erberto Cantarella,et al.  Dynamic Processes and Equilibrium in Transportation Networks: Towards a Unifying Theory , 1995, Transp. Sci..

[40]  R. Rosenthal A class of games possessing pure-strategy Nash equilibria , 1973 .

[41]  S. Hart Adaptive Heuristics , 2005 .

[42]  R. McKelvey,et al.  Quantal Response Equilibria for Normal Form Games , 1995 .