Selecting efficient correlated equilibria through distributed learning

A learning rule is completely uncoupled if each player's behavior is conditioned only on his own realized payoffs, and does not need to know the actions or payoffs of anyone else. We demonstrate a simple, completely uncoupled learning rule such that, in any finite normal form game with generic payoffs, the players' realized strategies implements a socially optimal coarse correlated (Hannan) equilibrium a very high proportion of the time. That is, the empirical frequency associated with the players' collective behavior will be consistent with a socially optimal coarse correlated equilibrium. A variant of the rule implements a socially optimal correlated equilibrium a very high proportion of the time.

[1]  H. Peyton Young,et al.  Learning by trial and error , 2009, Games Econ. Behav..

[2]  Tim Roughgarden,et al.  Computing equilibria in multi-player games , 2005, SODA '05.

[3]  Jason R. Marden,et al.  Payoff-Based Dynamics for Multiplayer Weakly Acyclic Games , 2009, SIAM J. Control. Optim..

[4]  S. Hart,et al.  A simple adaptive procedure leading to correlated equilibrium , 2000 .

[5]  H. Young,et al.  The Evolution of Conventions , 1993 .

[6]  Drew Fudenberg,et al.  The Folk Theorem in Repeated Games with Discounting or with Incomplete Information , 1986 .

[7]  H. Peyton Young,et al.  Stochastic Evolutionary Game Dynamics , 1990 .

[8]  Eitan Altman,et al.  Correlated Equilibrium in Access Control for Wireless Communications , 2006, Networking.

[9]  S. Hart,et al.  Uncoupled Dynamics Do Not Lead to Nash Equilibrium , 2003 .

[10]  H. Peyton Young,et al.  Learning efficient Nash equilibria in distributed systems , 2012, Games Econ. Behav..

[11]  Y. Ho,et al.  Value of information in two-team zero-sum problems , 1974 .

[12]  Yakov Babichenko,et al.  Average Testing and the Efficient Boundary , 2011 .

[13]  Kevin Leyton-Brown,et al.  Polynomial-time computation of exact correlated equilibrium in compact games , 2010, EC '11.

[14]  R. Vohra,et al.  Calibrated Learning and Correlated Equilibrium , 1996 .

[15]  Dean Phillips Foster,et al.  Regret Testing: Learning to Play Nash Equilibrium Without Knowing You Have an Opponent , 2006 .

[16]  Jason R. Marden,et al.  Learning efficient correlated equilibria , 2014, 53rd IEEE Conference on Decision and Control.

[17]  D. Fudenberg,et al.  The Theory of Learning in Games , 1998 .

[18]  Jason R. Marden,et al.  Achieving Pareto Optimality Through Distributed Learning , 2014, SIAM J. Control. Optim..

[19]  Zhu Han,et al.  Peer-to-peer file sharing game using correlated equilibrium , 2009, 2009 43rd Annual Conference on Information Sciences and Systems.

[20]  G. Lugosi,et al.  Global Nash Convergence of Foster and Young's Regret Testing , 2004 .