Revisiting log-linear learning: Asynchrony, completeness and payoff-based implementation

The theory of learning in games has sought to understand how and why equilibria emerge in non-cooperative games. Traditionally, social science literature develops descriptive game theoretic models for players, analyzes the limiting behavior, and generalizes the results for larger classes of games. Recently, there has been a significant amount of research seeking to understand these behavioral models not from a descriptive point of view, but rather from a prescriptive point of view [1]–[4]. The goal is to use these behavioral models as a prescriptive control approach in distributed multi-agent systems where the guaranteed limiting behavior would represent a desirable operating condition.

[1]  Jason R. Marden,et al.  Autonomous Vehicle-Target Assignment: A Game-Theoretical Formulation , 2007 .

[2]  Jason R. Marden,et al.  Cooperative Control and Potential Games , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[3]  Devavrat Shah,et al.  Dynamics in congestion games , 2010, SIGMETRICS '10.

[4]  L. Shapley,et al.  Potential Games , 1994 .

[5]  Dean Phillips Foster,et al.  Regret Testing: Learning to Play Nash Equilibrium Without Knowing You Have an Opponent , 2006 .

[6]  Connections between cooperative control and potential games illustrated on the consensus problem , 2007, 2007 European Control Conference (ECC).

[7]  H. Young Individual Strategy and Social Structure , 2020 .

[8]  Andrea Montanari,et al.  The spread of innovations in social networks , 2010, Proceedings of the National Academy of Sciences.

[9]  Jason R. Marden,et al.  Payoff-Based Dynamics for Multiplayer Weakly Acyclic Games , 2009, SIAM J. Control. Optim..

[10]  Mark Voorneveld,et al.  Best-response potential games , 2000 .

[11]  L. Shapley,et al.  REGULAR ARTICLEPotential Games , 1996 .

[12]  Adrian Vetta,et al.  Nash equilibria in competitive societies, with applications to facility location, traffic routing and auctions , 2002, The 43rd Annual IEEE Symposium on Foundations of Computer Science, 2002. Proceedings..

[13]  L. Blume The Statistical Mechanics of Strategic Interaction , 1993 .

[14]  H. Peyton Young,et al.  Learning by trial and error , 2009, Games Econ. Behav..

[15]  Yakov Babichenko,et al.  Completely uncoupled dynamics and Nash equilibria , 2012, Games Econ. Behav..

[16]  L. Shapley,et al.  Stochastic Games* , 1953, Proceedings of the National Academy of Sciences.

[17]  Nick Netzer,et al.  The logit-response dynamics , 2010, Games Econ. Behav..

[18]  Shie Mannor,et al.  Multi-agent learning for engineers , 2007, Artif. Intell..

[19]  Tim Roughgarden,et al.  Selfish routing and the price of anarchy , 2005 .

[20]  Amin Saberi,et al.  On the Inefficiency Ratio of Stable Equilibria in Congestion Games , 2009, WINE.

[21]  Jason R. Marden,et al.  Regret based dynamics: convergence in weakly acyclic games , 2007, AAMAS '07.

[22]  Jason R. Marden,et al.  Payoff based dynamics for multi-player weakly acyclic games , 2007, 2007 46th IEEE Conference on Decision and Control.

[23]  Fabrizio Germano,et al.  Global Nash Convergence of Foster and Young's Regret Testing , 2004, Games Econ. Behav..

[24]  Alan Beggs Waiting times and equilibrium selection , 2005 .

[25]  Yoav Shoham,et al.  If multi-agent learning is the answer, what is the question? , 2007, Artif. Intell..

[26]  Lawrence E. Blume,et al.  How noise matters , 2003, Games Econ. Behav..

[27]  L. Blume,et al.  POPULATION GAMES , 1995 .

[28]  H. Peyton Young,et al.  Strategic Learning and Its Limits , 2004 .

[29]  Avrim Blum,et al.  Routing without regret: on convergence to nash equilibria of regret-minimizing algorithms in routing games , 2006, PODC '06.

[30]  G. Lugosi,et al.  Global Nash Convergence of Foster and Young's Regret Testing , 2004 .

[31]  W. Arthur,et al.  The Economy as an Evolving Complex System II , 1988 .

[32]  H. Young,et al.  The Evolution of Conventions , 1993 .

[33]  L. Shapley,et al.  Fictitious Play Property for Games with Identical Interests , 1996 .

[34]  Jason R. Marden,et al.  Joint Strategy Fictitious Play with Inertia for Potential Games , 2005, Proceedings of the 44th IEEE Conference on Decision and Control.