论文信息 - Revisiting log-linear learning: Asynchrony, completeness and payoff-based implementation

Revisiting log-linear learning: Asynchrony, completeness and payoff-based implementation

The theory of learning in games has sought to understand how and why equilibria emerge in non-cooperative games. Traditionally, social science literature develops descriptive game theoretic models for players, analyzes the limiting behavior, and generalizes the results for larger classes of games. Recently, there has been a significant amount of research seeking to understand these behavioral models not from a descriptive point of view, but rather from a prescriptive point of view [1]–[4]. The goal is to use these behavioral models as a prescriptive control approach in distributed multi-agent systems where the guaranteed limiting behavior would represent a desirable operating condition.

Jason R. Marden | Jeff S. Shamma | J. Shamma

[1] Jason R. Marden,et al. Autonomous Vehicle-Target Assignment: A Game-Theoretical Formulation , 2007 .

[2] Jason R. Marden,et al. Cooperative Control and Potential Games , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[3] Devavrat Shah,et al. Dynamics in congestion games , 2010, SIGMETRICS '10.

[4] L. Shapley,et al. Potential Games , 1994 .

[5] Dean Phillips Foster,et al. Regret Testing: Learning to Play Nash Equilibrium Without Knowing You Have an Opponent , 2006 .

[6] Connections between cooperative control and potential games illustrated on the consensus problem , 2007, 2007 European Control Conference (ECC).

[7] H. Young. Individual Strategy and Social Structure , 2020 .

[8] Andrea Montanari,et al. The spread of innovations in social networks , 2010, Proceedings of the National Academy of Sciences.

[9] Jason R. Marden,et al. Payoff-Based Dynamics for Multiplayer Weakly Acyclic Games , 2009, SIAM J. Control. Optim..

[10] Mark Voorneveld,et al. Best-response potential games , 2000 .

[11] L. Shapley,et al. REGULAR ARTICLEPotential Games , 1996 .

[12] Adrian Vetta,et al. Nash equilibria in competitive societies, with applications to facility location, traffic routing and auctions , 2002, The 43rd Annual IEEE Symposium on Foundations of Computer Science, 2002. Proceedings..

[13] L. Blume. The Statistical Mechanics of Strategic Interaction , 1993 .

[14] H. Peyton Young,et al. Learning by trial and error , 2009, Games Econ. Behav..

[15] Yakov Babichenko,et al. Completely uncoupled dynamics and Nash equilibria , 2012, Games Econ. Behav..

[16] L. Shapley,et al. Stochastic Games* , 1953, Proceedings of the National Academy of Sciences.

[17] Nick Netzer,et al. The logit-response dynamics , 2010, Games Econ. Behav..

[18] Shie Mannor,et al. Multi-agent learning for engineers , 2007, Artif. Intell..

[19] Tim Roughgarden,et al. Selfish routing and the price of anarchy , 2005 .

[20] Amin Saberi,et al. On the Inefficiency Ratio of Stable Equilibria in Congestion Games , 2009, WINE.

[21] Jason R. Marden,et al. Regret based dynamics: convergence in weakly acyclic games , 2007, AAMAS '07.

[22] Jason R. Marden,et al. Payoff based dynamics for multi-player weakly acyclic games , 2007, 2007 46th IEEE Conference on Decision and Control.

[23] Fabrizio Germano,et al. Global Nash Convergence of Foster and Young's Regret Testing , 2004, Games Econ. Behav..

[24] Alan Beggs. Waiting times and equilibrium selection , 2005 .

[25] Yoav Shoham,et al. If multi-agent learning is the answer, what is the question? , 2007, Artif. Intell..

[26] Lawrence E. Blume,et al. How noise matters , 2003, Games Econ. Behav..

[27] L. Blume,et al. POPULATION GAMES , 1995 .

[28] H. Peyton Young,et al. Strategic Learning and Its Limits , 2004 .

[29] Avrim Blum,et al. Routing without regret: on convergence to nash equilibria of regret-minimizing algorithms in routing games , 2006, PODC '06.

[30] G. Lugosi,et al. Global Nash Convergence of Foster and Young's Regret Testing , 2004 .

[31] W. Arthur,et al. The Economy as an Evolving Complex System II , 1988 .

[32] H. Young,et al. The Evolution of Conventions , 1993 .

[33] L. Shapley,et al. Fictitious Play Property for Games with Identical Interests , 1996 .

[34] Jason R. Marden,et al. Joint Strategy Fictitious Play with Inertia for Potential Games , 2005, Proceedings of the 44th IEEE Conference on Decision and Control.