Multiplicative updates outperform generic no-regret learning in congestion games: extended abstract

We study the outcome of natural learning algorithms in atomic congestion games. Atomic congestion games have a wide variety of equilibria, often with vastly differing social costs. We show that in almost all such games, the well-known multiplicative-weights learning algorithm results in convergence to pure equilibria. Our results show that natural learning behavior can avoid the bad outcomes predicted by the price of anarchy in atomic congestion games such as the load-balancing game introduced by Koutsoupias and Papadimitriou, which has super-constant price of anarchy and has correlated equilibria that are exponentially worse than any mixed Nash equilibrium. Our results identify a set of mixed Nash equilibria that we call weakly stable equilibria. Our notion of weak stability is defined game-theoretically, but we show that this property holds whenever a stability criterion from the theory of dynamical systems is satisfied. This allows us to show that in every congestion game, the distribution of play converges to the set of weakly stable equilibria. Pure Nash equilibria are weakly stable, and we show using techniques from algebraic geometry that the converse is true with probability 1 when congestion costs are selected at random independently on each edge (from any monotonically parametrized distribution). We further extend our results to show that players can use algorithms with different (sufficiently small) learning rates, i.e., they can trade off convergence speed and long-term average regret differently.
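
To make the convergence claim concrete, the following is a minimal sketch of multiplicative-weights learning in a tiny load-balancing game. It is an illustration under stated assumptions, not the paper's construction: the machines are identical with linear latency (a machine's cost equals its load), each player updates on expected costs, and the update rule p(j) ∝ p(j)·(1 − ε)^cost(j) is one standard form of multiplicative weights. The function names, the learning rate `eps=0.1`, the step count, and the starting strategies are all hypothetical choices made for this example.

```python
def expected_costs(strategies, player):
    """Expected cost of each machine for `player`, given the others' mixed strategies.

    Assumption: identical machines with linear latency (cost = load).
    """
    num_machines = len(strategies[player])
    costs = []
    for j in range(num_machines):
        others_load = sum(p[j] for k, p in enumerate(strategies) if k != player)
        costs.append(1.0 + others_load)  # the player itself adds one unit of load
    return costs


def mwu_step(strategies, eps):
    """One simultaneous multiplicative-weights update for all players."""
    updated = []
    for i, probs in enumerate(strategies):
        costs = expected_costs(strategies, i)
        # Scale each probability down exponentially in the cost it incurred.
        weights = [p * (1.0 - eps) ** c for p, c in zip(probs, costs)]
        total = sum(weights)
        updated.append([w / total for w in weights])
    return updated


def run_mwu(strategies, eps=0.1, steps=500):
    """Iterate the update for a fixed number of rounds (illustrative values)."""
    for _ in range(steps):
        strategies = mwu_step(strategies, eps)
    return strategies


def is_pure_nash(assignment, num_machines):
    """True if no player can lower its cost by moving to another machine."""
    loads = [assignment.count(j) for j in range(num_machines)]
    return all(loads[a] <= loads[j] + 1
               for a in assignment for j in range(num_machines))


# Two players, two machines, starting slightly off the symmetric mixed equilibrium.
final = run_mwu([[0.6, 0.4], [0.5, 0.5]])
assignment = [max(range(len(p)), key=p.__getitem__) for p in final]
```

In this toy run the symmetric mixed equilibrium, where both players randomize uniformly, acts as a saddle point of the dynamics: the small initial asymmetry grows until the two players settle on different machines, which is a pure Nash equilibrium. Starting both players from exactly the same strategy would keep the sketch on the symmetric path, so the starting point matters for this illustration.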

Éva Tardos | Georgios Piliouras | Robert D. Kleinberg

[1] J. Milnor. Topology from the differentiable viewpoint, 1965.

[2] Peter Secretan. Learning, 1965, Mental Health.

[3] R. Rosenthal. A class of games possessing pure-strategy Nash equilibria, 1973.

[4] I. Shafarevich. Basic algebraic geometry, 1974.

[5] J. M. Smith, et al. Evolution and the theory of games, 1976.

[6] D. E. Matthews. Evolution and the Theory of Games, 1977.

[7] J. Steele. An Efron-Stein inequality for nonsymmetric statistics, 1986.

[8] Manfred K. Warmuth, et al. The weighted majority algorithm, 1989, 30th Annual Symposium on Foundations of Computer Science.

[9] E. Akin. The Differential Geometry of Population Genetics and Evolutionary Games, 1990.

[10] L. Perko. Differential Equations and Dynamical Systems, 1991.

[11] L. Samuelson, et al. Evolutionary Stability in Asymmetric Games, 1992.

[12] Manfred K. Warmuth, et al. The Weighted Majority Algorithm, 1994, Inf. Comput.

[13] Jörgen W. Weibull, et al. Evolutionary Game Theory, 1996.

[14] I. Shafarevich, et al. Basic algebraic geometry 1 (2nd, revised and expanded ed.), 1994.

[15] L. Shapley, et al. Potential Games, 1994.

[16] J. Weibull, et al. Evolutionary Selection in Normal-Form Games, 1995.

[17] L. Shapley, et al. Fictitious Play Property for Games with Identical Interests, 1996.

[18] Dean P. Foster, et al. Calibrated Learning and Correlated Equilibrium, 1997.

[19] S. Hart, et al. A simple adaptive procedure leading to correlated equilibrium, 2000.

[20] Josef Hofbauer, et al. Evolutionary Games and Population Dynamics, 1998.

[21] D. Fudenberg, et al. The Theory of Learning in Games, 1998.

[22] Y. Freund, et al. Adaptive game playing using multiplicative weights, 1999.

[23] Christos H. Papadimitriou, et al. Worst-case Equilibria, 1999, STACS.

[24] Christos H. Papadimitriou, et al. Worst-case equilibria, 1999.

[25] William H. Sandholm, et al. Potential Games with Continuous Player Sets, 2001, J. Econ. Theory.

[26] H. Peyton Young, et al. Learning, hypothesis testing, and Nash equilibrium, 2003, Games Econ. Behav.

[27] Sham M. Kakade, et al. Deterministic calibration and Nash equilibrium, 2004, J. Comput. Syst. Sci.

[28] Berthold Vöcking, et al. On the Evolution of Selfish Routing, 2004, ESA.

[29] Stochastic Uncoupled Dynamics and Nash Equilibrium, 2004.

[30] Vahab S. Mirrokni, et al. Convergence Issues in Competitive Games, 2004, APPROX-RANDOM.

[31] Vahab S. Mirrokni, et al. Sink equilibria and convergence, 2005, 46th Annual IEEE Symposium on Foundations of Computer Science (FOCS'05).

[32] Yishay Mansour, et al. Fast convergence of selfish rerouting, 2005, SODA '05.

[33] Dean Phillips Foster, et al. Regret Testing: Learning to Play Nash Equilibrium Without Knowing You Have an Opponent, 2006.

[34] Berthold Vöcking, et al. Fast convergence to Wardrop equilibria by adaptive sampling methods, 2006, STOC '06.

[35] Andreu Mas-Colell, et al. Stochastic Uncoupled Dynamics and Nash Equilibrium, 2004, Games Econ. Behav.

[36] Avrim Blum, et al. Routing without regret: on convergence to Nash equilibria of regret-minimizing algorithms in routing games, 2006, PODC '06.

[37] Y. Mansour, et al. Algorithmic Game Theory: Learning, Regret Minimization, and Equilibria, 2007.

[38] Fabrizio Germano, et al. Global Nash Convergence of Foster and Young's Regret Testing, 2004, Games Econ. Behav.

[39] Berthold Vöcking. Algorithmic Game Theory: Selfish Load Balancing, 2007.

[40] T. Roughgarden. Algorithmic Game Theory: Routing Games, 2007.

[41] Mohammad Taghi Hajiaghayi, et al. Regret minimization and the price of total anarchy, 2008, STOC.

[42] C. Cannings, et al. Evolutionary Game Theory, 2010.

[43] William H. Sandholm, et al. Population Games And Evolutionary Dynamics, 2010, Economic learning and social evolution.

[44] Y. Mansour, et al. Regret Minimization, and Equilibria, 2022.