Inducing Equilibria via Incentives: Simultaneous Design-and-Play Ensures Global Convergence

To regulate a social system comprised of self-interested agents, economic incentives are often required to induce a desirable outcome. This incentive design problem naturally possesses a bilevel structure, in which a designer modifies the rewards of the agents with incentives while anticipating the response of the agents, who play a non-cooperative game that converges to an equilibrium. The existing bilevel optimization algorithms raise a dilemma when applied to this problem: anticipating how incentives affect the agents at equilibrium requires solving the equilibrium problem repeatedly, which is computationally inefficient; bypassing the time-consuming step of equilibrium-finding can reduce the computational cost, but may lead the designer to a sub-optimal solution. To address such a dilemma, we propose a method that tackles the designer's and agents' problems simultaneously in a single loop. Specifically, at each iteration, both the designer and the agents only move one step. Nevertheless, we allow the designer to gradually learn the overall influence of the incentives on the agents, which guarantees optimality after convergence. The convergence rate of the proposed scheme is also established for a broad class of games.
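The single-loop idea in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm: it assumes a hypothetical two-player quadratic game whose pseudo-gradient is F(x, theta) = A x - theta, a designer who wants the agents' actions to reach a given target, plain gradient steps for both sides, and a sensitivity dx*/dtheta = A^{-1} obtained from the implicit function theorem. The names `solve_2x2` and `single_loop_design_and_play`, the matrix A, and the step sizes are all illustrative assumptions.

```python
def solve_2x2(m, v):
    """Solve m @ z = v for a 2x2 matrix m using Cramer's rule."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [(d * v[0] - b * v[1]) / det, (a * v[1] - c * v[0]) / det]


def single_loop_design_and_play(target, alpha=0.5, beta=0.1, iters=2000):
    """Toy single-loop scheme: agents and designer each take one step per iteration.

    Assumed game (hypothetical): player i's cost gradient is (A x)_i - theta_i,
    so the equilibrium satisfies A x* = theta. The designer minimizes
    0.5 * ||x - target||^2 evaluated at the agents' current actions.
    """
    A = [[1.0, 0.5], [0.5, 1.0]]  # strongly monotone: eigenvalues 0.5 and 1.5
    A_T = [[A[0][0], A[1][0]], [A[0][1], A[1][1]]]
    x = [0.0, 0.0]      # agents' actions
    theta = [0.0, 0.0]  # designer's incentives

    for _ in range(iters):
        # Agents' pseudo-gradient at the current actions and incentives.
        F = [A[i][0] * x[0] + A[i][1] * x[1] - theta[i] for i in range(2)]
        # Designer's gradient estimate: (dx*/dtheta)^T (x - target) with
        # dx*/dtheta = A^{-1}, evaluated at the current (non-equilibrium) x.
        residual = [x[0] - target[0], x[1] - target[1]]
        hyper = solve_2x2(A_T, residual)
        # One step for the agents and one step for the designer.
        x = [x[i] - alpha * F[i] for i in range(2)]
        theta = [theta[i] - beta * hyper[i] for i in range(2)]
    return x, theta


if __name__ == "__main__":
    x, theta = single_loop_design_and_play([1.0, 2.0])
    print("actions:", x)
    print("incentives:", theta)
```

In this toy setting the only fixed point has the actions equal to the target and the incentives equal to A times the target, so neither side ever needs to solve the equilibrium problem to completion inside the loop. A game with constrained or probabilistic strategies would need different update rules than the plain gradient steps assumed here.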

Hoi-To Wai | Zhuoran Yang | Mingyi Hong | Zhaoran Wang | Jiayang Li | Y. Nie | Boyi Liu

[1] Zhaoran Wang, et al. Differentiable Bilevel Programming for Stackelberg Congestion Games, 2022, arXiv.

[2] Nicolas Loizou, et al. Stochastic Mirror Descent: Convergence Analysis and Adaptive Variants via the Mirror Stochastic Polyak Stepsize, 2021, arXiv.

[3] Deyu Meng, et al. Investigating Bi-Level Optimization for Learning and Vision From a Unified Perspective: A Survey and Beyond, 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4] E. Melo. On the uniqueness of quantal response equilibria and its application to network games, 2020, Economic Theory.

[5] Tuo Zhao, et al. Learning to Defend by Learning to Attack, 2018, AISTATS.

[6] W. Yin, et al. A Single-Timescale Stochastic Bilevel Optimization Method, 2021, arXiv.

[7] Jing Yu, et al. End-to-End Learning and Intervention in Games, 2020, NeurIPS.

[8] Zhaoran Wang, et al. A Two-Timescale Stochastic Algorithm Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic, 2020, SIAM J. Optim.

[9] Sergey Levine, et al. Meta-Learning with Implicit Gradients, 2019, NeurIPS.

[10] J. Z. Kolter, et al. Deep Equilibrium Models, 2019, NeurIPS.

[11] Sergio Valcarcel Macua, et al. Coordinating the Crowd: Inducing Desirable Equilibria in Non-Cooperative Systems, 2019, AAMAS.

[12] Nicolas Le Roux, et al. Understanding the impact of entropy on policy optimization, 2018, ICML.

[13] Zhengyuan Zhou, et al. Learning in games with continuous action sets and unknown payoff functions, 2016, Mathematical Programming.

[14] Renjie Liao, et al. Understanding Short-Horizon Bias in Stochastic Meta-Optimization, 2018, ICLR.

[15] Francesca Parise, et al. Sensitivity analysis for network aggregative games, 2017, 2017 IEEE 56th Annual Conference on Decision and Control (CDC).

[16] Sergey Levine, et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks, 2017, ICML.

[17] J. Zico Kolter, et al. OptNet: Differentiable Optimization as a Layer in Neural Networks, 2017, ICML.

[18] David Pfau, et al. Unrolled Generative Adversarial Networks, 2016, ICLR.

[19] D. M. V. Hesteren. Evolutionary Game Theory, 2017.

[20] Tapani Raiko, et al. Scalable Gradient-Based Tuning of Continuous Regularization Hyperparameters, 2015, ICML.

[21] Alexandre M. Bayen, et al. Convergence of mirror descent dynamics in the routing game, 2015, 2015 European Control Conference (ECC).

[22] Ryan P. Adams, et al. Gradient-based Hyperparameter Optimization through Reversible Learning, 2015, ICML.

[23] Francisco Facchinei, et al. Convex Optimization, Game Theory, and Variational Inequality Theory, 2010, IEEE Signal Processing Magazine.

[24] O. A. B. Space, et al. Equilibrium Points of Nonatomic Games, 2010.

[25] D. Weisbach, et al. The Design of a Carbon Tax, 2009.

[26] Patrice Marcotte, et al. An overview of bilevel optimization, 2007, Ann. Oper. Res.

[27] Donald W. Hearn, et al. An MPEC approach to second-best toll pricing, 2004, Math. Program.

[28] S. Bekhor, et al. Route Choice Models Used in the Stochastic User Equilibrium Problem: A Review, 2004.

[29] Erik T. Verhoef, et al. Second-Best Congestion Pricing in General Networks. Heuristic Algorithms for Finding Second-Best Optimal Toll Levels and Toll Points, 2002.

[30] David E. Boyce, et al. Route Flow Entropy Maximization in Origin-Based Traffic Assignment, 1999.

[31] Martin L. Hazelton. Some Remarks on Stochastic User Equilibrium, 1998.

[32] Bethany L. Nicholson, et al. Mathematical Programs with Equilibrium Constraints, 2021, Pyomo — Optimization Modeling in Python.

[33] J. Weibull, et al. Nash Equilibrium and Evolution by Imitation, 1994.

[34] E. Maskin. The Invisible Hand and Externalities, 1994.

[35] Till Requate, et al. Pollution control in a Cournot duopoly via taxes or permits, 1993.

[36] A. Nagurney. Network Economics: A Variational Inequality Approach, 1992.

[37] Terry L. Friesz, et al. Hierarchical optimization: An introduction, 1992, Ann. Oper. Res.

[38] Terry L. Friesz, et al. Sensitivity analysis based heuristic algorithms for mathematical programs with variational inequality constraints, 1990, Math. Program.

[39] J. Pang, et al. Existence of optimal solutions to mathematical programs with equilibrium constraints, 1988.

[40] B. Greenwald, et al. Externalities in Economies with Imperfect Information and Incomplete Markets, 1986.

[41] Stella Dafermos, et al. An iterative scheme for variational inequalities, 1983, Math. Program.

[42] John Darzentas, et al. Problem Complexity and Method Efficiency in Optimization, 1983.

[43] James J. Valone, et al. Free to Choose: A Personal Statement, 1980.

[44] Stella Dafermos, et al. Traffic Equilibrium and Variational Inequalities, 1980.

[45] Jerome Bracken, et al. Mathematical Programs with Optimization Problems in the Constraints, 1973, Oper. Res.

[46] W. Vickrey. Congestion Theory and Transport Investment, 1969.

[47] Heinrich von Stackelberg, et al. The Theory of the Market Economy, translated from the German and with an introduction by Alan T. Peacock, 1953.

[48] J. G. Wardrop, et al. Some Theoretical Aspects of Road Traffic Research, 1952.