Inducing Equilibria via Incentives: Simultaneous Design-and-Play Ensures Global Convergence

To regulate a social system comprised of self-interested agents, economic incentives are often required to induce a desirable outcome. This incentive design problem naturally possesses a bilevel structure, in which a designer modifies the rewards of the agents with incentives while anticipating the response of the agents, who play a non-cooperative game that converges to an equilibrium. The existing bilevel optimization algorithms raise a dilemma when applied to this problem: anticipating how incentives affect the agents at equilibrium requires solving the equilibrium problem repeatedly, which is computationally inefficient; bypassing the time-consuming step of equilibrium-finding can reduce the computational cost, but may lead the designer to a sub-optimal solution. To address such a dilemma, we propose a method that tackles the designer’s and agents’ problems simultaneously in a single loop. Specifically, at each iteration, both the designer and the agents only move one step. Nevertheless, we allow the designer to gradually learn the overall influence of the incentives on the agents, which guarantees optimality after convergence. The convergence rate of the proposed scheme is also established for a broad class of games.

[1]  Zhaoran Wang,et al.  Differentiable Bilevel Programming for Stackelberg Congestion Games , 2022, arXiv.org.

[2]  Nicolas Loizou,et al.  Stochastic Mirror Descent: Convergence Analysis and Adaptive Variants via the Mirror Stochastic Polyak Stepsize , 2021, ArXiv.

[3]  Deyu Meng,et al.  Investigating Bi-Level Optimization for Learning and Vision From a Unified Perspective: A Survey and Beyond , 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  E. Melo On the uniqueness of quantal response equilibria and its application to network games , 2020, Economic Theory.

[5]  Tuo Zhao,et al.  Learning to Defend by Learning to Attack , 2018, AISTATS.

[6]  W. Yin,et al.  A Single-Timescale Stochastic Bilevel Optimization Method , 2021, ArXiv.

[7]  Jing Yu,et al.  End-to-End Learning and Intervention in Games , 2020, NeurIPS.

[8]  Zhaoran Wang,et al.  A Two-Timescale Stochastic Algorithm Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic , 2020, SIAM J. Optim..

[9]  Sergey Levine,et al.  Meta-Learning with Implicit Gradients , 2019, NeurIPS.

[10]  J. Z. Kolter,et al.  Deep Equilibrium Models , 2019, NeurIPS.

[11]  Sergio Valcarcel Macua,et al.  Coordinating the Crowd: Inducing Desirable Equilibria in Non-Cooperative Systems , 2019, AAMAS.

[12]  Nicolas Le Roux,et al.  Understanding the impact of entropy on policy optimization , 2018, ICML.

[13]  Zhengyuan Zhou,et al.  Learning in games with continuous action sets and unknown payoff functions , 2016, Mathematical Programming.

[14]  Renjie Liao,et al.  Understanding Short-Horizon Bias in Stochastic Meta-Optimization , 2018, ICLR.

[15]  Francesca Parise,et al.  Sensitivity analysis for network aggregative games , 2017, 2017 IEEE 56th Annual Conference on Decision and Control (CDC).

[16]  Sergey Levine,et al.  Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.

[17]  J. Zico Kolter,et al.  OptNet: Differentiable Optimization as a Layer in Neural Networks , 2017, ICML.

[18]  David Pfau,et al.  Unrolled Generative Adversarial Networks , 2016, ICLR.

[19]  D. M. V. Hesteren Evolutionary Game Theory , 2017 .

[20]  Tapani Raiko,et al.  Scalable Gradient-Based Tuning of Continuous Regularization Hyperparameters , 2015, ICML.

[21]  Alexandre M. Bayen,et al.  Convergence of mirror descent dynamics in the routing game , 2015, 2015 European Control Conference (ECC).

[22]  Ryan P. Adams,et al.  Gradient-based Hyperparameter Optimization through Reversible Learning , 2015, ICML.

[23]  Francisco Facchinei,et al.  Convex Optimization, Game Theory, and Variational Inequality Theory , 2010, IEEE Signal Processing Magazine.

[24]  O. A. B. Space,et al.  EQUILIBRIUM POINTS OF NONATOMIC GAMES , 2010 .

[25]  D. Weisbach,et al.  The Design of a Carbon Tax , 2009 .

[26]  Patrice Marcotte,et al.  An overview of bilevel optimization , 2007, Ann. Oper. Res..

[27]  Donald W. Hearn,et al.  An MPEC approach to second-best toll pricing , 2004, Math. Program..

[28]  S. Bekhor,et al.  Route Choice Models Used in the Stochastic User Equilibrium Problem: A Review , 2004 .

[29]  Erik T. Verhoef,et al.  SECOND-BEST CONGESTION PRICING IN GENERAL NETWORKS. HEURISTIC ALGORITHMS FOR FINDING SECOND-BEST OPTIMAL TOLL LEVELS AND TOLL POINTS , 2002 .

[30]  David E. Boyce,et al.  ROUTE FLOW ENTROPY MAXIMIZATION IN ORIGIN-BASED TRAFFIC ASSIGNMENT , 1999 .

[31]  Martin L. Hazelton Some Remarks on Stochastic User Equilibrium , 1998 .

[32]  Bethany L. Nicholson,et al.  Mathematical Programs with Equilibrium Constraints , 2021, Pyomo — Optimization Modeling in Python.

[33]  J. Weibull,et al.  Nash Equilibrium and Evolution by Imitation , 1994 .

[34]  E. Maskin The Invisible Hand and Externalities , 1994 .

[35]  Till Requate,et al.  Pollution control in a Cournot duopoly via taxes or permits , 1993 .

[36]  A. Nagurney Network Economics: A Variational Inequality Approach , 1992 .

[37]  Terry L. Friesz,et al.  Hierarchical optimization: An introduction , 1992, Ann. Oper. Res..

[38]  Terry L. Friesz,et al.  Sensitivity analysis based heuristic algorithms for mathematical programs with variational inequality constraints , 1990, Math. Program..

[39]  J. Pang,et al.  Existence of optimal solutions to mathematical programs with equilibrium constraints , 1988 .

[40]  B. Greenwald,et al.  Externalities in Economies with Imperfect Information and Incomplete Markets , 1986 .

[41]  Stella Dafermos,et al.  An iterative scheme for variational inequalities , 1983, Math. Program..

[42]  John Darzentas,et al.  Problem Complexity and Method Efficiency in Optimization , 1983 .

[43]  James J. Valone,et al.  Free to Choose: A Personal Statement , 1980 .

[44]  Stella Dafermos,et al.  Traffic Equilibrium and Variational Inequalities , 1980 .

[45]  Jerome Bracken,et al.  Mathematical Programs with Optimization Problems in the Constraints , 1973, Oper. Res..

[46]  W. Vickrey Congestion Theory and Transport Investment , 1969 .

[47]  Heinrich von Stackelberg,et al.  Stackelberg (Heinrich von) - The Theory of the Market Economy, translated from the German and with an introduction by Alan T. PEACOCK. , 1953 .

[48]  J. G. Wardrop,et al.  Some Theoretical Aspects of Road Traffic Research , 1952 .