Global Nash Equilibrium in Non-convex Multi-player Game: Theory and Algorithms

—Wide machine learning tasks can be formulated as non-convex multi-player games, where Nash equilibrium (NE) is an acceptable solution to all players, since no one can benefit from changing its strategy unilaterally. Attributed to the non-convexity, obtaining the existence condition of global NE is challenging, let alone designing theoretically guaranteed realization algorithms. This paper takes conjugate transformation to the formulation of non-convex multi-player games, and casts the complementary problem into a variational inequality (VI) problem with a continuous pseudo-gradient mapping. We then prove the existence condition of global NE: the solution to the VI problem satisfies a duality relation. Based on this VI formulation, we design a conjugate-based ordinary differential equation (ODE) to approach global NE, which is proved to have an exponential convergence rate. To make the dynamics more implementable, we further derive a discretized algorithm. We apply our algorithm to two typical scenarios: multi-player generalized monotone game and multi-player potential game. In the two settings, we prove that the step-size setting is required to be O (1 /k ) and O (1 / √ k ) to yield the convergence rates of O (1 /k ) and O (1 / √ k ) , respectively. Extensive experiments in robust neural network training and sensor localization are in full agreement with our theory.

[1]  Fengxiang He,et al.  Benefits of Permutation-Equivariance in Auction Mechanisms , 2022, NeurIPS.

[2]  Dewei Li,et al.  Distributed Global Optimization for a Class of Nonconvex Optimization With Coupled Constraints , 2022, IEEE Transactions on Automatic Control.

[3]  Mingli Song,et al.  Topology-aware Generalization of Decentralized SGD , 2022, ICML.

[4]  K. H. Low,et al.  Fault-Tolerant Federated Reinforcement Learning with Theoretical Guarantee , 2021, NeurIPS.

[5]  Yiguang Hong,et al.  Efficient Algorithm for Approximating Nash Equilibrium of Distributed Aggregative Games , 2021, IEEE Transactions on Cybernetics.

[6]  Mehrdad Mahdavi,et al.  Local Stochastic Gradient Descent Ascent: Convergence Analysis and Communication Efficiency , 2021, AISTATS.

[7]  Yiguang Hong,et al.  Distributed algorithm for ε-generalized Nash equilibria with uncertain coupled constraints , 2021, Autom..

[8]  Anders P. Eriksson,et al.  Rotation Averaging with the Chordal Distance: Global Minimizers and Strong Duality , 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  N. Oudjane,et al.  Approximate Nash Equilibria in Large Nonconvex Aggregative Games , 2020, Mathematics of Operations Research.

[10]  Yu Wang,et al.  Improving Multi-Agent Generative Adversarial Nets with Variational Latent Representation , 2020, Entropy.

[11]  Xianzhong Xie,et al.  Energy-Efficient Joint Scheduling and Resource Management for UAV-Enabled Multicell Networks , 2020, IEEE Systems Journal.

[12]  Seema Bawa,et al.  Dynamic pricing techniques for Intelligent Transportation System in smart cities: A systematic review , 2020, Comput. Commun..

[13]  Bo Zhang,et al.  Triple Generative Adversarial Networks , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Michael I. Jordan,et al.  On Gradient Descent Ascent for Nonconvex-Concave Minimax Problems , 2019, ICML.

[15]  Lantao Yu,et al.  Multi-Agent Adversarial Inverse Reinforcement Learning , 2019, ICML.

[16]  Anoop Cherian,et al.  Game Theoretic Optimization via Gradient-based Nikaido-Isoda Function , 2019, ICML.

[17]  Yuan Liang,et al.  Topology optimization via sequential integer programming and Canonical relaxation algorithm , 2019, Computer Methods in Applied Mechanics and Engineering.

[18]  Masayoshi Tomizuka,et al.  Interaction-aware Multi-agent Tracking and Probabilistic Behavior Prediction via Adversarial Learning , 2019, 2019 International Conference on Robotics and Automation (ICRA).

[19]  Ning Li,et al.  DF-CSPG: A Potential Game Approach for Device-Free Localization Exploiting Joint Sparsity , 2019, IEEE Wireless Communications Letters.

[20]  Lacra Pavel,et al.  An operator splitting approach for distributed generalized Nash equilibria computation , 2019, Autom..

[21]  Jason D. Lee,et al.  Solving a Class of Non-Convex Min-Max Games Using Iterative First Order Methods , 2019, NeurIPS.

[22]  Jong-Shi Pang,et al.  Piecewise affine parameterized value-function based bilevel non-cooperative games , 2018, Math. Program..

[23]  Mingrui Liu,et al.  Weakly-convex–concave min–max optimization: provable algorithms and applications in machine learning , 2018, Optim. Methods Softw..

[24]  Lacra Pavel,et al.  Distributed GNE Seeking Under Partial-Decision Information Over Networks via a Doubly-Augmented Operator Splitting Approach , 2018, IEEE Transactions on Automatic Control.

[25]  Constantinos Daskalakis,et al.  The Limit Points of (Optimistic) Gradient Descent in Min-Max Optimization , 2018, NeurIPS.

[26]  Gauthier Gidel,et al.  A Variational Inequality Perspective on Generative Adversarial Networks , 2018, ICLR.

[27]  Stefano Ermon,et al.  Multi-Agent Generative Adversarial Imitation Learning , 2018, NeurIPS.

[28]  Alagan Anpalagan,et al.  Distributed TOA-Based Positioning in Wireless Sensor Networks: A Potential Game Approach , 2018, IEEE Communications Letters.

[29]  Le Song,et al.  SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation , 2017, ICML.

[30]  Jelena Diakonikolas,et al.  The Approximate Duality Gap Technique: A Unified Theory of First-Order Methods , 2017, SIAM J. Optim..

[31]  Liu Liu,et al.  Variance Reduced Methods for Non-Convex Composition Optimization , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Uday V. Shanbhag,et al.  Asynchronous Schemes for Stochastic and Misspecified Potential Games and Nonconvex Optimization , 2017, Oper. Res..

[33]  David Silver,et al.  A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning , 2017, NIPS.

[34]  Constantinos Daskalakis,et al.  Training GANs with Optimism , 2017, ICLR.

[35]  René Vidal,et al.  Structured Low-Rank Matrix Factorization: Global Optimality, Algorithms, and Applications , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Sepp Hochreiter,et al.  GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium , 2017, NIPS.

[37]  Zhengyuan Zhou,et al.  Learning in games with continuous action sets and unknown payoff functions , 2016, Mathematical Programming.

[38]  Angelia Nedic,et al.  Self-Tuned Stochastic Approximation Schemes for Non-Lipschitzian Stochastic Multi-User Optimization and Nash Games , 2016, IEEE Transactions on Automatic Control.

[39]  Angelia Nedic,et al.  Distributed Algorithms for Aggregative Games on Graphs , 2016, Oper. Res..

[40]  Alexandre M. Bayen,et al.  Accelerated Mirror Descent in Continuous and Discrete Time , 2015, NIPS.

[41]  Victor C. M. Leung,et al.  Centralized and Game Theoretical Solutions of Joint Source and Relay Power Allocation for AF Relay Based Network , 2015, IEEE Transactions on Communications.

[42]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[43]  Jie Jia,et al.  On Distributed Localization for Road Sensor Networks: A Game Theoretic Approach , 2013 .

[44]  Vittorio Latorre,et al.  Canonical duality for solving general nonconvex constrained problems , 2013, Optim. Lett..

[45]  Angelia Nedic,et al.  On Stochastic Subgradient Mirror-Descent Algorithm with Weighted Averaging , 2013, SIAM J. Optim..

[46]  Walid Saad,et al.  Game-Theoretic Methods for the Smart Grid: An Overview of Microgrid Systems, Demand-Side Management, and Smart Grid Communications , 2012, IEEE Signal Processing Magazine.

[47]  Duan Li,et al.  On zero duality gap in nonconvex quadratic programming problems , 2012, J. Glob. Optim..

[48]  Jong-Shi Pang,et al.  Nonconvex Games with Side Constraints , 2011, SIAM J. Optim..

[49]  Sonia Martínez,et al.  An Approximate Dual Subgradient Algorithm for Multi-Agent Non-Convex Optimization , 2010, IEEE Transactions on Automatic Control.

[50]  Francisco Facchinei,et al.  Penalty Methods for the Solution of Generalized Nash Equilibrium Problems , 2010, SIAM J. Optim..

[51]  Marcello Pelillo,et al.  A Game-Theoretic Approach to Hypergraph Clustering , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[52]  Amit K. Roy-Chowdhury,et al.  Distributed multi-target tracking in a self-configuring camera network , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[53]  Bart De Schutter,et al.  A Comprehensive Survey of Multiagent Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[54]  W. Haddad,et al.  Nonlinear Dynamical Systems and Control: A Lyapunov-Based Approach , 2008 .

[55]  Daniel Pérez Palomar,et al.  Power Control By Geometric Programming , 2007, IEEE Transactions on Wireless Communications.

[56]  Francisco Facchinei,et al.  Distributed Power Allocation With Rate Constraints in Gaussian Parallel Interference Channels , 2007, IEEE Transactions on Information Theory.

[57]  João Paulo Costeira,et al.  A Global Solution to Sparse Correspondence Problems , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[58]  René Vidal,et al.  A hierarchical approach to probabilistic pursuit-evasion games with unmanned ground and aerial vehicles , 2001, Proceedings of the 40th IEEE Conference on Decision and Control (Cat. No.01CH37228).

[59]  E. Damme,et al.  Non-Cooperative Games , 2000 .

[60]  David Yang Gao,et al.  Canonical Dual Transformation Method and Generalized Triality Theory in Nonsmooth Global Optimization , 2000, J. Glob. Optim..

[61]  Y. Freund,et al.  Adaptive game playing using multiplicative weights , 1999 .

[62]  Marc Teboulle,et al.  Convergence Analysis of a Proximal-Like Minimization Algorithm Using Bregman Functions , 1993, SIAM J. Optim..

[63]  Patrick T. Harker,et al.  Finite-dimensional variational inequality and nonlinear complementarity problems: A survey of theory, algorithms and applications , 1990, Math. Program..

[64]  G. Minty Monotone (nonlinear) operators in Hilbert space , 1962 .

[65]  Eric V. Mazumdar,et al.  Global Convergence to Local Minmax Equilibrium in Classes of Nonconvex Zero-Sum Games , 2021, NeurIPS.

[66]  P. Pardalos,et al.  Equilibrium problems : nonsmooth optimization and variational inequality models , 2004 .

[67]  F. Facchinei,et al.  Finite-Dimensional Variational Inequalities and Complementarity Problems , 2003 .

[68]  Arkadi Nemirovski,et al.  The Ordered Subsets Mirror Descent Optimization Method with Applications to Tomography , 2001, SIAM J. Optim..

[69]  John Darzentas,et al.  Problem Complexity and Method Efficiency in Optimization , 1983 .

[70]  G. M. Korpelevich The extragradient method for finding saddle points and other problems , 1976 .