Last-Iterate Convergence of Saddle Point Optimizers via High-Resolution Differential Equations
暂无分享,去创建一个
[1] Haihao Lu,et al. An O(sr)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$O(s^r)$$\end{document}-resolution ODE framework for understand , 2020, Mathematical Programming.
[2] Michael I. Jordan,et al. Efficient Methods for Structured Nonconvex-Nonconcave Min-Max Optimization , 2020, AISTATS.
[3] Laurent Lessard,et al. A Unified Analysis of First-Order Methods for Smooth Games via Integral Quadratic Constraints , 2020, J. Mach. Learn. Res..
[4] Ya-Ping Hsieh,et al. The limits of min-max optimization algorithms: convergence to spurious non-critical sets , 2020, ICML.
[5] Jacob Abernethy,et al. Last-iterate convergence rates for min-max optimization , 2019, ArXiv.
[6] Michael I. Jordan,et al. Understanding the acceleration phenomenon via high-resolution differential equations , 2018, Mathematical Programming.
[7] Lillian J. Ratliff,et al. Local Convergence Analysis of Gradient Descent Ascent with Finite Timescale Separation , 2021, ICLR.
[8] Noah Golowich,et al. Tight last-iterate convergence rates for no-regret learning in multi-player games , 2020, NeurIPS.
[9] Ioannis Mitliagkas,et al. LEAD: Least-Action Dynamics for Min-Max Optimization , 2020, ArXiv.
[10] Ioannis Mitliagkas,et al. Stochastic Hamiltonian Gradient Methods for Smooth Games , 2020, ICML.
[11] Michael I. Jordan,et al. On dissipative symplectic integration with applications to gradient-based optimization , 2020, Journal of Statistical Mechanics: Theory and Experiment.
[12] Noah Golowich,et al. Last Iterate is Slower than Averaged Iterate in Smooth Convex-Concave Saddle Point Problems , 2020, COLT.
[13] Jimmy Ba,et al. On Solving Minimax Optimization Locally: A Follow-the-Ridge Approach , 2019, ICLR.
[14] Pascal Vincent,et al. A Closer Look at the Optimization Landscapes of Generative Adversarial Networks , 2019, ICLR.
[15] Aryan Mokhtari,et al. A Unified Analysis of Extra-gradient and Optimistic Gradient Methods for Saddle Point Problems: Proximal Point Approach , 2019, AISTATS.
[16] S. Shankar Sastry,et al. On Gradient-Based Learning in Continuous Games , 2018, SIAM J. Math. Data Sci..
[17] Ioannis Mitliagkas,et al. A Tight and Unified Analysis of Gradient-Based Methods for a Whole Spectrum of Differentiable Games , 2020, AISTATS.
[18] J. Malick,et al. On the convergence of single-call stochastic extra-gradient methods , 2019, NeurIPS.
[19] Geoffrey E. Hinton,et al. Lookahead Optimizer: k steps forward, 1 step back , 2019, NeurIPS.
[20] Tatjana Chavdarova,et al. Reducing Noise in GAN Training with Variance Reduced Extragradient , 2019, NeurIPS.
[21] Ioannis Mitliagkas,et al. Negative Momentum for Improved Game Dynamics , 2018, AISTATS.
[22] Chuan-Sheng Foo,et al. Optimistic mirror descent in saddle-point problems: Going the extra (gradient) mile , 2018, ICLR.
[23] Thomas Hofmann,et al. Local Saddle Point Optimization: A Curvature Exploitation Approach , 2018, AISTATS.
[24] Tengyuan Liang,et al. Interaction Matters: A Note on Non-asymptotic Local Convergence of Generative Adversarial Networks , 2018, AISTATS.
[25] Constantinos Daskalakis,et al. The Limit Points of (Optimistic) Gradient Descent in Min-Max Optimization , 2018, NeurIPS.
[26] Sebastian Nowozin,et al. Which Training Methods for GANs do actually Converge? , 2018, ICML.
[27] Constantinos Daskalakis,et al. Training GANs with Optimism , 2017, ICLR.
[28] Christos H. Papadimitriou,et al. Cycles in adversarial regularized learning , 2017, SODA.
[29] Jonathan P. How,et al. Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability , 2017, ICML.
[30] Ohad Shamir,et al. On the Iteration Complexity of Oblivious First-Order Optimization Algorithms , 2016, ICML.
[31] Andre Wibisono,et al. A variational perspective on accelerated methods in optimization , 2016, Proceedings of the National Academy of Sciences.
[32] Stephen P. Boyd,et al. A Differential Equation for Modeling Nesterov's Accelerated Gradient Method: Theory and Insights , 2014, J. Mach. Learn. Res..
[33] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.
[34] H. Attouch,et al. A second-order differential system with hessian-driven damping; application to non-elastic shock laws , 2012 .
[35] Renato D. C. Monteiro,et al. On the Complexity of the Hybrid Proximal Extragradient Method for the Iterates and the Ergodic Mean , 2010, SIAM J. Optim..
[36] Anna Nagurney,et al. Variational Inequalities , 2009, Encyclopedia of Optimization.
[37] Yurii Nesterov,et al. Introductory Lectures on Convex Optimization - A Basic Course , 2014, Applied Optimization.
[38] Arkadi Nemirovski,et al. Prox-Method with Rate of Convergence O(1/t) for Variational Inequalities with Lipschitz Continuous Monotone Operators and Smooth Convex-Concave Saddle Point Problems , 2004, SIAM J. Optim..
[39] F. Facchinei,et al. Finite-Dimensional Variational Inequalities and Complementarity Problems , 2003 .
[40] J. Schropp,et al. A dynamical systems approach to constrained minimization , 2000 .
[41] U. Helmke,et al. Optimization and Dynamical Systems , 1994, Proceedings of the IEEE.
[42] M. Hirsch,et al. Dynamics of Morse-Smale urn processes , 1995, Ergodic Theory and Dynamical Systems.
[43] P. Tseng. On linear convergence of iterative methods for the variational inequality problem , 1995 .
[44] Eyad H. Abed,et al. Guardian maps and the generalized stability of parametrized families of matrices and polynomials , 1990, Math. Control. Signals Syst..
[45] Xu-kai Xie,et al. Stable polynomials with complex coefficients , 1985, 1985 24th IEEE Conference on Decision and Control.
[46] J. Holton. Geophysical fluid dynamics. , 1983, Science.
[47] Y. Nesterov. A method for solving the convex programming problem with convergence rate O(1/k^2) , 1983 .
[48] L. Popov. A modification of the Arrow-Hurwicz method for search of saddle points , 1980 .
[49] G. Stampacchia,et al. On the regularity of the solution of a variational inequality , 1969 .
[50] Boris Polyak. Some methods of speeding up the convergence of iteration methods , 1964 .
[51] L. Hurwicz,et al. Gradient Methods for Constrained Maxima , 1957 .
[52] A. Sard,et al. The measure of the critical values of differentiable maps , 1942 .
[53] K. Schittkowski,et al. NONLINEAR PROGRAMMING , 2022 .