论文信息 - Dual Descent ALM and ADMM - 字舞流文

Dual Descent ALM and ADMM

Classical primal-dual algorithms attempt to solve $\max_{\mu}\min_{x} \mathcal{L}(x,\mu)$ by alternatively minimizing over the primal variable $x$ through primal descent and maximizing the dual variable $\mu$ through dual ascent. However, when $\mathcal{L}(x,\mu)$ is highly nonconvex with complex constraints in $x$, the minimization over $x$ may not achieve global optimality, and hence the dual ascent step loses its valid intuition. This observation motivates us to propose a new class of primal-dual algorithms for nonconvex constrained optimization with the key feature to reverse dual ascent to a conceptually new dual descent, in a sense, elevating the dual variable to the same status as the primal variable. Surprisingly, this new dual scheme achieves some best iteration complexities for solving nonconvex optimization problems. In particular, when the dual descent step is scaled by a fractional constant, we name it scaled dual descent (SDD), otherwise, unscaled dual descent (UDD). For nonconvex multiblock optimization with nonlinear equality constraints, we propose SDD-ADMM and show that it finds an $\epsilon$-stationary solution in $\mathcal{O}(\epsilon^{-4})$ iterations. The complexity is further improved to $\mathcal{O}(\epsilon^{-3})$ and $\mathcal{O}(\epsilon^{-2})$ under proper conditions. We also propose UDD-ALM, combining UDD with ALM, for weakly convex minimization over affine constraints. We show that UDD-ALM finds an $\epsilon$-stationary solution in $\mathcal{O}(\epsilon^{-2})$ iterations. These complexity bounds for both algorithms either achieve or improve the best-known results in the ADMM and ALM literature. Moreover, SDD-ADMM addresses a long-standing limitation of existing ADMM frameworks.

Andy Sun | Kaizhao Sun | Kaizhao Sun | Andy Sun

[1] X. Sun,et al. Algorithms for Difference-of-Convex Programs Based on Difference-of-Moreau-Envelopes Smoothing , 2022, INFORMS Journal on Optimization.

[2] Yangyang Xu,et al. Augmented Lagrangian-Based First-Order Methods for Convex-Constrained Programs with Weakly Convex Objective , 2021, INFORMS J. Optim..

[3] Daniel P. Robinson,et al. Inexact Sequential Quadratic Optimization for Minimizing a Stochastic Objective Function Subject to Deterministic Nonlinear Equality Constraints , 2021, 2107.03512.

[4] Wotao Yin,et al. Moreau Envelope Augmented Lagrangian Method for Nonconvex Optimization with Linear Constraints , 2021, Journal of Scientific Computing.

[5] Jefferson G. Melo,et al. Iteration Complexity of a Proximal Augmented Lagrangian Method for Solving Nonconvex Composite Optimization Problems with Nonlinear Convex Constraints , 2020, Math. Oper. Res..

[6] Jefferson G. Melo,et al. Iteration-complexity of an inner accelerated inexact proximal augmented Lagrangian method based on the classical Lagrangian function and a full Lagrange multiplier update , 2020 .

[7] Lei Zhao,et al. A First-Order Primal-Dual Method for Nonconvex Constrained Optimization Based on the Augmented Lagrangian , 2020, Mathematics of Operations Research.

[8] Daniel P. Robinson,et al. Sequential Quadratic Optimization for Nonlinear Equality Constrained Stochastic Optimization , 2020, SIAM J. Optim..

[9] Yangyang Xu,et al. Rate-improved inexact augmented Lagrangian method for constrained nonconvex optimization , 2020, AISTATS.

[10] Chen Zhang,et al. Monotone Splitting Sequential Quadratic Optimization Algorithm with Applications in Electric Power Systems , 2020, J. Optim. Theory Appl..

[11] Mingyi Hong,et al. Penalty Dual Decomposition Method for Nonsmooth Nonconvex Optimization—Part I: Algorithms and Convergence Analysis , 2020, IEEE Transactions on Signal Processing.

[12] Mingyi Hong,et al. Penalty Dual Decomposition Method for Nonsmooth Nonconvex Optimization—Part II: Applications , 2020, IEEE Transactions on Signal Processing.

[13] Jefferson G. Melo,et al. Iteration-complexity of an inexact proximal accelerated augmented Lagrangian method for solving linearly constrained smooth nonconvex composite optimization problems , 2020, 2006.08048.

[14] Yangyang Xu,et al. Inexact Proximal-Point Penalty Methods for Constrained Non-Convex Optimization , 2019 .

[15] Guanghui Lan,et al. Stochastic first-order methods for convex and nonconvex functional constrained optimization , 2019, Mathematical Programming.

[16] Tianbao Yang,et al. Proximally Constrained Methods for Weakly Convex Optimization with Weakly Convex Constraints. , 2019 .

[17] Stephen J. Wright,et al. Complexity of Proximal Augmented Lagrangian for Nonconvex Optimization with Nonlinear Equality Constraints , 2019, Journal of Scientific Computing.

[18] Volkan Cevher,et al. An Inexact Augmented Lagrangian Framework for Nonconvex Optimization with Nonlinear Constraints , 2019, NeurIPS.

[19] X. A. Sun,et al. A two-level distributed algorithm for nonconvex constrained optimization , 2019, Computational optimization and applications.

[20] Mingyi Hong,et al. Perturbed proximal primal–dual algorithm for nonconvex nonsmooth optimization , 2019, Mathematical Programming.

[21] Zhi-Quan Luo,et al. A Proximal Alternating Direction Method of Multiplier for Linearly Constrained Nonconvex Minimization , 2018, SIAM J. Optim..

[22] Jefferson G. Melo,et al. Iteration-complexity of a Jacobi-type non-Euclidean ADMM for multi-block linearly constrained nonconvex programs , 2017, 1705.07229.

[23] Jefferson G. Melo,et al. Convergence rate bounds for a proximal ADMM with over-relaxation stepsize parameter for solving nonconvex linearly constrained problems , 2017, 1702.01850.

[24] Jefferson G. Melo,et al. Extending the ergodic convergence rate of the proximal ADMM , 2016, 1611.02903.

[25] Shiqian Ma,et al. Structured nonconvex and nonsmooth optimization: algorithms and iteration complexity analysis , 2016, Computational Optimization and Applications.

[26] Mingyi Hong,et al. Decomposing Linearly Constrained Nonconvex Problems by a Proximal Primal Dual Approach: Algorithms, Convergence, and Applications , 2016, ArXiv.

[27] Wotao Yin,et al. Global Convergence of ADMM in Nonconvex Nonsmooth Optimization , 2015, Journal of Scientific Computing.

[28] Xiaoming Yuan,et al. On non-ergodic convergence rate of Douglas–Rachford alternating direction method of multipliers , 2015, Numerische Mathematik.

[29] Shiqian Ma,et al. Global Convergence of Unmodified 3-Block ADMM for a Class of Convex Minimization Problems , 2015, Journal of Scientific Computing.

[30] Damek Davis,et al. A Three-Operator Splitting Scheme and its Optimization Applications , 2015, 1504.01032.

[31] Renato D. C. Monteiro,et al. Iteration-complexity of first-order augmented Lagrangian methods for convex programming , 2015, Mathematical Programming.

[32] Bingsheng He,et al. The direct extension of ADMM for multi-block convex minimization problems is not necessarily convergent , 2014, Mathematical Programming.

[33] Yangyang Xu,et al. A Globally Convergent Algorithm for Nonconvex Optimization Based on Block Coordinate Update , 2014, Journal of Scientific Computing.

[34] Zhi-Quan Luo,et al. Convergence analysis of alternating direction method of multipliers for a family of nonconvex problems , 2014, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[35] Asuman E. Ozdaglar,et al. Broadcast-based distributed alternating direction method of multipliers , 2014, 2014 52nd Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[36] Saeed Ghadimi,et al. Accelerated gradient methods for nonconvex nonlinear and stochastic programming , 2013, Mathematical Programming.

[37] Wotao Yin,et al. A Block Coordinate Descent Method for Regularized Multiconvex Optimization with Applications to Nonnegative Tensor Factorization and Completion , 2013, SIAM J. Imaging Sci..

[38] Qing Ling,et al. On the Linear Convergence of the ADMM in Decentralized Consensus Optimization , 2013, IEEE Transactions on Signal Processing.

[39] Renato D. C. Monteiro,et al. Iteration-Complexity of Block-Decomposition Algorithms and the Alternating Direction Method of Multipliers , 2013, SIAM J. Optim..

[40] Zhi-Quan Luo,et al. On the linear convergence of the alternating direction method of multipliers , 2012, Mathematical Programming.

[41] Bingsheng He,et al. On the O(1/n) Convergence Rate of the Douglas-Rachford Alternating Direction Method , 2012, SIAM J. Numer. Anal..

[42] Neal Parikh,et al. Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers , 2011, Found. Trends Mach. Learn..

[43] José Mario Martínez,et al. On Augmented Lagrangian Methods with General Lower-Level Constraints , 2007, SIAM J. Optim..

[44] Dimitri P. Bertsekas,et al. On the Douglas—Rachford splitting method and the proximal point algorithm for maximal monotone operators , 1992, Math. Program..

[45] Dimitri P. Bertsekas,et al. Constrained Optimization and Lagrange Multiplier Methods , 1982 .

[46] R. Tyrrell Rockafellar,et al. Augmented Lagrangians and Applications of the Proximal Point Algorithm in Convex Programming , 1976, Math. Oper. Res..

[47] R. Rockafellar. The multiplier method of Hestenes and Powell applied to convex programming , 1973 .

[48] M. Hestenes. Multiplier and gradient methods , 1969 .

[49] H. H. Rachford,et al. On the numerical solution of heat conduction problems in two and three space variables , 1956 .

[50] H. H. Rachford,et al. The Numerical Solution of Parabolic and Elliptic Differential Equations , 1955 .

[51] Miantao Chao,et al. A QCQP-based splitting SQP algorithm for two-block nonconvex constrained optimization problems with application , 2021, J. Comput. Appl. Math..

[52] Jefferson G. Melo,et al. Iteration-Complexity of a Linearized Proximal Multiblock ADMM Class for Linearly Constrained Nonconvex Optimization Problems , 2017 .

[53] A. Ruszczynski,et al. Nonlinear Optimization , 1999 .

[54] D. Gabay. Applications of the method of multipliers to variational inequalities , 1983 .

[55] B. Mercier,et al. A dual algorithm for the solution of nonlinear variational problems via finite element approximation , 1976 .

[56] M. Powell. A method for nonlinear constraints in minimization problems , 1969 .