Primal-Dual Stochastic Gradient Method for Convex Programs with Many Functional Constraints

The stochastic gradient (SG) method has been widely applied to solve optimization problems whose objective is stochastic or an average of many functions. Most existing work on SG assumes that the underlying problem is unconstrained or has an easy-to-project constraint set. In this paper, we consider problems with a stochastic objective and many functional constraints. For such problems, it can be extremely expensive to project a point onto the feasible set, or even to compute the subgradients and/or function values of all constraint functions. To solve these problems, we propose a novel SG method based on the augmented Lagrangian function. In every iteration, it queries a stochastic subgradient of the objective, the subgradient and function value of one randomly sampled constraint function, and the function value of another sampled constraint function; hence the per-iteration complexity is low. We establish convergence rates for convex and strongly convex problems: the method achieves the optimal $O(1/\sqrt{k})$ rate in the convex case and a nearly optimal $O\big((\log k)/k\big)$ rate in the strongly convex case. Numerical experiments on quadratically constrained quadratic programming demonstrate its efficiency.
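
To make the per-iteration template concrete, below is a minimal Python sketch of a primal-dual SG step on the classical augmented Lagrangian of a toy convex QCQP: the primal step uses the subgradient and value of one sampled constraint, and the dual step uses only the value of another sampled constraint. The step sizes, the fixed penalty `beta`, the factor `m` used to unbias the sampled terms, and all problem data are illustrative assumptions, not the paper's exact update rules.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy convex QCQP: min 0.5 x'Q0 x + c0'x  s.t.  0.5 x'Qj x + cj'x - bj <= 0, j = 1..m
n, m = 20, 500

def rand_psd(n):
    A = rng.standard_normal((n, n))
    return A @ A.T / n + 0.1 * np.eye(n)

Q0, c0 = rand_psd(n), rng.standard_normal(n)
Qs = [rand_psd(n) for _ in range(m)]
cs = rng.standard_normal((m, n))
bs = np.full(m, 1.0)          # keeps x = 0 strictly feasible

def g(j, x):                  # value of the j-th constraint function
    return 0.5 * x @ Qs[j] @ x + cs[j] @ x - bs[j]

def g_grad(j, x):             # gradient of the j-th constraint function
    return Qs[j] @ x + cs[j]

x = np.zeros(n)
z = np.zeros(m)               # dual variables (multipliers), kept nonnegative
beta = 1.0                    # augmented-Lagrangian penalty (assumed fixed here)
K = 20000
x_avg = np.zeros(n)

for k in range(1, K + 1):
    alpha = 1.0 / np.sqrt(k)  # primal step size, matching the O(1/sqrt(k)) convex rate
    rho = 1.0 / np.sqrt(k)    # dual step size (illustrative choice)

    # Stochastic subgradient of the objective (exact gradient + noise, to mimic a
    # stochastic first-order oracle).
    grad_f = Q0 @ x + c0 + 0.05 * rng.standard_normal(n)

    # Sample ONE constraint j: its value and subgradient feed the primal step.
    j = rng.integers(m)
    slope = max(0.0, z[j] + beta * g(j, x))     # derivative of the AL penalty term
    grad_x = grad_f + m * slope * g_grad(j, x)  # factor m unbiases the sampled sum

    x = x - alpha * grad_x

    # Sample ANOTHER constraint l: only its function value is needed for the dual step.
    l = rng.integers(m)
    z[l] = max(0.0, z[l] + rho * m * g(l, x))   # projected stochastic ascent on z_l

    x_avg += (x - x_avg) / k                    # ergodic average of the iterates

viol = max(max(g(j, x_avg), 0.0) for j in range(m))
print(f"objective: {0.5 * x_avg @ Q0 @ x_avg + c0 @ x_avg:.4f}, max violation: {viol:.2e}")
```

Note the per-iteration cost: one stochastic objective subgradient, one constraint subgradient-plus-value, and one additional constraint value, independent of the total number of constraints $m$.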
