Fully Stochastic Trust-Region Sequential Quadratic Programming for Equality-Constrained Optimization Problems

We propose a trust-region stochastic sequential quadratic programming algorithm (TR-StoSQP) to solve nonlinear optimization problems with stochastic objectives and deterministic equality constraints. We consider a fully stochastic setting, where in each iteration a single sample is generated to estimate the objective gradient. The algorithm adaptively selects the trust-region radius and, compared to the existing line-search StoSQP schemes, allows us to employ indefinite Hessian matrices (i.e., Hessians without modification) in SQP subproblems. As a trust-region method for constrained optimization, our algorithm needs to address an infeasibility issue—the linearized equality constraints and trust-region constraints might lead to infeasible SQP subproblems. In this regard, we propose an adaptive relaxation technique to compute the trial step that consists of a normal step and a tangential step. To control the lengths of the two steps, we adaptively decompose the trust-region radius into two segments based on the proportions of the feasibility and optimality residuals to the full KKT residual. The normal step has a closed form, while the tangential step is solved from a trust-region subproblem, to which a solution ensuring the Cauchy reduction is sufficient for our study. We establish the global almost sure convergence guarantee for TR-StoSQP, and illustrate its empirical performance on both a subset of problems in the CUTEst test set and constrained logistic regression problems using data from the LIBSVM collection.

[1]  Raghu Bollapragada,et al.  An Adaptive Sampling Sequential Quadratic Programming Method for Equality Constrained Stochastic Optimization , 2022, 2206.00712.

[2]  Michael W. Mahoney,et al.  Asymptotic Convergence Rate and Statistical Inference for Stochastic Sequential Quadratic Programming , 2022, ArXiv.

[3]  Michael W. Mahoney,et al.  Hessian Averaging in Stochastic Newton Methods Achieves Superlinear Convergence , 2022, Mathematical Programming.

[4]  A. Berahas,et al.  Accelerating stochastic sequential quadratic programming for equality constrained optimization using predictive variance reduction , 2022, Computational Optimization and Applications.

[5]  Daniel P. Robinson,et al.  Worst-case complexity of an SQP method for nonlinear equality constrained stochastic optimization , 2021, Mathematical Programming.

[6]  M. Anitescu,et al.  Inequality constrained stochastic nonlinear optimization via active-set sequential quadratic programming , 2021, Mathematical Programming.

[7]  Daniel P. Robinson,et al.  Inexact Sequential Quadratic Optimization for Minimizing a Stochastic Objective Function Subject to Deterministic Nonlinear Equality Constraints , 2021, 2107.03512.

[8]  Daniel P. Robinson,et al.  A Stochastic Sequential Quadratic Optimization Algorithm for Nonlinear-Equality-Constrained Optimization with Rank-Deficient Jacobians , 2021, Mathematics of Operations Research.

[9]  M. Anitescu,et al.  An adaptive stochastic sequential quadratic programming with differentiable exact augmented lagrangians , 2021, Mathematical Programming.

[10]  Daniel P. Robinson,et al.  Sequential Quadratic Optimization for Nonlinear Equality Constrained Stochastic Optimization , 2020, SIAM J. Optim..

[11]  Sen Na,et al.  Convergence Analysis of Accelerated Stochastic Gradient Descent under the Growth Condition , 2020, 2006.06782.

[12]  Rui Shi,et al.  A fully stochastic second-order trust region method , 2019, Optim. Methods Softw..

[13]  Sebastian U. Stich,et al.  Unified Optimal Analysis of the (Stochastic) Gradient Method , 2019, ArXiv.

[14]  Mark W. Schmidt,et al.  Fast and Faster Convergence of SGD for Over-Parameterized Models and an Accelerated Perceptron , 2018, AISTATS.

[15]  Greg Mori,et al.  Constraint-Aware Deep Neural Network Compression , 2018, ECCV.

[16]  Rui Shi,et al.  A Stochastic Trust Region Algorithm Based on Careful Step Normalization , 2017, INFORMS J. Optim..

[17]  Daniel P. Robinson,et al.  Exploiting negative curvature in deterministic and stochastic optimization , 2017, Mathematical Programming.

[18]  Jorge Nocedal,et al.  Optimization Methods for Large-Scale Machine Learning , 2016, SIAM Rev..

[19]  Katya Scheinberg,et al.  Stochastic optimization using a trust-region method and random models , 2015, Mathematical Programming.

[20]  Tong Zhang,et al.  Accelerating Stochastic Gradient Descent using Predictive Variance Reduction , 2013, NIPS.

[21]  Nicholas I. M. Gould,et al.  CUTEst: a Constrained and Unconstrained Testing Environment with safe threads for mathematical optimization , 2013, Computational Optimization and Applications.

[22]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[23]  Andrew J. Wathen,et al.  Optimal Solvers for PDE-Constrained Optimization , 2010, SIAM J. Sci. Comput..

[24]  Lorenz T. Biegler,et al.  On the implementation of an interior-point filter line-search algorithm for large-scale nonlinear programming , 2006, Math. Program..

[25]  D K Smith,et al.  Numerical Optimization , 2001, J. Oper. Res. Soc..

[26]  Paul T. Boggs,et al.  Sequential Quadratic Programming , 1995, Acta Numerica.

[27]  M. El-Alem A global convergence theory for the Celis-Dennis-Tapia trust-region algorithm for constrained optimization , 1991 .

[28]  Ya-Xiang Yuan,et al.  A trust region algorithm for equality constrained optimization , 1990, Math. Program..

[29]  E. Omojokun Trust region algorithms for optimization with nonlinear equality and inequality constraints , 1990 .

[30]  J. Dupacová,et al.  ASYMPTOTIC BEHAVIOR OF STATISTICAL ESTIMATORS AND OF OPTIMAL SOLUTIONS OF STOCHASTIC OPTIMIZATION PROBLEMS , 1988 .

[31]  Richard H. Byrd,et al.  A Trust Region Algorithm for Nonlinearly Constrained Optimization , 1987 .

[32]  A. Vardi A Trust Region Algorithm for Equality Constrained Minimization: Convergence Properties and Implementation , 1985 .

[33]  M. R. Celis,et al.  A Trust Region Strategy for Equality Constrained Optimization , 1984 .

[34]  Dimitri P. Bertsekas,et al.  Constrained Optimization and Lagrange Multiplier Methods , 1982 .

[35]  C. Elster,et al.  A trust region method for the optimization of noisy functions , 2007 .

[36]  Dimitri P. Bertsekas,et al.  Network optimization : continuous and discrete models , 1998 .

[37]  H. Robbins,et al.  A Convergence Theorem for Non Negative Almost Supermartingales and Some Applications , 1985 .