On smoothing, regularization, and averaging in stochastic approximation methods for stochastic variational inequality problems

Traditionally, most stochastic approximation (SA) schemes for stochastic variational inequality (SVI) problems have required the underlying mapping to be either strongly monotone or monotone and Lipschitz continuous. In contrast, we consider SVIs with merely monotone and non-Lipschitzian maps. We develop a regularized smoothed SA (RSSA) scheme wherein the stepsize, smoothing, and regularization parameters are reduced after every iteration at a prescribed rate. Under suitable assumptions on the sequences, we show that the algorithm generates iterates that converge to the least norm solution in an almost sure sense, extending the results in Koshal et al. (IEEE Trans Autom Control 58(3):594–609, 2013) to the non-Lipschitzian regime. Additionally, we provide rate estimates that relate iterates to their counterparts derived from a smoothed Tikhonov trajectory associated with a deterministic problem. To derive non-asymptotic rate statements, we develop a variant of the RSSA scheme, denoted by aRSSA$$_r$$r, in which we employ a weighted iterate-averaging, parameterized by a scalar r where $$r = 1$$r=1 provides us with the standard averaging scheme. The main contributions are threefold: (i) when $$r<1$$r<1 and the parameter sequences are chosen appropriately, we show that the averaged sequence converges to the least norm solution almost surely and a suitably defined gap function diminishes at an approximate rate $$\mathcal{O}({1}\slash {\root 6 \of {k}})$$O(1/k6) after k steps; (ii) when $$r<1$$r<1, and smoothing and regularization are suppressed, the gap function admits the rate $$\mathcal{O}({1}\slash {\sqrt{k}})$$O(1/k), thus improving the rate $$\mathcal{O}(\ln (k)/\sqrt{k})$$O(ln(k)/k) under standard averaging; and (iii) we develop a window-based variant of this scheme that also displays the optimal rate for $$r < 1$$r<1. Notably, we prove the superiority of the scheme with $$r < 1$$r<1 with its counterpart with $$r=1$$r=1 in terms of the constant factor of the error bound when the size of the averaging window is sufficiently large. We present the performance of the developed schemes on a stochastic Nash–Cournot game with merely monotone and non-Lipschitzian maps.

[1]  H. Robbins A Stochastic Approximation Method , 1951 .

[2]  John R. Birge,et al.  Introduction to Stochastic Programming , 1997 .

[3]  A. Shapiro Monte Carlo Sampling Methods , 2003 .

[4]  Torbjörn Larsson,et al.  A class of gap functions for variational inequalities , 1994, Math. Program..

[5]  Uday V. Shanbhag,et al.  Addressing supply-side risk in uncertain power markets: stochastic Nash models, scalable algorithms and error analysis , 2013, Optim. Methods Softw..

[6]  Y. Ermoliev Stochastic quasigradient methods and their application to system optimization , 1983 .

[7]  Houyuan Jiang,et al.  Stochastic Approximation Approaches to the Stochastic Variational Inequality Problem , 2008, IEEE Transactions on Automatic Control.

[8]  Angelia Nedic,et al.  Self-Tuned Stochastic Approximation Schemes for Non-Lipschitzian Stochastic Multi-User Optimization and Nash Games , 2016, IEEE Transactions on Automatic Control.

[9]  Boris Polyak,et al.  Acceleration of stochastic approximation by averaging , 1992 .

[10]  Huifu Xu,et al.  Adaptive Smoothing Method, Deterministically Computable Generalized Jacobians, and the Newton Method , 2001 .

[11]  Angelia Nedic,et al.  On stochastic gradient and subgradient methods with adaptive steplength sequences , 2011, Autom..

[12]  Saeed Ghadimi,et al.  Optimal Stochastic Approximation Algorithms for Strongly Convex Stochastic Composite Optimization I: A Generic Algorithmic Framework , 2012, SIAM J. Optim..

[13]  K. Knopp,et al.  Theory and Applications of Infinite Series , 1972 .

[14]  F. Facchinei,et al.  Finite-Dimensional Variational Inequalities and Complementarity Problems , 2003 .

[15]  Dimitri P. Bertsekas,et al.  Incremental constraint projection methods for variational inequalities , 2014, Math. Program..

[16]  Alexander Shapiro,et al.  Lectures on Stochastic Programming: Modeling and Theory , 2009 .

[17]  Angelia Nedic,et al.  Multiuser Optimization: Distributed Algorithms and Error Analysis , 2011, SIAM J. Optim..

[18]  Uday V. Shanbhag,et al.  Distributed Computation of Equilibria in Monotone Nash Games via Iterative Regularization Techniques , 2012, SIAM J. Optim..

[19]  D. Mayne,et al.  Nondifferential optimization via adaptive smoothing , 1984 .

[20]  John M. Wilson,et al.  Introduction to Stochastic Programming , 1998, J. Oper. Res. Soc..

[21]  Bastian Goldlücke,et al.  Variational Analysis , 2014, Computer Vision, A Reference Guide.

[22]  Angelia Nedic,et al.  On Stochastic Subgradient Mirror-Descent Algorithm with Weighted Averaging , 2013, SIAM J. Optim..

[23]  V. Norkin The Analysis and Optimization of Probability Functions , 1993 .

[24]  D. Bertsekas Stochastic optimization problems with nondifferentiable cost functionals , 1973 .

[25]  A. Juditsky,et al.  Solving variational inequalities with Stochastic Mirror-Prox algorithm , 2008, 0809.0815.

[26]  Ines Gloeckner,et al.  Modified Lagrangians And Monotone Maps In Optimization , 2016 .

[27]  Alexander Shapiro,et al.  Stochastic Approximation approach to Stochastic Programming , 2013 .

[28]  Huifu Xu,et al.  Sample Average Approximation Methods for a Class of Stochastic Variational inequality Problems , 2010, Asia Pac. J. Oper. Res..

[29]  H. Kushner,et al.  Stochastic Approximation and Recursive Algorithms and Applications , 2003 .

[30]  O. Nelles,et al.  An Introduction to Optimization , 1996, IEEE Antennas and Propagation Magazine.

[31]  Angelia Nedic,et al.  A regularized smoothing stochastic approximation (RSSA) algorithm for stochastic variational inequality problems , 2013, 2013 Winter Simulations Conference (WSC).

[32]  Amarjit Budhiraja,et al.  Confidence Regions for Stochastic Variational Inequalities , 2013, Math. Oper. Res..

[33]  A. Nedić,et al.  Regularized Iterative Stochastic Approximation Methods for Variational Inequality Problems , 2011 .

[34]  Jorge Nocedal,et al.  An Interior Point Algorithm for Large-Scale Nonlinear Programming , 1999, SIAM J. Optim..

[35]  Michael Patriksson,et al.  Primal convergence from dual subgradient methods for convex optimization , 2014, Mathematical Programming.

[36]  Patrice Marcotte,et al.  Weak Sharp Solutions of Variational Inequalities , 1998, SIAM J. Optim..

[37]  Angelia Nedic,et al.  Regularized Iterative Stochastic Approximation Methods for Stochastic Variational Inequality Problems , 2013, IEEE Transactions on Automatic Control.

[38]  Mark Broadie,et al.  General Bounds and Finite-Time Improvement for the Kiefer-Wolfowitz Stochastic Approximation Algorithm , 2011, Oper. Res..

[39]  Uday V. Shanbhag,et al.  On the Characterization of Solution Sets of Smooth and Nonsmooth Convex Stochastic Nash Games , 2011, SIAM J. Optim..

[40]  Shu Lu Symmetric Confidence Regions and Confidence Intervals for Normal Map Formulations of Stochastic Variational Inequalities , 2014, SIAM J. Optim..

[41]  Daniela Pucci de Farias,et al.  Decentralized Resource Allocation in Dynamic Networks of Agents , 2008, SIAM J. Optim..

[42]  Martin J. Wainwright,et al.  Randomized Smoothing for Stochastic Optimization , 2011, SIAM J. Optim..

[43]  Benjamin F. Hobbs,et al.  Nash-Cournot Equilibria in Power Markets on a Linearized DC Network with Arbitrage: Formulations and Properties , 2003 .