Constrained Non-convex Optimization via Stochastic Variance Reduced Approximations

Recently, stochastic variance reduced gradient (SVRG) and related methods have made significant advancements in solving finite sum non-convex problems but limiting their applicability to convex constraints. In contrast, we study the same problem subjected to non-convex constraints and analyze stochastic variance reduced framework under mild assumptions. Presence of non-convex constraints complicates the problem and is challenging. For instance, preserving the feasibility of the iterates. To this end, we propose to fuse the stochastic variance reduced framework with successive convex approximation methods henceforth, solving a convex optimization problem at each iteration while maintaining iterate feasibility. Moreover, we prove that the proposed algorithm generates a convergent sequence under mild assumptions. Experimental evaluations are provided on node localization problem to demonstrate that it is indeed faster than the state-of-the-art non-convex optimization methods.

[1]  Gordon P. Wright,et al.  Technical Note - A General Inner Approximation Algorithm for Nonconvex Mathematical Programs , 1978, Oper. Res..

[2]  Daniel Pérez Palomar,et al.  Power Control By Geometric Programming , 2007, IEEE Transactions on Wireless Communications.

[3]  Peter L. Bartlett,et al.  Exponentiated Gradient Algorithms for Conditional Random Fields and Max-Margin Markov Networks , 2008, J. Mach. Learn. Res..

[4]  Georgios B. Giannakis,et al.  Sparsity-Exploiting Robust Multidimensional Scaling , 2012, IEEE Transactions on Signal Processing.

[5]  Marc Pollefeys,et al.  CamOdoCal: Automatic intrinsic and extrinsic calibration of a rig with multiple generic cameras and odometry , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[6]  Saeed Ghadimi,et al.  Stochastic First- and Zeroth-Order Methods for Nonconvex Stochastic Programming , 2013, SIAM J. Optim..

[7]  Zhi-Quan Luo,et al.  A Unified Convergence Analysis of Block Successive Minimization Methods for Nonsmooth Optimization , 2012, SIAM J. Optim..

[8]  Neha Mehra,et al.  Survey on Multiclass Classification Methods , 2013 .

[9]  Francisco Facchinei,et al.  Decomposition by Partial Linearization: Parallel Optimization of Multi-Agent Systems , 2013, IEEE Transactions on Signal Processing.

[10]  Francisco Facchinei,et al.  Parallel and distributed methods for nonconvex optimization , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[11]  Francisco Facchinei,et al.  Parallel Selective Algorithms for Nonconvex Big Data Optimization , 2014, IEEE Transactions on Signal Processing.

[12]  Wei Wang,et al.  Recommender system application developments: A survey , 2015, Decis. Support Syst..

[13]  Francisco Facchinei,et al.  Parallel and Distributed Methods for Nonconvex Optimization-Part II: Applications , 2016, ArXiv.

[14]  Alexander J. Smola,et al.  Stochastic Frank-Wolfe methods for nonconvex optimization , 2016, 2016 54th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[15]  Alexander J. Smola,et al.  Stochastic Variance Reduction for Nonconvex Optimization , 2016, ICML.

[16]  Sandeep Kumar,et al.  Stochastic Multidimensional Scaling , 2016, IEEE Transactions on Signal and Information Processing over Networks.

[17]  Prabhu Babu,et al.  Majorization-Minimization Algorithms in Signal Processing, Communications, and Machine Learning , 2017, IEEE Transactions on Signal Processing.

[18]  S. Arora,et al.  Node Localization in Wireless Sensor Networks Using Butterfly Optimization Algorithm , 2017, Arabian Journal for Science and Engineering.

[19]  Vincent K. N. Lau,et al.  Stochastic Successive Convex Approximation for Non-Convex Constrained Stochastic Optimization , 2018, IEEE Transactions on Signal Processing.