On the pervasiveness of difference-convexity in optimization and statistics

With the increasing interest in applying the methodology of difference-of-convex (dc) optimization to diverse problems in engineering and statistics, this paper establishes the dc property of many functions in various areas of applications not previously known to be of this class. Motivated by a quadratic programming based recourse function in two-stage stochastic programming, we show that the (optimal) value function of a copositive (thus not necessarily convex) quadratic program is dc on the domain of finiteness of the program when the matrix in the objective function’s quadratic term and the constraint matrix are fixed. The proof of this result is based on a dc decomposition of a piecewise $$\hbox {LC}^1$$LC1 function (i.e., functions with Lipschitz gradients). Armed with these new results and known properties of dc functions existed in the literature, we show that many composite statistical functions in risk analysis, including the value-at-risk (VaR), conditional value-at-risk (CVaR), optimized certainty equivalent, and the expectation-based, VaR-based, and CVaR-based random deviation functionals are all dc. Adding the known class of dc surrogate sparsity functions that are employed as approximations of the $$\ell _0$$ℓ0 function in statistical learning, our work significantly expands the classes of dc functions and positions them for fruitful applications.

[1]  Nguyen Dong Yen,et al.  Quadratic Programming and Affine Variational Inequalities: A Qualitative Study , 2005 .

[2]  Rémi Abgrall,et al.  Computations of compressible multifluids , 2001 .

[3]  Jing Hu,et al.  An LPCC approach to nonconvex quadratic programs , 2012, Math. Program..

[4]  M. Teboulle,et al.  Expected Utility, Penalty Functions, and Duality in Stochastic Nonlinear Programming , 1986 .

[5]  M. Teboulle,et al.  AN OLD‐NEW CONCEPT OF CONVEX RISK MEASURES: THE OPTIMIZED CERTAINTY EQUIVALENT , 2007 .

[6]  Franco Giannessi,et al.  Nonconvex Quadratic Programs, Linear Complementarity Problems, and Integer Linear Programs , 1973, Optimization Techniques.

[7]  Le Thi Hoai An,et al.  DC approximation approaches for sparse optimization , 2014, Eur. J. Oper. Res..

[8]  P. Hartman On functions representable as a difference of convex functions , 1959 .

[9]  Le Thi Hoai An,et al.  DC programming and DCA: thirty years of developments , 2018, Math. Program..

[10]  Paul Tseng,et al.  Error Bound and Convergence Analysis of Matrix Splitting Algorithms for the Affine Variational Inequality Problem , 1992, SIAM J. Optim..

[11]  Jong-Shi Pang,et al.  Two-stage non-cooperative games with risk-averse players , 2017, Math. Program..

[12]  H. Zou,et al.  STRONG ORACLE OPTIMALITY OF FOLDED CONCAVE PENALIZED ESTIMATION. , 2012, Annals of statistics.

[13]  Shen Weidong,et al.  A Global Arbitrary Lagrangian-Eulerian Method for Stratified Richtmyer-Meshkov Instability , 2014, 2014 International Conference on Computational Science and Computational Intelligence.

[14]  Grégoire Allaire,et al.  A five-equation model for the simulation of interfaces between compressible fluids , 2002 .

[15]  Zhi-Quan Luo,et al.  A Unified Algorithmic Framework for Block-Structured Optimization Involving Big Data: With applications in machine learning and signal processing , 2015, IEEE Signal Processing Magazine.

[16]  Stan Uryasev,et al.  Optimality conditions in portfolio analysis with general deviation measures , 2005, Math. Program..

[17]  B. Eaves On Quadratic Programming , 1971 .

[18]  James M. Ortega,et al.  Iterative solution of nonlinear equations in several variables , 2014, Computer science and applied mathematics.

[19]  R. Rockafellar,et al.  Conditional Value-at-Risk for General Loss Distributions , 2001 .

[20]  R. Horst,et al.  DC Programming: Overview , 1999 .

[21]  J. Nash NON-COOPERATIVE GAMES , 1951, Classics in Game Theory.

[22]  Sergey Sarykalin,et al.  Value-at-Risk vs. Conditional Value-at-Risk in Risk Management and Optimization , 2008 .

[23]  R. Rockafellar,et al.  Optimization of conditional value-at risk , 2000 .

[24]  Le Thi Hoai An,et al.  The DC (Difference of Convex Functions) Programming and DCA Revisited with DC Models of Real World Nonconvex Optimization Problems , 2005, Ann. Oper. Res..

[25]  R. Rockafellar,et al.  Generalized Deviations in Risk Analysis , 2004 .

[26]  L. L. Veselý,et al.  Delta-convex mappings between Banach spaces and applications , 1989 .

[27]  Jong-Shi Pang,et al.  Computing B-Stationary Points of Nonsmooth DC Programs , 2015, Math. Oper. Res..

[28]  F. Facchinei,et al.  Finite-Dimensional Variational Inequalities and Complementarity Problems , 2003 .

[29]  Zhi-Quan Luo,et al.  A Unified Convergence Analysis of Block Successive Minimization Methods for Nonsmooth Optimization , 2012, SIAM J. Optim..

[30]  Jong-Shi Pang,et al.  A study of the difference-of-convex approach for solving linear programs with complementarity constraints , 2018, Math. Program..

[31]  S. Scholtes Piecewise Differentiable Functions , 2012 .

[32]  Jie Sun On the structure of convex piecewise quadratic functions , 1992 .

[33]  Bethany L. Nicholson,et al.  Mathematical Programs with Equilibrium Constraints , 2021, Pyomo — Optimization Modeling in Python.

[34]  Alexander Shapiro,et al.  Lectures on Stochastic Programming: Modeling and Theory , 2009 .

[35]  R. Rockafellar,et al.  The fundamental risk quadrangle in risk management, optimization and statistical estimation , 2013 .

[36]  H. Tuy Convex analysis and global optimization , 1998 .

[37]  Zeyao Mo,et al.  Parallel Flux Sweep Algorithm for Neutron Transport on Unstructured Grid , 2004, The Journal of Supercomputing.

[38]  Luděk Zajíček,et al.  On compositions of d.c. functions and mappings , 2009 .

[39]  Le Thi Hoai An,et al.  Recent Advances in DC Programming and DCA , 2013, Trans. Comput. Collect. Intell..

[40]  Nguyen Dong Yen,et al.  On the Optimal Value Function of a Linearly Perturbed Quadratic Program , 2005, J. Glob. Optim..

[41]  Jong-Shi Pang,et al.  A New Decomposition Method for Multiuser DC-Programming and Its Applications , 2014, IEEE Transactions on Signal Processing.

[42]  Jonathan M. Borwein,et al.  On difference convexity of locally Lipschitz functions , 2011 .

[43]  Jong-Shi Pang,et al.  Decomposition Methods for Computing Directional Stationary Solutions of a Class of Nonsmooth Nonconvex Optimization Problems , 2018, SIAM J. Optim..

[44]  Song Jiang,et al.  An arbitrary Lagrangian–Eulerian method based on the adaptive Riemann solvers for general equations of state , 2009 .

[45]  Akiko Takeda,et al.  DC formulations and algorithms for sparse optimization problems , 2017, Mathematical Programming.

[46]  Mingyi Hong,et al.  Local Minimizers and Second-Order Conditions in Composite Piecewise Programming via Directional Derivatives , 2017 .

[47]  Zhi-Quan Luo,et al.  A Stochastic Successive Minimization Method for Nonsmooth Nonconvex Optimization with Applications to Transceiver Design in Wireless Communication Networks , 2013, Mathematical Programming.

[48]  D. Klatte,et al.  On the Lipschitz behavior of optimal solutions in parametric problems of quadratic optimization and linear complementarity , 1985 .

[49]  J. Hiriart-Urruty Generalized Differentiability / Duality and Optimization for Problems Dealing with Differences of Convex Functions , 1985 .

[50]  Meisam Razaviyayn,et al.  Successive Convex Approximation: Analysis and Applications , 2014 .

[51]  Jack Xin,et al.  Difference-of-Convex Learning: Directional Stationarity, Optimality, and Sparsity , 2017, SIAM J. Optim..

[52]  John M. Wilson,et al.  Introduction to Stochastic Programming , 1998, J. Oper. Res. Soc..

[53]  Francisco Facchinei,et al.  Decomposition by Partial Linearization: Parallel Optimization of Multi-Agent Systems , 2013, IEEE Transactions on Signal Processing.

[54]  T. P. Dinh,et al.  Convex analysis approach to d.c. programming: Theory, Algorithm and Applications , 1997 .

[55]  H. Tuy Global Minimization of a Difference of Two Convex Functions , 1987 .

[56]  David Wozabal,et al.  Value-at-Risk optimization using the difference of convex algorithm , 2012, OR Spectr..

[57]  Zhi-Quan Luo,et al.  Parallel Successive Convex Approximation for Nonsmooth Nonconvex Optimization , 2014, NIPS.

[58]  F. Clarke Optimization And Nonsmooth Analysis , 1983 .

[59]  Jong-Shi Pang,et al.  Composite Difference-Max Programs for Modern Statistical Estimation Problems , 2018, SIAM J. Optim..

[60]  S. Ovchinnikov Max-Min Representation of Piecewise Linear Functions , 2000, math/0009026.

[61]  Marc Teboulle,et al.  Penalty Functions and Duality in Stochastic Programming Via ϕ-Divergence Functionals , 1987, Math. Oper. Res..