First- and second-order optimality conditions for piecewise smooth objective functions

Any piecewise smooth function that is specified by an evaluation procedure involving smooth elemental functions and piecewise linear functions like and can be represented in the so-called abs-normal form. By an extension of algorithmic, or automatic, differentiation, one can then compute certain first- and second-order derivative vectors and matrices that represent a local piecewise linearization and provide additional curvature information. On the basis of these quantities, we characterize local optimality by first- and second-order necessary and sufficient conditions, which generalize the corresponding Kuhn-Tucker-Karush (KKT) theory for smooth problems. The key assumption is the linear independence kink qualification, a generalization of Linear Independence Constraint Qualification (LICQ) familiar from nonlinear optimization. It implies that the objective has locally a so-called decomposition and renders everything tractable in terms of matrix factorizations and other simple linear algebra operations. By yielding descent directions, whenever they are violated the new optimality conditions point the way to a superlinearly convergent generalized Quadratic Program solver, which is currently under development. We exemplify the theory on two nonsmooth examples of Nesterov.

[1]  Mert Gürbüzbalaban,et al.  On Nesterov's Nonsmooth Chebyshev-Rosenbrock Functions , 2012 .

[2]  Yurii Nesterov,et al.  Lexicographic differentiation of nonsmooth functions , 2005, Math. Program..

[3]  S. Scholtes Introduction to Piecewise Differentiable Equations , 2012 .

[4]  Robert Mifflin,et al.  Primal-Dual Gradient Structured Functions: Second-Order Results; Links to Epi-Derivatives and Partly Smooth Functions , 2003, SIAM J. Optim..

[5]  Uwe Naumann,et al.  Combinatorial Scientific Computing , 2012 .

[6]  Adrian S. Lewis,et al.  Active Sets, Nonsmoothness, and Sensitivity , 2002, SIAM J. Optim..

[7]  Robert Mifflin,et al.  On VU-theory for Functions with Primal-Dual Gradient Structure , 2000, SIAM J. Optim..

[8]  Robert Mifflin,et al.  A Science Fiction Story in Nonsmooth Optimization Originating at IIASA , 2012 .

[9]  Andreas Griewank,et al.  On stable piecewise linearization and generalized algorithmic differentiation , 2013, Optim. Methods Softw..

[10]  A. Griewank,et al.  Solving piecewise linear systems in abs-normal form , 2015, 1701.00753.

[11]  Andreas Griewank,et al.  On Lipschitz optimization based on gray-box piecewise linearization , 2016, Math. Program..

[12]  F. Giannessi Variational Analysis and Generalized Differentiation , 2006 .

[13]  Richard H. Bartels,et al.  Linearly Constrained Discrete I1 Problems , 1980, TOMS.

[14]  Andrea Walther,et al.  Getting Started with ADOL-C , 2009, Combinatorial Scientific Computing.

[15]  A. Conn,et al.  Minimization Techniques for Piecewise Differentiable Functions: The l_1 Solution to an Overdetermined Linear System , 1978 .

[16]  Andreas Griewank,et al.  Evaluating derivatives - principles and techniques of algorithmic differentiation, Second Edition , 2000, Frontiers in applied mathematics.

[17]  Shuning Wang,et al.  Exact Penalty and Optimality Condition for Nonseparable Continuous Piecewise Linear Programming , 2012, J. Optim. Theory Appl..