An Approach for Analyzing the Global Rate of Convergence of Quasi-Newton and Truncated-Newton Methods

Quasi-Newton and truncated-Newton methods are popular methods in optimization and are traditionally seen as useful alternatives to the gradient and Newton methods. Throughout the literature, results are found that link quasi-Newton methods to certain first-order methods under various assumptions. We offer a simple proof to show that a range of quasi-Newton methods are first-order methods in the definition of Nesterov. Further, we define a class of generalized first-order methods and show that the truncated-Newton method is a generalized first-order method and that first-order methods and generalized first-order methods share the same worst-case convergence rates. Further, we extend the complexity analysis for smooth strongly convex problems to finite dimensions. An implication of these results is that in a worst-case scenario, the local superlinear or faster convergence rates of quasi-Newton and truncated-Newton methods cannot be effective unless the number of iterations exceeds half the size of the problem dimension.

[1]  Marc Teboulle,et al.  A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems , 2009, SIAM J. Imaging Sci..

[2]  Peter Richtárik,et al.  Smooth minimization of nonsmooth functions with parallel coordinate descent methods , 2013, Modeling and Optimization: Theory and Applications.

[3]  G. E. Myers Properties of the conjugate-gradient and Davidon methods , 1968 .

[4]  Y. Nesterov Gradient methods for minimizing composite objective function , 2007 .

[5]  John Darzentas,et al.  Problem Complexity and Method Efficiency in Optimization , 1983 .

[6]  Dimitri P. Bertsekas,et al.  Nonlinear Programming , 1997 .

[7]  Y. Nesterov A method for unconstrained convex minimization problem with the rate of convergence o(1/k^2) , 1983 .

[8]  D. Goldfarb A family of variable-metric methods derived by variational means , 1970 .

[9]  Stephen J. Wright,et al.  Numerical Optimization , 2018, Fundamental Statistical Inference.

[10]  Yurii Nesterov,et al.  Introductory Lectures on Convex Optimization - A Basic Course , 2014, Applied Optimization.

[11]  David F. Shanno,et al.  Conjugate Gradient Methods with Inexact Searches , 1978, Math. Oper. Res..

[12]  Robert Todd Gregory,et al.  A collection of matrices for testing computational algorithms , 1969 .

[13]  Nicholas I. M. Gould,et al.  Trust Region Methods , 2000, MOS-SIAM Series on Optimization.

[14]  L. C W. Dixon,et al.  Quasi-newton algorithms generate identical points , 1972, Math. Program..

[15]  Yurii Nesterov,et al.  Smooth minimization of non-smooth functions , 2005, Math. Program..

[16]  José M. Bioucas-Dias,et al.  A New TwIST: Two-Step Iterative Shrinkage/Thresholding Algorithms for Image Restoration , 2007, IEEE Transactions on Image Processing.

[17]  R. Fletcher,et al.  Practical Methods of Optimization: Fletcher/Practical Methods of Optimization , 2000 .

[18]  Jorge Nocedal,et al.  On the limited memory BFGS method for large scale optimization , 1989, Math. Program..

[19]  C. G. Broyden Quasi-Newton methods and their application to function minimisation , 1967 .

[20]  Stephen J. Wright,et al.  Numerical Optimization (Springer Series in Operations Research and Financial Engineering) , 2000 .

[21]  William C. Davidon,et al.  Variance Algorithm for Minimization , 1968, Comput. J..

[22]  Anthony V. Fiacco,et al.  Nonlinear programming;: Sequential unconstrained minimization techniques , 1968 .

[23]  William C. Davidon,et al.  Variable Metric Method for Minimization , 1959, SIAM J. Optim..

[24]  L. Chambers Practical methods of optimization (2nd edn) , by R. Fletcher. Pp. 436. £34.95. 2000. ISBN 0 471 49463 1 (Wiley). , 2001, The Mathematical Gazette.

[25]  R. Fletcher,et al.  A New Approach to Variable Metric Algorithms , 1970, Comput. J..

[26]  H. Y. Huang Unified approach to quadratically convergent algorithms for function minimization , 1970 .

[27]  David G. Hull On the Huang Class of Variable Metric Methods , 2002 .

[28]  R. Fletcher Practical Methods of Optimization , 1988 .

[29]  D. Shanno Conditioning of Quasi-Newton Methods for Function Minimization , 1970 .

[30]  J. Nocedal Updating Quasi-Newton Matrices With Limited Storage , 1980 .

[31]  C. G. Broyden The Convergence of a Class of Double-rank Minimization Algorithms 2. The New Algorithm , 1970 .

[32]  Roger Fletcher,et al.  A Rapidly Convergent Descent Method for Minimization , 1963, Comput. J..