Paper information - A proximal Newton framework for composite minimization: Graph learning without Cholesky decompositions and matrix inversions

We propose an algorithmic framework for convex minimization problems of composite functions with two terms: a self-concordant part and a possibly nonsmooth regularization part. Our method is a new proximal Newton algorithm with a local quadratic convergence rate. As a specific problem instance, we consider sparse precision matrix estimation problems in graph learning. Via a careful dual formulation and a novel analytic stepsize selection, we instantiate an algorithm within our framework for graph learning that avoids Cholesky decompositions and matrix inversions, making it attractive for parallel and distributed implementations.
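For orientation, sparse precision matrix estimation is commonly posed as minimizing -log det(Θ) + tr(ΣΘ) + ρ‖vec(Θ)‖₁ over positive definite Θ, where the first two terms form the smooth (self-concordant) part and the ℓ1 term is the nonsmooth regularizer. The following is a minimal sketch under that assumption, not the authors' algorithm: the function names `soft_threshold` and `graph_learning_objective` are hypothetical, and the objective is evaluated through an eigenvalue decomposition purely for illustration, so it does not reflect the factorization-free property claimed in the abstract.

```python
import numpy as np


def soft_threshold(X, tau):
    """Proximal operator of tau * ||X||_1: elementwise soft-thresholding."""
    return np.sign(X) * np.maximum(np.abs(X) - tau, 0.0)


def graph_learning_objective(Theta, Sigma, rho):
    """Evaluate -log det(Theta) + trace(Sigma @ Theta) + rho * sum|Theta_ij|.

    Assumes Theta and Sigma are symmetric. Returns +inf when Theta is not
    positive definite, i.e. outside the domain of the log-determinant term.
    """
    eigenvalues = np.linalg.eigvalsh(Theta)  # illustration only
    if eigenvalues.min() <= 0.0:
        return np.inf
    smooth_part = -np.sum(np.log(eigenvalues)) + np.trace(Sigma @ Theta)
    nonsmooth_part = rho * np.sum(np.abs(Theta))
    return smooth_part + nonsmooth_part
```

A proximal method would apply a step of the `soft_threshold` kind to handle the ℓ1 term, while the smooth part supplies the gradient and Hessian information used by the Newton-type model.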

Volkan Cevher | Anastasios Kyrillidis | Quoc Tran-Dinh

[1] John N. Tsitsiklis, et al. Parallel and distributed computation, 1989.

[2] Marc Teboulle, et al. A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems, 2009, SIAM J. Imaging Sci.

[3] Zhaosong Lu, et al. Adaptive First-Order Methods for General Sparse Inverse Covariance Selection, 2009, SIAM J. Matrix Anal. Appl.

[4] Yurii Nesterov, et al. Introductory Lectures on Convex Optimization - A Basic Course, 2014, Applied Optimization.

[5] Michael A. Saunders, et al. Proximal Newton-type methods for convex optimization, 2012, NIPS.

[6] Pradeep Ravikumar, et al. Sparse inverse covariance matrix estimation using quadratic approximation, 2011, MLSLP.

[7] Lu Li, et al. An inexact interior point method for L1-regularized sparse covariance selection, 2010, Math. Program. Comput.

[8] Shiqian Ma, et al. Sparse Inverse Covariance Selection via Alternating Linearization Methods, 2010, NIPS.

[9] Xiaoming Yuan, et al. Alternating Direction Method for Covariance Selection Models, 2011, Journal of Scientific Computing.

[10] Jorge Nocedal, et al. Newton-Like Methods for Sparse Inverse Covariance Estimation, 2012, NIPS.

[11] Y. Nesterov. Gradient methods for minimizing composite objective function, 2007.

[12] Arian Maleki, et al. Iterative Thresholding Algorithm for Sparse Inverse Covariance Estimation, 2012, NIPS.

[13] Katya Scheinberg, et al. IBM Research Report SINCO - A Greedy Coordinate Ascent Method for Sparse Inverse Covariance Selection Problem, 2009.

[14] Bin Yu, et al. High-dimensional covariance estimation by minimizing ℓ1-penalized log-determinant divergence, 2008, arXiv:0811.3628.

[15] Yurii Nesterov, et al. Interior-point polynomial algorithms in convex programming, 1994, SIAM Studies in Applied Mathematics.

[16] Stephen P. Boyd, et al. Convex Optimization, 2004, Algorithms and Theory of Computation Handbook.

[17] Alexandre d'Aspremont, et al. Model Selection Through Sparse Maximum Likelihood Estimation for Multivariate Gaussian or Binary Data, 2022.