论文信息 - An inexact interior point method for L1-regularized sparse covariance selection

An inexact interior point method for L1-regularized sparse covariance selection

Sparse covariance selection problems can be formulated as log-determinant (log-det) semidefinite programming (SDP) problems with large numbers of linear constraints. Standard primal–dual interior-point methods that are based on solving the Schur complement equation would encounter severe computational bottlenecks if they are applied to solve these SDPs. In this paper, we consider a customized inexact primal–dual path-following interior-point algorithm for solving large scale log-det SDP problems arising from sparse covariance selection problems. Our inexact algorithm solves the large and ill-conditioned linear system of equations in each iteration by a preconditioned iterative solver. By exploiting the structures in sparse covariance selection problems, we are able to design highly effective preconditioners to efficiently solve the large and ill-conditioned linear systems. Numerical experiments on both synthetic and real covariance selection problems show that our algorithm is highly efficient and outperforms other existing algorithms.

Lu Li | Kim-Chuan Toh | K. Toh | Lu Li

[1] R. Tyrrell Rockafellar,et al. Augmented Lagrangians and Applications of the Proximal Point Algorithm in Convex Programming , 1976, Math. Oper. Res..

[2] J. N. R. Jeffers,et al. Graphical Models in Applied Multivariate Statistics. , 1990 .

[3] R. Freund,et al. A new Krylov-subspace method for symmetric indefinite linear systems , 1994 .

[4] D. Edwards. Introduction to graphical modelling , 1995 .

[5] Kim-Chuan Toh,et al. SDPT3 -- A Matlab Software Package for Semidefinite Programming , 1996 .

[6] Steffen L. Lauritzen,et al. Graphical models in R , 1996 .

[7] A. Wathen,et al. The convergence of iterative solution methods for symmetric and indefinite linear systems , 1997 .

[8] Michael J. Todd,et al. Primal-Dual Interior-Point Methods for Self-Scaled Cones , 1998, SIAM J. Optim..

[9] Yin Zhang,et al. On Extending Some Primal-Dual Interior-Point Algorithms From Linear Programming to Semidefinite Programming , 1998, SIAM J. Optim..

[10] Stephen P. Boyd,et al. Determinant Maximization with Linear Matrix Inequality Constraints , 1998, SIAM J. Matrix Anal. Appl..

[11] Kim-Chuan Toh,et al. On the Nesterov-Todd Direction in Semidefinite Programming , 1998, SIAM J. Optim..

[12] J. Mesirov,et al. Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[13] Jeff A. Bilmes,et al. Natural statistical models for automatic speech recognition , 1999 .

[14] Ramesh A. Gopinath,et al. Model selection in acoustic modeling , 1999, EUROSPEECH.

[15] Jos F. Sturm,et al. A Matlab toolbox for optimization over symmetric cones , 1999 .

[16] Kim-Chuan Toh,et al. SDPT3 — a Matlab software package for semidefinite-quadratic-linear programming, version 3.0 , 2001 .

[17] E. Dougherty,et al. Gene-expression profiles in hereditary breast cancer. , 2001, The New England journal of medicine.

[18] Yin Zhang,et al. A computational study of a gradient-based log-barrier algorithm for a class of large-scale SDPs , 2003, Math. Program..

[19] Yousef Saad,et al. Iterative methods for sparse linear systems , 2003 .

[20] M. Pourahmadi,et al. Nonparametric estimation of large covariance matrices of longitudinal data , 2003 .

[21] R. Kohn,et al. Efficient estimation of covariance selection models , 2003 .

[22] John D. Storey,et al. Statistical significance for genomewide studies , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[23] Kim-Chuan Toh,et al. Polynomiality of an inexact infeasible interior point algorithm for semidefinite programming , 2004, Math. Program..

[24] M. West,et al. Integrated modeling of clinical and gene expression information for personalized prediction of disease outcomes. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[25] Kim-Chuan Toh,et al. Solving Large Scale Semidefinite Programs via an Iterative Solver on the Augmented Systems , 2003, SIAM J. Optim..

[26] M. West,et al. Sparse graphical models for exploring gene expression data , 2004 .

[27] P. Bühlmann,et al. Sparse graphical Gaussian modeling of the isoprenoid gene network in Arabidopsis thaliana , 2004, Genome Biology.

[28] K. Sachs,et al. Causal Protein-Signaling Networks Derived from Multiparameter Single-Cell Data , 2005, Science.

[29] Adrian E. Raftery,et al. Bayesian model averaging: development of an improved multi-class, gene selection and classification tool for microarray data , 2005, Bioinform..

[30] Yurii Nesterov,et al. Smooth minimization of non-smooth functions , 2005, Math. Program..

[31] N. Meinshausen,et al. High-dimensional graphs and variable selection with the Lasso , 2006, math/0608017.

[32] Franz Rendl,et al. A Boundary Point Method to Solve Semidefinite Programs , 2006, Computing.

[33] T. Tsuchiya,et al. An extension of the standard polynomial-time primal-dual path-following algorithm to the weighted determinant maximization problem with semidefinite constraints , 2006 .

[34] M. Yuan,et al. Model selection and estimation in the Gaussian graphical model , 2007 .