Natural coordinate descent algorithm for L1-penalised regression in generalised linear models

The problem of finding the maximum likelihood estimates for the regression coefficients in generalised linear models with an ℓ1 sparsity penalty is shown to be equivalent to minimising the unpenalised maximum log-likelihood function over a box whose boundary is defined by the ℓ1-penalty parameter. In one-parameter models, or when a single coefficient is estimated at a time, this result implies a generic soft-thresholding mechanism which leads to a novel coordinate descent algorithm for generalised linear models that is entirely described in terms of the natural formulation of the model and is guaranteed to converge to the true optimum. A prototype implementation for logistic regression, tested on two large-scale cancer gene expression datasets, shows that this algorithm is efficient, particularly so when a solution is computed at set values of the ℓ1-penalty parameter as opposed to along a regularisation path. Source code and test data are available from http://tmichoel.github.io/glmnat/.
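To illustrate the soft-thresholding mechanism underlying coordinate descent for ℓ1-penalised regression, here is a minimal sketch for the linear-model special case (ordinary lasso). This is not the paper's natural coordinate descent algorithm for general GLMs; it is the classic cyclic update in which each coefficient is obtained by soft-thresholding the partial correlation of its predictor with the current residual. All names (`soft_threshold`, `lasso_cd`) are illustrative.

```python
import numpy as np

def soft_threshold(z, t):
    # S(z, t) = sign(z) * max(|z| - t, 0): shrinks z towards zero by t,
    # setting it exactly to zero when |z| <= t (the source of sparsity).
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def lasso_cd(X, y, lam, n_iter=200):
    """Cyclic coordinate descent for (1/2n)||y - Xb||^2 + lam * ||b||_1."""
    n, p = X.shape
    b = np.zeros(p)
    col_sq = (X ** 2).sum(axis=0) / n  # per-column curvature (1/n) X_j^T X_j
    for _ in range(n_iter):
        for j in range(p):
            # Partial residual with coefficient j removed from the fit.
            r = y - X @ b + X[:, j] * b[j]
            rho = X[:, j] @ r / n
            # One-dimensional exact minimisation: soft-threshold, then rescale.
            b[j] = soft_threshold(rho, lam) / col_sq[j]
    return b
```

With `lam = 0` the update reduces to exact coordinate-wise least squares, and for sufficiently large `lam` every coefficient is thresholded to zero, which is the box-boundary behaviour the abstract alludes to.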
