Least Angle Regression and LASSO for Large Datasets

Least-Angle Regression and the LASSO (ℓ1-penalized regression) offer a number of advantages in variable selection applications over procedures such as stepwise or ridge regression, including prediction accuracy, stability, and interpretability. We discuss formulations of these algorithms that extend to datasets in which the number of observations is so large that the matrix of predictors cannot be accessed as a unit in computations. Our methods require only a single pass through the data for an orthogonal transformation, effectively reducing the dimension of the computations required to obtain the regression coefficients and residual sums of squares to the number of predictors, rather than the number of observations.
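The single-pass dimension reduction described above can be illustrated with a row-chunked QR decomposition of the augmented matrix [X | y]: after one pass, only a (p+1)-by-(p+1) triangular factor need be retained, from which the least-squares coefficients and residual sum of squares are recoverable. The sketch below is an assumption-laden illustration of this general idea (the function names, chunking scheme, and use of repeated NumPy QR factorizations are illustrative, not the authors' exact algorithm):

```python
# Hypothetical sketch of one-pass dimension reduction via QR of [X | y].
# Only the (p+1) x (p+1) triangular factor R is kept; the rows of the
# data are read once and never revisited. This is NOT the paper's exact
# algorithm, just a minimal demonstration of the underlying identity.
import numpy as np

def streaming_triangular_factor(chunks):
    """Accumulate the R factor of a QR decomposition of [X | y],
    reading the rows once in chunks (each chunk: an (n_i, p) X block
    and an (n_i,) y block)."""
    R = None
    for X_chunk, y_chunk in chunks:
        block = np.column_stack([X_chunk, y_chunk])
        stacked = block if R is None else np.vstack([R, block])
        # Re-triangularize; the cost depends on p, not on total rows seen.
        R = np.linalg.qr(stacked, mode="r")
    return R

def coef_and_rss(R):
    """Recover OLS coefficients and the residual sum of squares from R alone:
    with R = [[R11, r12], [0, r22]], beta = R11^{-1} r12 and RSS = r22^2."""
    p = R.shape[1] - 1
    beta = np.linalg.solve(R[:p, :p], R[:p, p])
    rss = R[p, p] ** 2
    return beta, rss

# Verify against a direct least-squares fit on synthetic data.
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 4))
y = X @ np.array([1.0, -2.0, 0.5, 0.0]) + rng.normal(size=1000)
chunks = [(X[i:i + 100], y[i:i + 100]) for i in range(0, 1000, 100)]
R = streaming_triangular_factor(chunks)
beta, rss = coef_and_rss(R)
beta_ref, resid_ref, *_ = np.linalg.lstsq(X, y, rcond=None)
assert np.allclose(beta, beta_ref)
assert np.isclose(rss, resid_ref[0])
```

Because the LARS/LASSO path depends on the data only through inner products of the predictors and response, such a reduced triangular factor suffices for the entire coefficient path, which is what makes the observation count drop out of the per-step cost.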