Post-selection adaptive inference for Least Angle Regression and the Lasso

We propose inference tools for least angle regression and the lasso, from the joint distribution of suitably normalized spacings of the LARS algorithm. From this we extend the results of the asymptotic null distribution of the “covariance test” of Lockhart et al. (2013). But we go much further, deriving exact finite sample results for a new asymptotically equivalent procedure called the “spacing test”. This provides exact conditional tests at any step of the LAR algorithm as well as “selection intervals” for the appropriate true underlying regression parameter. Remarkably, these tests and intervals account correctly for the adaptive selection done by LARS.

[1]  Dennis L. Sun,et al.  Exact inference after model selection via the Lasso , 2013 .

[2]  R. Tibshirani,et al.  The solution path of the generalized lasso , 2010, 1005.1971.

[3]  R. Tibshirani,et al.  A LASSO FOR HIERARCHICAL INTERACTIONS. , 2012, Annals of statistics.

[4]  Alexandra Chouldechova,et al.  False Discovery Rate Control for Sequential Selection Procedures, with Application to the Lasso , 2013 .

[5]  Adel Javanmard,et al.  Confidence Intervals and Hypothesis Testing for High-Dimensional Statistical Models , 2013 .

[6]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[7]  R. Tibshirani The Lasso Problem and Uniqueness , 2012, 1206.0313.

[8]  R. Tibshirani,et al.  A SIGNIFICANCE TEST FOR THE LASSO. , 2013, Annals of statistics.

[9]  Joshua R. Loftus,et al.  Inference in adaptive regression via the Kac–Rice formula , 2013, 1308.3020.

[10]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[11]  Adel Javanmard,et al.  Hypothesis Testing in High-Dimensional Regression Under the Gaussian Random Design Model: Asymptotic Theory , 2013, IEEE Transactions on Information Theory.

[12]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[13]  Peter Buhlmann Statistical significance in high-dimensional linear models , 2012, 1202.1377.

[14]  S. Geer,et al.  On asymptotically optimal confidence regions and tests for high-dimensional models , 2013, 1303.0518.

[15]  Cun-Hui Zhang,et al.  Confidence Intervals for Low-Dimensional Parameters With High-Dimensional Data , 2011 .

[16]  A. Buja,et al.  Valid post-selection inference , 2013, 1306.1059.