Paper information - Confidence intervals and hypothesis testing for high-dimensional regression

Confidence intervals and hypothesis testing for high-dimensional regression

Fitting high-dimensional statistical models often requires the use of non-linear parameter estimation procedures. As a consequence, it is generally impossible to obtain an exact characterization of the probability distribution of the parameter estimates. This in turn implies that it is extremely challenging to quantify the uncertainty associated with a certain parameter estimate. Concretely, no commonly accepted procedure exists for computing classical measures of uncertainty and statistical significance, such as confidence intervals or p-values, for these models. Here we consider the high-dimensional linear regression problem and propose an efficient algorithm for constructing confidence intervals and p-values. The resulting confidence intervals have nearly optimal size. When testing the null hypothesis that a certain parameter vanishes, our method has nearly optimal power. Our approach is based on constructing a 'de-biased' version of regularized M-estimators. The new construction improves over recent work in the field in that it does not assume a special structure on the design matrix. We test our method on synthetic data and on a high-throughput genomic data set about riboflavin production rate, made publicly available by Bühlmann et al. (2014).
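The abstract does not spell out the construction, so the following is only a minimal sketch of the general idea of de-biasing a Lasso estimate, not the authors' algorithm. It assumes the standard correction of the form theta_u = theta_hat + (1/n) M X^T (y - X theta_hat), where M approximates the inverse of the sample covariance. To stay self-contained, it makes several simplifying assumptions: the demo uses n > p so that M can be the exact inverse of the sample covariance (for p > n, M must instead come from a separate procedure not reproduced here), the noise level `sigma` is treated as known, and the Lasso is solved with a plain proximal-gradient loop. All function names are hypothetical.

```python
import math
from statistics import NormalDist

import numpy as np


def lasso_ista(X, y, lam, n_iter=500):
    """Proximal-gradient (ISTA) solver for (1/2n)||y - Xb||^2 + lam * ||b||_1."""
    n, p = X.shape
    # Step size = 1 / Lipschitz constant of the smooth part's gradient.
    step = n / (np.linalg.norm(X, 2) ** 2)
    b = np.zeros(p)
    for _ in range(n_iter):
        grad = -X.T @ (y - X @ b) / n
        z = b - step * grad
        # Soft-thresholding is the proximal operator of the l1 penalty.
        b = np.sign(z) * np.maximum(np.abs(z) - step * lam, 0.0)
    return b


def debias(X, y, theta_hat, M):
    """Add the correction (1/n) M X^T (y - X theta_hat) to the Lasso estimate."""
    n = X.shape[0]
    return theta_hat + M @ X.T @ (y - X @ theta_hat) / n


def standard_errors(X, M, sigma):
    """Per-coordinate standard errors sigma * sqrt([M Sigma_hat M^T]_ii / n)."""
    n = X.shape[0]
    sigma_hat = X.T @ X / n
    return sigma * np.sqrt(np.diag(M @ sigma_hat @ M.T) / n)


def confidence_intervals(theta_u, se, alpha=0.05):
    """Two-sided normal-approximation intervals at level 1 - alpha."""
    z = NormalDist().inv_cdf(1 - alpha / 2)
    return theta_u - z * se, theta_u + z * se


def p_values(theta_u, se):
    """Two-sided p-values for H0: theta_i = 0, i.e. 2 * (1 - Phi(|z_i|))."""
    return np.array([math.erfc(abs(t) / (s * math.sqrt(2)))
                     for t, s in zip(theta_u, se)])


# Synthetic demo (assumed setup: Gaussian design, known noise level).
rng = np.random.default_rng(0)
n, p, sigma = 200, 10, 1.0
X = rng.standard_normal((n, p))
theta0 = np.zeros(p)
theta0[:3] = [2.0, -1.5, 1.0]
y = X @ theta0 + sigma * rng.standard_normal(n)

theta_hat = lasso_ista(X, y, lam=0.1)
M = np.linalg.inv(X.T @ X / n)  # exact inverse; only possible because n > p
theta_u = debias(X, y, theta_hat, M)
se = standard_errors(X, M, sigma)
lower, upper = confidence_intervals(theta_u, se)
p_vals = p_values(theta_u, se)
```

With the exact inverse used here, the correction cancels the Lasso shrinkage entirely and `theta_u` coincides with ordinary least squares, which gives a convenient sanity check. The high-dimensional case is interesting precisely because no exact inverse exists and M has to be chosen more carefully.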

Adel Javanmard | A. Montanari

[1] Peter Bühlmann,et al. High-Dimensional Statistics with a View Toward Applications in Biology , 2014 .

[2] A. Belloni,et al. HONEST CONFIDENCE REGIONS FOR A REGRESSION PARAMETER IN LOGISTIC REGRESSION WITH A LARGE NUMBER OF CONTROLS , 2013 .

[3] Andrea Montanari,et al. Estimating LASSO Risk and Noise Level , 2013, NIPS.

[4] R. Tibshirani,et al. A Study of Error Variance Estimation in Lasso Regression , 2013, 1311.5274.

[5] Adel Javanmard,et al. Nearly optimal sample size in hypothesis testing for high-dimensional regression , 2013, 2013 51st Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[6] Harrison H. Zhou,et al. Asymptotic normality and optimalities in estimation of large Gaussian graphical models , 2013, 1309.6024.

[7] S. Geer,et al. On asymptotically optimal confidence regions and tests for high-dimensional models , 2013, 1303.0518.

[8] R. Tibshirani,et al. A SIGNIFICANCE TEST FOR THE LASSO. , 2013, Annals of statistics.

[9] Adel Javanmard,et al. Hypothesis Testing in High-Dimensional Regression Under the Gaussian Random Design Model: Asymptotic Theory , 2013, IEEE Transactions on Information Theory.

[10] Christian P. Robert,et al. Large-scale inference , 2010 .

[11] Lee H. Dicker,et al. Residual variance and the signal-to-noise ratio in high-dimensional linear models , 2012, 1209.0012.

[12] Isaac Dialsingh,et al. Large-scale inference: empirical Bayes methods for estimation, testing, and prediction , 2012 .

[13] Peter Bühlmann. Statistical significance in high-dimensional linear models , 2012, 1202.1377.

[14] A. Belloni,et al. Inference on Treatment Effects after Selection Amongst High-Dimensional Controls , 2011, 1201.0224.

[15] Lu Tian,et al. A Perturbation Method for Inference on Regularized Regression Estimates , 2011, Journal of the American Statistical Association.

[16] Cun-Hui Zhang,et al. Confidence intervals for low dimensional parameters in high dimensional linear models , 2011, 1110.2563.

[17] Sara van de Geer,et al. Statistics for High-Dimensional Data: Methods, Theory and Applications , 2011 .

[18] Shuheng Zhou,et al. Reconstruction from Anisotropic Random Measurements , 2012, 25th Annual Conference on Learning Theory.

[19] Cun-Hui Zhang,et al. Scaled sparse linear regression , 2011, 1104.4595.

[20] Roman Vershynin,et al. Introduction to the non-asymptotic analysis of random matrices , 2010, Compressed Sensing.

[21] S. Geer,et al. ℓ1-penalization for mixture regression models , 2010, 1202.6046.

[22] Jianqing Fan,et al. Variance estimation using refitted cross‐validation in ultrahigh dimensional regression , 2010, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[23] Cun-Hui Zhang. Nearly unbiased variable selection under minimax concave penalty , 2010, 1002.4734.

[24] Trevor Hastie,et al. Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[25] Martin J. Wainwright,et al. A unified framework for high-dimensional analysis of $M$-estimators with decomposable regularizers , 2009, NIPS.

[26] Yichao Wu,et al. Ultrahigh Dimensional Feature Selection: Beyond The Linear Model , 2009, J. Mach. Learn. Res..

[27] S. Geer,et al. On the conditions used to prove oracle results for the Lasso , 2009, 0910.0722.

[28] Yehuda Koren,et al. Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[29] Martin J. Wainwright,et al. Sharp Thresholds for High-Dimensional and Noisy Sparsity Recovery Using $\ell _{1}$ -Constrained Quadratic Programming (Lasso) , 2009, IEEE Transactions on Information Theory.

[30] Ji Zhu,et al. Regularized Multivariate Regression for Identifying Master Predictors with Application to Integrative Genomics Study of Breast Cancer. , 2008, The annals of applied statistics.

[31] Peter Bühlmann,et al. p-Values for High-Dimensional Regression , 2008, 0811.2177.

[32] N. Meinshausen,et al. Stability selection , 2008, 0809.2932.

[33] M. Lustig,et al. Compressed Sensing MRI , 2008, IEEE Signal Processing Magazine.

[34] P. Bickel,et al. SIMULTANEOUS ANALYSIS OF LASSO AND DANTZIG SELECTOR , 2008, 0801.1095.

[35] E. Candès,et al. Near-ideal model selection by ℓ1 minimization , 2008, 0801.0345.

[36] L. Wasserman,et al. HIGH DIMENSIONAL VARIABLE SELECTION. , 2007, Annals of statistics.

[37] Jianqing Fan,et al. Sure independence screening for ultrahigh dimensional feature space , 2006, math/0612857.

[38] Peng Zhao,et al. On Model Selection Consistency of Lasso , 2006, J. Mach. Learn. Res..

[39] N. Meinshausen,et al. High-dimensional graphs and variable selection with the Lasso , 2006, math/0608017.

[40] E. Candès,et al. The Dantzig selector: Statistical estimation when P is much larger than n , 2005, math/0506081.

[41] Emmanuel J. Candès,et al. Decoding by linear programming , 2005, IEEE Transactions on Information Theory.

[42] Y. Ritov,et al. Persistence in high-dimensional linear predictor selection and the virtue of overparametrization , 2004 .

[43] Larry Wasserman,et al. All of Statistics: A Concise Course in Statistical Inference , 2004 .

[44] Jianqing Fan,et al. Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties , 2001 .

[45] Xiaoming Huo,et al. Uncertainty principles and ideal atomic decomposition , 2001, IEEE Trans. Inf. Theory.

[46] Scott Chen,et al. Examples of basis pursuit , 1995, Optics + Photonics.

[47] Stéphane Mallat,et al. Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..

[48] R. Tibshirani. A significance test for the lasso , 2014 .

[49] Patrick Seemann,et al. Matrix Factorization Techniques for Recommender Systems , 2014 .

[50] Adel Javanmard,et al. Confidence Intervals and Hypothesis Testing for High-Dimensional Statistical Models , 2013 .

[51] Sara van de Geer,et al. Statistics for High-Dimensional Data , 2011 .

[52] R. Tibshirani. Regression Shrinkage and Selection via the Lasso , 1996 .

[53] N. S. Barnett,et al. Private communication , 1969 .

[54] E. L. Lehmann,et al. Theory of point estimation , 1950 .

[55] A. Belloni,et al. Least Squares after Model Selection in High-dimensional Sparse Models , 2022, Massachusetts Institute of Technology Department of Economics Working Paper Series.