论文信息 - Models as Approximations, Part I: A Conspiracy of Nonlinearity and Random Regressors in Linear Regression - 字舞流文

Models as Approximations, Part I: A Conspiracy of Nonlinearity and Random Regressors in Linear Regression

More than thirty years ago Halbert White inaugurated a "model-robust" form of statistical inference based on the "sandwich estimator" of standard error. This estimator is known to be "heteroskedasticity-consistent", but it is less well-known to be "nonlinearity-consistent" as well. Nonlinearity, however, raises fundamental issues because regressors are no longer ancillary, hence can't be treated as fixed. The consequences are severe: (1)~the regressor distribution affects the slope parameters, and (2)~randomness of the regressors conspires with the nonlinearity to create sampling variability in slope estimates --- even in the complete absence of error. For these observations to make sense it is necessary to re-interpret population slopes and view them not as parameters in a generative model but as statistical functionals associated with OLS fitting as it applies to largely arbitrary joint $\xy$~distributions. In such a "model-robust" approach to linear regression, the meaning of slope parameters needs to be rethought and inference needs to be based on model-robust standard errors that can be estimated with sandwich plug-in estimators or with the $\xy$~bootstrap. Theoretically, model-robust and model-trusting standard errors can deviate by arbitrary magnitudes either way. In practice, a diagnostic test can be used to detect significant deviations on a per-slope basis.

A. Buja | L. Brown | E. George | M. Traskin | R. Berk | Linda H. Zhao | E. Pitkin | K. Zhan | K. Zhan | R. Berk | L. Brown | E. George | L. Zhao

[1] A. Buja,et al. Statistica Sinica Preprint No : SS-2016-0546 R 1 Title Calibrated Percentile Double Bootstrap For Robust Linear Regression Inference , 2017 .

[2] Sara van de Geer,et al. High-dimensional inference in misspecified linear models , 2015, 1503.06426.

[3] D. Donoho,et al. Variance Breakdown of Huber (M)-estimators: $n/p \rightarrow m \in (1,\infty)$ , 2015, 1503.02106.

[4] Po-Ling Loh,et al. Statistical consistency and asymptotic normality for high-dimensional robust M-estimators , 2015, ArXiv.

[5] Laurie Davies,et al. Data Analysis and Approximate Models , 2015 .

[6] D. Donoho,et al. Variance Breakdown of Huber ( M )-estimators : n / p → m ∈ ( 1 , ∞ ) , 2015 .

[7] R Core Team,et al. R: A language and environment for statistical computing. , 2014 .

[8] Dennis L. Sun,et al. Exact post-selection inference, with application to the lasso , 2013, 1311.6238.

[9] Anthony O'Hagan,et al. Bayesian inference with misspecified models: Inference about what? , 2013 .

[10] Stephen G. Walker,et al. Bayesian inference with misspecified models , 2013 .

[11] P. Bickel,et al. Optimal M-estimation in high-dimensional regression , 2013, Proceedings of the National Academy of Sciences.

[12] A. Buja,et al. Valid post-selection inference , 2013, 1306.1059.

[13] Jon Wakefield,et al. Bayesian sandwich posteriors for pseudo-true parameters , 2012 .

[14] F. Götze,et al. RESAMPLING FEWER THAN n OBSERVATIONS: GAINS, LOSSES, AND REMEDIES FOR LOSSES , 2012 .

[15] L. Wasserman. Low Assumptions, High Dimensions , 2011 .

[16] Thomas Lumley,et al. Model-Robust Regression and a Bayesian `Sandwich' Estimator , 2010, 1101.1402.

[17] Donald Ylvisaker,et al. Counting the Homeless in Los Angeles County , 2008, 0805.2840.

[18] A. Gelman,et al. Splitting a Predictor at the Upper Quarter or Third and the Lower Quarter or Third , 2007 .

[19] M. Kenward,et al. An Introduction to the Bootstrap , 2007 .

[20] D. Freedman,et al. On The So-Called “Huber Sandwich Estimator” and “Robust Standard Errors” , 2006 .

[21] Donald Hedeker,et al. Longitudinal Data Analysis , 2006 .

[22] J. Aldrich. Fisher and Regression , 2005 .

[23] J. Fox. Bootstrapping Regression Models , 2002 .

[24] R. Carroll,et al. A Note on the Efficiency of Sandwich Covariance Matrix Estimation , 2001 .

[25] B. Everitt,et al. Analysis of longitudinal data , 1998, British Journal of Psychiatry.

[26] E. Mammen. Empirical process of residuals for high-dimensional linear models , 1996 .

[27] Paul D. Allison,et al. The Impact of Random Predictors on Comparisons of Coefficients Between Models: Comment on Clogg, Petkova, and Haritou , 1995, American Journal of Sociology.

[28] C. Clogg,et al. Statistical Methods for Comparing Regression Coefficients Between Models , 1995, American Journal of Sociology.

[29] R. Tibshirani,et al. Generalized additive models for medical research , 1995, Statistical methods in medical research.

[30] Joseph P. Romano,et al. Large Sample Confidence Regions Based on Subsamples under Minimal Assumptions , 1994 .

[31] Halbert White,et al. Estimation, inference, and specification analysis , 1996 .

[32] E. Mammen. Bootstrap and Wild Bootstrap for High Dimensional Linear Models , 1993 .

[33] L. Breiman,et al. Submodel selection and evaluation in regression. The X-random case , 1992 .

[34] P. Hall. The Bootstrap and Edgeworth Expansion , 1992 .

[35] M. Berman. A theorem of Jacobi and its generalization , 1988 .

[36] John Law,et al. Robust Statistics—The Approach Based on Influence Functions , 1986 .

[37] Changbao Wu,et al. Jackknife, Bootstrap and Other Resampling Methods in Regression Analysis , 1986 .

[38] W. Newey,et al. A Simple, Positive Semi-Definite, Heteroskedasticity and Autocorrelationconsistent Covariance Matrix , 1986 .

[39] S. Zeger,et al. Longitudinal data analysis using generalized linear models , 1986 .

[40] N. Weber,et al. The jackknife and heteroskedasticity: Consistent variance estimation for regression models , 1986 .

[41] H. White,et al. Some heteroskedasticity-consistent covariance matrix estimators with improved finite sample properties☆ , 1985 .

[42] B. Efron,et al. The Jackknife: The Bootstrap and Other Resampling Plans. , 1983 .

[43] J. A. Hartigan,et al. Asymptotic Normality of Posterior Distributions , 1983 .

[44] R. Welsch,et al. Efficient Bounded-Influence Regression Estimation , 1982 .

[45] L. Hansen. Large Sample Properties of Generalized Method of Moments Estimators , 1982 .

[46] J. Kent. Robust properties of likelihood ratio tests , 1982 .

[47] H. White. Consequences and Detection of Misspecified Nonlinear Regression Models , 1981 .

[48] H. White. A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity , 1980 .

[49] H. White. Using Least Squares to Approximate Unknown Regression Functions , 1980 .

[50] G. Box. Robustness in the Strategy of Scientific Model Building. , 1979 .

[51] D. Hinkley. Jackknifing in Unbalanced Situations , 1977 .

[52] R. Berk,et al. CONSISTENCY A POSTERIORI , 1970 .

[53] P. J. Huber. The behavior of maximum likelihood estimates under nonstandard conditions , 1967 .

[54] R. Berk,et al. Limiting Behavior of Posterior Distributions when the Model is Incorrect , 1966 .

[55] D. Cox,et al. An Analysis of Transformations , 1964 .

[56] F. Eicker. Asymptotic Normality and Consistency of the Least Squares Estimators for Families of Linear Regressions , 1963 .