Models as Approximations, Part I: A Conspiracy of Nonlinearity and Random Regressors in Linear Regression

More than thirty years ago Halbert White inaugurated a "model-robust" form of statistical inference based on the "sandwich estimator" of standard error. This estimator is known to be "heteroskedasticity-consistent", but it is less well-known to be "nonlinearity-consistent" as well. Nonlinearity, however, raises fundamental issues because regressors are no longer ancillary, hence can't be treated as fixed. The consequences are severe: (1)~the regressor distribution affects the slope parameters, and (2)~randomness of the regressors conspires with the nonlinearity to create sampling variability in slope estimates --- even in the complete absence of error. For these observations to make sense it is necessary to re-interpret population slopes and view them not as parameters in a generative model but as statistical functionals associated with OLS fitting as it applies to largely arbitrary joint $\xy$~distributions. In such a "model-robust" approach to linear regression, the meaning of slope parameters needs to be rethought and inference needs to be based on model-robust standard errors that can be estimated with sandwich plug-in estimators or with the $\xy$~bootstrap. Theoretically, model-robust and model-trusting standard errors can deviate by arbitrary magnitudes either way. In practice, a diagnostic test can be used to detect significant deviations on a per-slope basis.

[1]  A. Buja,et al.  Statistica Sinica Preprint No : SS-2016-0546 R 1 Title Calibrated Percentile Double Bootstrap For Robust Linear Regression Inference , 2017 .

[2]  Sara van de Geer,et al.  High-dimensional inference in misspecified linear models , 2015, 1503.06426.

[3]  D. Donoho,et al.  Variance Breakdown of Huber (M)-estimators: $n/p \rightarrow m \in (1,\infty)$ , 2015, 1503.02106.

[4]  Po-Ling Loh,et al.  Statistical consistency and asymptotic normality for high-dimensional robust M-estimators , 2015, ArXiv.

[5]  Laurie Davies,et al.  Data Analysis and Approximate Models , 2015 .

[6]  D. Donoho,et al.  Variance Breakdown of Huber ( M )-estimators : n / p → m ∈ ( 1 , ∞ ) , 2015 .

[7]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[8]  Dennis L. Sun,et al.  Exact post-selection inference, with application to the lasso , 2013, 1311.6238.

[9]  Anthony O'Hagan,et al.  Bayesian inference with misspecified models: Inference about what? , 2013 .

[10]  Stephen G. Walker,et al.  Bayesian inference with misspecified models , 2013 .

[11]  P. Bickel,et al.  Optimal M-estimation in high-dimensional regression , 2013, Proceedings of the National Academy of Sciences.

[12]  A. Buja,et al.  Valid post-selection inference , 2013, 1306.1059.

[13]  Jon Wakefield,et al.  Bayesian sandwich posteriors for pseudo-true parameters , 2012 .

[14]  F. Götze,et al.  RESAMPLING FEWER THAN n OBSERVATIONS: GAINS, LOSSES, AND REMEDIES FOR LOSSES , 2012 .

[15]  L. Wasserman Low Assumptions, High Dimensions , 2011 .

[16]  Thomas Lumley,et al.  Model-Robust Regression and a Bayesian `Sandwich' Estimator , 2010, 1101.1402.

[17]  Donald Ylvisaker,et al.  Counting the Homeless in Los Angeles County , 2008, 0805.2840.

[18]  A. Gelman,et al.  Splitting a Predictor at the Upper Quarter or Third and the Lower Quarter or Third , 2007 .

[19]  M. Kenward,et al.  An Introduction to the Bootstrap , 2007 .

[20]  D. Freedman,et al.  On The So-Called “Huber Sandwich Estimator” and “Robust Standard Errors” , 2006 .

[21]  Donald Hedeker,et al.  Longitudinal Data Analysis , 2006 .

[22]  J. Aldrich Fisher and Regression , 2005 .

[23]  J. Fox Bootstrapping Regression Models , 2002 .

[24]  R. Carroll,et al.  A Note on the Efficiency of Sandwich Covariance Matrix Estimation , 2001 .

[25]  B. Everitt,et al.  Analysis of longitudinal data , 1998, British Journal of Psychiatry.

[26]  E. Mammen Empirical process of residuals for high-dimensional linear models , 1996 .

[27]  Paul D. Allison,et al.  The Impact of Random Predictors on Comparisons of Coefficients Between Models: Comment on Clogg, Petkova, and Haritou , 1995, American Journal of Sociology.

[28]  C. Clogg,et al.  Statistical Methods for Comparing Regression Coefficients Between Models , 1995, American Journal of Sociology.

[29]  R. Tibshirani,et al.  Generalized additive models for medical research , 1995, Statistical methods in medical research.

[30]  Joseph P. Romano,et al.  Large Sample Confidence Regions Based on Subsamples under Minimal Assumptions , 1994 .

[31]  Halbert White,et al.  Estimation, inference, and specification analysis , 1996 .

[32]  E. Mammen Bootstrap and Wild Bootstrap for High Dimensional Linear Models , 1993 .

[33]  L. Breiman,et al.  Submodel selection and evaluation in regression. The X-random case , 1992 .

[34]  P. Hall The Bootstrap and Edgeworth Expansion , 1992 .

[35]  M. Berman A theorem of Jacobi and its generalization , 1988 .

[36]  John Law,et al.  Robust Statistics—The Approach Based on Influence Functions , 1986 .

[37]  Changbao Wu,et al.  Jackknife, Bootstrap and Other Resampling Methods in Regression Analysis , 1986 .

[38]  W. Newey,et al.  A Simple, Positive Semi-Definite, Heteroskedasticity and Autocorrelationconsistent Covariance Matrix , 1986 .

[39]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[40]  N. Weber,et al.  The jackknife and heteroskedasticity: Consistent variance estimation for regression models , 1986 .

[41]  H. White,et al.  Some heteroskedasticity-consistent covariance matrix estimators with improved finite sample properties☆ , 1985 .

[42]  B. Efron,et al.  The Jackknife: The Bootstrap and Other Resampling Plans. , 1983 .

[43]  J. A. Hartigan,et al.  Asymptotic Normality of Posterior Distributions , 1983 .

[44]  R. Welsch,et al.  Efficient Bounded-Influence Regression Estimation , 1982 .

[45]  L. Hansen Large Sample Properties of Generalized Method of Moments Estimators , 1982 .

[46]  J. Kent Robust properties of likelihood ratio tests , 1982 .

[47]  H. White Consequences and Detection of Misspecified Nonlinear Regression Models , 1981 .

[48]  H. White A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity , 1980 .

[49]  H. White Using Least Squares to Approximate Unknown Regression Functions , 1980 .

[50]  G. Box Robustness in the Strategy of Scientific Model Building. , 1979 .

[51]  D. Hinkley Jackknifing in Unbalanced Situations , 1977 .

[52]  R. Berk,et al.  CONSISTENCY A POSTERIORI , 1970 .

[53]  P. J. Huber The behavior of maximum likelihood estimates under nonstandard conditions , 1967 .

[54]  R. Berk,et al.  Limiting Behavior of Posterior Distributions when the Model is Incorrect , 1966 .

[55]  D. Cox,et al.  An Analysis of Transformations , 1964 .

[56]  F. Eicker Asymptotic Normality and Consistency of the Least Squares Estimators for Families of Linear Regressions , 1963 .