Model-free model-fitting and predictive distributions

The problem of prediction is revisited with a view towards going beyond the typical nonparametric setting and reaching a fully model-free environment for predictive inference, i.e., point predictors and predictive intervals. A basic principle of model-free prediction is laid out based on the notion of transforming a given setup into one that is easier to work with, namely i.i.d. or Gaussian. As an application, the problem of nonparametric regression is addressed in detail; the model-free predictors are worked out, and shown to be applicable under minimal assumptions. Interestingly, model-free prediction in regression is a totally automatic technique that does not necessitate the search for an optimal data transformation before model fitting. The resulting model-free predictive distributions and intervals are compared to their corresponding model-based analogs, and the use of cross-validation is extensively discussed. As an aside, improved prediction intervals in linear regression are also obtained.
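The transformation principle in regression can be illustrated with a minimal sketch. It assumes things the abstract does not specify: a Gaussian kernel, a fixed bandwidth `h`, and a plain step-function estimator of the conditional distribution with no smoothing or bootstrap calibration. The function names (`cond_cdf`, `cond_quantile`, `model_free_predict`) are hypothetical and chosen for illustration only. Each response is mapped to an approximately uniform variable through the estimated conditional distribution at its own design point. Those transformed values are then mapped back through the estimated conditional quantile function at the new point `x_f`, which yields an empirical predictive distribution.

```python
import numpy as np

def kernel_weights(x0, x, h):
    # Normalized Gaussian kernel weights centred at x0 (kernel choice is an assumption).
    w = np.exp(-0.5 * ((x0 - x) / h) ** 2)
    return w / w.sum()

def cond_cdf(x0, y0, x, y, h):
    # Kernel estimate of P(Y <= y0 | X = x0) as a weighted empirical CDF.
    w = kernel_weights(x0, x, h)
    return float(np.sum(w * (y <= y0)))

def cond_quantile(x0, u, x, y, h):
    # Generalized inverse of the weighted empirical CDF at x0.
    w = kernel_weights(x0, x, h)
    order = np.argsort(y)
    cum = np.cumsum(w[order])
    idx = min(int(np.searchsorted(cum, u, side="left")), len(y) - 1)
    return float(y[order][idx])

def model_free_predict(x_f, x, y, h, alpha=0.1):
    # Step 1: transform each response towards an (approximately) i.i.d. uniform value.
    u = np.array([cond_cdf(xi, yi, x, y, h) for xi, yi in zip(x, y)])
    # Step 2: map the transformed values back at the new design point x_f.
    draws = np.array([cond_quantile(x_f, ui, x, y, h) for ui in u])
    # Step 3: summarize the empirical predictive distribution.
    lo, hi = np.quantile(draws, [alpha / 2, 1 - alpha / 2])
    return {
        "mean": float(draws.mean()),        # point predictor under squared-error loss
        "median": float(np.median(draws)),  # point predictor under absolute-error loss
        "interval": (float(lo), float(hi)), # uncalibrated equal-tailed interval
        "u": u,
    }

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.uniform(0.0, 1.0, 200)
    y = np.sin(2 * np.pi * x) + 0.1 * rng.standard_normal(200)
    print(model_free_predict(0.25, x, y, h=0.05))
```

No transformation of the response is searched for in this sketch, and no regression function or error distribution is specified. In practice the bandwidth would be chosen by a data-driven rule such as cross-validation, which the sketch leaves fixed.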
