Model-free Bootstrap and Conformal Prediction in Regression: Conditionality, Conjecture Testing, and Pertinent Prediction Intervals

Predictive inference under a general regression setting is gaining more interest in the big-data era. In terms of going beyond point prediction to develop prediction intervals, two main threads of development are conformal prediction and Model-free prediction. Recently, Chernozhukov et al. [2019] proposed a new conformal prediction approach exploiting the same uniformization procedure as in the Model-free Bootstrap of Politis [2015]. Hence, it is of interest to compare and further investigate the performance of the two methods. In the paper at hand, we contrast the two approaches via theoretical analysis and numerical experiments with a focus on conditional coverage of prediction intervals. We discuss suitable scenarios for applying each algorithm, underscore the importance of conditional vs. unconditional coverage, and show that, under mild conditions, the Model-free bootstrap yields prediction intervals with guaranteed better conditional coverage compared to quantile estimation. We also extend the concept of ‘pertinence’ of prediction intervals in Politis [2015] to the nonparametric regression setting, and give concrete examples where its importance emerges under finite sample scenarios. Finally, we define the new notion of ‘conjecture testing’ that is the analog of hypothesis testing as applied to the prediction problem; we also devise a modified conformal score to allow conformal prediction to handle one-sided ‘conjecture tests’, and compare to the Model-free bootstrap.

[1]  J. Neyman Outline of a Theory of Statistical Estimation Based on the Classical Theory of Probability , 1937 .

[2]  Saharon Rosset,et al.  From Fixed-X to Random-X Regression: Bias-Variance Decompositions, Covariance Penalties, and Prediction Error Estimation , 2017, Journal of the American Statistical Association.

[3]  Trevor Hastie,et al.  Cross-validation: what does it estimate and how well does it do it? , 2021, 2104.00673.

[4]  J. Robins,et al.  Distribution-Free Prediction Sets , 2013, Journal of the American Statistical Association.

[5]  D. Cox Some problems connected with statistical inference , 1958 .

[6]  A. Birnbaum On the Foundations of Statistical Inference , 1962 .

[7]  James M. Robins,et al.  Conditioning, Likelihood, and Coherence: A Review of Some Foundational Concepts , 2000 .

[8]  Halbert White,et al.  Subsampling the distribution of diverging statistics with applications to finance , 2004 .

[9]  Jagdish K. Patel,et al.  Prediction intervals - a review , 1989 .

[10]  G. A. Young,et al.  The bootstrap: To smooth or not to smooth? , 1987 .

[11]  Dimitris N. Politis,et al.  Asymptotic validity of bootstrap confidence intervals in nonparametric regression without an additive model , 2021, Electronic Journal of Statistics.

[12]  Larry Wasserman,et al.  Distribution‐free prediction bands for non‐parametric regression , 2014 .

[13]  D. Politis Model-Free Prediction and Regression: A Transformation-Based Approach to Inference , 2015 .

[14]  Vladimir Vovk,et al.  A tutorial on conformal prediction , 2007, J. Mach. Learn. Res..

[15]  M. Lawera Predictive inference : an introduction , 1995 .

[16]  Dimitris N. Politis,et al.  Model-free model-fitting and predictive distributions , 2010 .

[17]  S. Resnick Heavy-Tail Phenomena: Probabilistic and Statistical Modeling , 2006 .

[18]  Qi Li,et al.  Nonparametric Econometrics: Theory and Practice , 2006 .

[19]  D. Politis,et al.  Nonparametric Estimation of the Conditional Distribution at Regression Boundary Points , 2017, The American Statistician.

[20]  David Hinkley,et al.  Bootstrap Methods: Another Look at the Jackknife , 2008 .

[21]  Victor Chernozhukov,et al.  Distributional conformal prediction , 2019, Proceedings of the National Academy of Sciences.

[22]  Alessandro Rinaldo,et al.  Distribution-Free Predictive Inference for Regression , 2016, Journal of the American Statistical Association.

[23]  Yaniv Romano,et al.  Conformalized Quantile Regression , 2019, NeurIPS.