Nonparametric predictive distributions based on conformal prediction

This paper applies conformal prediction to derive predictive distributions that are valid under a nonparametric assumption. Namely, we introduce and explore predictive distribution functions that always satisfy a natural property of validity in terms of guaranteed coverage for IID observations. The focus is on a prediction algorithm that we call the Least Squares Prediction Machine (LSPM). The LSPM generalizes the classical Dempster–Hill predictive distributions to nonparametric regression problems. If the standard parametric assumptions for Least Squares linear regression hold, the LSPM is as efficient as the Dempster–Hill procedure, in a natural sense. And if those parametric assumptions fail, the LSPM is still valid, provided the observations are IID.

[1]  Vladimir Vovk,et al.  Conformal Prediction for Reliable Machine Learning: Theory, Adaptations and Applications , 2014 .

[2]  L. Mark Berliner,et al.  Bayesian Nonparametric Survival Analysis , 1988 .

[3]  Le Roy Dougherty Weld,et al.  Theory of Errors and Least Squares , 2009, Nature.

[4]  J. A. Díaz-García,et al.  SENSITIVITY ANALYSIS IN LINEAR REGRESSION , 2022 .

[5]  P. Billingsley,et al.  Convergence of Probability Measures , 1970, The Mathematical Gazette.

[6]  George A. F. Seber,et al.  Linear regression analysis , 1977 .

[7]  Iosif Pinelis,et al.  Exact bounds on the closeness between the Student and standard normal distributions , 2011, 1101.3328.

[8]  Harold Jeffreys,et al.  On the theory of errors and least squares , 1932 .

[9]  A. Dempster On Direct Probabilities , 1963 .

[10]  Gemai Chen,et al.  Empirical processes based on regression residuals : theory and applications , 1991 .

[11]  R. M. Dudley,et al.  Weak Convergence of Probabilities on Nonseparable Metric Spaces and Empirical Measures on Euclidean Spaces , 1966 .

[12]  S. Hansen Student , 1972 .

[13]  M. Donsker Justification and Extension of Doob's Heuristic Approach to the Kolmogorov- Smirnov Theorems , 1952 .

[14]  Elizabeth A. Peck,et al.  Introduction to Linear Regression Analysis , 2001 .

[15]  L. A. Mukantseva Testing Normality in One-Dimensional and Multi-Dimensional Linear Regression , 1978 .

[16]  Douglas C. Montgomery,et al.  Introduction to Linear Regression Analysis, Solutions Manual (Wiley Series in Probability and Statistics) , 2007 .

[17]  Christian Genest,et al.  Bayesian Nonparametric Survival Analysis: Comment , 1988 .

[18]  Minge Xie,et al.  Prediction with confidence—A general framework for predictive inference , 2017 .

[19]  Teddy Seidenfeld,et al.  Jeffreys, Fisher, and Keynes: Predicting the Third Observation, Given the First Two , 1995 .

[20]  B. M. Hill,et al.  Posterior Distribution of Percentiles: Bayes' Theorem for Sampling From a Population , 1968 .

[21]  J. Lawless,et al.  Frequentist prediction intervals and predictive distributions , 2005 .

[22]  Y. Belyaev,et al.  Weakly approaching sequences of random distributions , 2000, Journal of Applied Probability.

[23]  Glen D Meeden,et al.  Fuzzy and randomized confidence intervals and P-values , 2005 .

[24]  Jon A. Wellner,et al.  Weak Convergence and Empirical Processes: With Applications to Statistics , 1996 .

[25]  Vladimir Vovk,et al.  Conformal predictive decision making , 2018, COPA.

[26]  Alexander Gammerman,et al.  Inductive Venn-Abers predictive distribution , 2018, COPA.

[27]  Kenneth J. Kopecky,et al.  Testing goodness of fit for the distribution of errors in regression models , 1979 .

[28]  G. Shafer,et al.  Algorithmic Learning in a Random World , 2005 .

[29]  Mohammad Mohammadi ON THE BOUNDS FOR DIAGONAL AND OFF-DIAGONAL ELEMENTS OF THE HAT MATRIX IN THE LINEAR REGRESSION MODEL , 2016 .

[30]  A. Gammerman,et al.  On-line predictive linear regression , 2005, math/0511522.

[31]  Tore Schweder,et al.  Confidence, Likelihood, Probability: Statistical Inference with Confidence Distributions , 2016 .

[32]  Thomas Augustin,et al.  Nonparametric predictive inference and interval probability , 2004 .

[33]  Alan J. Lee,et al.  Linear Regression Analysis: Seber/Linear , 2003 .

[34]  S. Chatterjee Sensitivity analysis in linear regression , 1988 .

[35]  D. Lane Fisher, Jeffreys, and the Nature of Probability , 1980 .

[36]  Vladimir Vovk,et al.  Efficiency of conformalized ridge regression , 2014, COLT.

[37]  Richard M. Dudley,et al.  Measures on Non-Separable Metric Spaces , 1967 .

[38]  Jan Hannig,et al.  Fiducial prediction intervals , 2012 .