The Maximal Smoothing Principle in Density Estimation

Abstract We propose a widely applicable method for choosing the smoothing parameters for nonparametric density estimators. It has come to be realized in recent years (e.g., see Hall and Marron 1987; Scott and Terrell 1987) that cross-validation methods for finding reasonable smoothing parameters from raw data are of very limited practical value. Their sampling variability is simply too large. The alternative discussed here, the maximal smoothing principle, suggests that we consider using the most smoothing that is consistent with the estimated scale of our data. This greatly generalizes and exploits a phenomenon noted in Terrell and Scott (1985), that measures of scale tend to place upper bounds on the smoothing parameters that minimize asymptotic mean integrated squared error of density estimates such as histograms and frequency polygons. The method avoids the extreme sampling variability of cross-validation by using ordinary scale estimators such as the standard deviation and interquartile range, which ...

[1]  E. Jaynes Information Theory and Statistical Mechanics , 1957 .

[2]  E. Parzen On Estimation of a Probability Density Function and Mode , 1962 .

[3]  V. A. Epanechnikov Non-Parametric Estimation of a Multivariate Probability Density , 1969 .

[4]  P. Deheuvels Estimation non paramétrique de la densité par histogrammes généralisés , 1977 .

[5]  H. D. Brunk Univariate density estimation by orthogonal series , 1978 .

[6]  D. W. Scott,et al.  Plasma lipids as collateral risk factors in coronary artery disease--a study of 371 males with chest pain. , 1978, Journal of chronic diseases.

[7]  D. W. Scott On optimal and data based histograms , 1979 .

[8]  I. Good,et al.  Density Estimation and Bump-Hunting by the Penalized Likelihood Method Exemplified by Scattering and Meteorite Data , 1980 .

[9]  M. Rudemo Empirical Choice of Histograms and Kernel Density Estimators , 1982 .

[10]  B. W. Silverman,et al.  Probability, Statistics and Analysis: Some properties of a test for multimodality based on kernel density estimates , 1983 .

[11]  R. John,et al.  Boundary modification for kernel regression , 1984 .

[12]  A. Bowman An alternative method of cross-validation for the smoothing of density estimates , 1984 .

[13]  David W. Scott,et al.  Frequency Polygons: Theory and Application , 1985 .

[14]  Eugene F. Schuster,et al.  Incorporating support constraints into nonparametric estimators of densities , 1985 .

[15]  D. W. Scott,et al.  Oversmoothed Nonparametric Density Estimates , 1985 .

[16]  H. Müller,et al.  Kernels for Nonparametric Curve Estimation , 1985 .

[17]  G. Terrell Projection Pursuit via Multivariate Histograms , 1985 .

[18]  James Stephen Marron,et al.  On the Amount of Noise Inherent in Bandwidth Selection for a Kernel Density Estimator , 1987 .

[19]  D. W. Scott,et al.  Biased and Unbiased Cross-Validation in Density Estimation , 1987 .

[20]  D. Donoho One-sided inference about functionals of a density , 1988 .