Bootstrap bandwidth selection in kernel density estimation from a contaminated sample

In this paper we consider kernel estimation of a density when the data are contaminated by random noise. More specifically we deal with the problem of how to choose the bandwidth parameter in practice. A theoretical optimal bandwidth is defined as the minimizer of the mean integrated squared error. We propose a bootstrap procedure to estimate this optimal bandwidth, and show its consistency. These results remain valid for the case of no measurement error, and hence also summarize part of the theory of bootstrap bandwidth selection in ordinary kernel density estimation. The finite sample performance of the proposed bootstrap selection procedure is demonstrated with a simulation study. An application to a real data example illustrates the use of the method.

[1]  Peter J. Diggle,et al.  Choosing the smoothing parameter in a fourier approach to nonparametric deconvolution of a density estimate , 1995 .

[2]  Clifford B. Cordy,et al.  Deconvolution of a Distribution Function , 1997 .

[3]  Charles C. Taylor,et al.  Bootstrap choice of the smoothing parameter in kernel density estimation , 1989 .

[4]  P. Hall,et al.  Optimal Rates of Convergence for Deconvolving a Density , 1988 .

[5]  Jianqing Fan,et al.  Deconvolution with supersmooth distributions , 1992 .

[6]  M. Feinleib,et al.  Statistical Models for Longitudinal Studies of Health , 1992 .

[7]  R. Sabre,et al.  Consistent estimates of the mode of the probability density function in nonparametric deconvolution problems , 2000 .

[8]  Jianqing Fan,et al.  Global Behavior of Deconvolution Kernel Estimates , 1989 .

[9]  Matt P. Wand,et al.  Finite sample performance of deconvolving density estimators , 1998 .

[10]  Christian H. Hesse,et al.  Data-driven deconvolution , 1999 .

[11]  Martin L. Hazelton,et al.  An optimal local bandwidth selector for kernel density estimation , 1999 .

[12]  J. Faraway,et al.  Bootstrap choice of bandwidth for density estimation , 1990 .

[13]  Leonard A. Stefanski,et al.  Rates of convergence of some estimators in a class of deconvolution problems , 1990 .

[14]  P. Hall Large Sample Optimality of Least Squares Cross-Validation in Density Estimation , 1983 .

[15]  Bootstrap optimal bandwidth selection for kernel density estimates , 1992 .

[16]  D. W. Scott,et al.  Biased and Unbiased Cross-Validation in Density Estimation , 1987 .

[17]  J. Marr,et al.  Diet and heart: a postscript. , 1977, British medical journal.

[18]  J. Marron,et al.  Progress in data-based bandwidth selection for kernel density estimation , 1996 .

[19]  Michael H. Neumann,et al.  On the effect of estimating the error density in nonparametric deconvolution , 1997 .

[20]  D. Ruppert,et al.  Measurement Error in Nonlinear Models , 1995 .

[21]  J. Marron,et al.  Smoothed cross-validation , 1992 .

[22]  Peter Hall,et al.  Using the bootstrap to estimate mean squared error and select smoothing parameter in nonparametric problems , 1990 .

[23]  R. Karunamuni,et al.  Boundary Bias Correction for Nonparametric Deconvolution , 2000 .

[24]  Irène Gijbels,et al.  Practical bandwidth selection in deconvolution kernel density estimation , 2004, Comput. Stat. Data Anal..

[25]  L. Devroye Consistent deconvolution in density estimation , 1989 .

[26]  Jianqing Fan On the Optimal Rates of Convergence for Nonparametric Deconvolution Problems , 1991 .

[27]  M. C. Jones Rough‐and‐ready assessment of the degree and importance of smoothing in functional estimation , 2000 .

[28]  Estimation of integrated squared density derivatives from a contaminated sample , 2002 .

[29]  J. Stephen Marron,et al.  Bootstrap bandwidth selection , 1990 .

[30]  Jianqing Fan ASYMPTOTIC NORMALITY FOR DECONVOLVING KERNEL DENSITY ESTIMATORS , 1989 .

[31]  James Stephen Marron,et al.  A simple root n bandwidth selector , 1991 .

[32]  R. Carroll,et al.  Deconvolving kernel density estimators , 1987 .

[33]  Jörg Polzehl,et al.  Bias corrected bootstrap bandwidth selection , 1997 .