A Bayesian model for local smoothing in kernel density estimation

A new procedure is proposed for deriving variable bandwidths in univariate kernel density estimation, based upon likelihood cross-validation and an analysis of a Bayesian graphical model. The procedure admits bandwidth selection which is flexible in terms of the amount of smoothing required. In addition, the basic model can be extended to incorporate local smoothing of the density estimate. The method is shown to perform well in both theoretical and practical situations, and we compare our method with those of Abramson (The Annals of Statistics 10: 1217–1223) and Sain and Scott (Journal of the American Statistical Association 91: 1525–1534). In particular, we note that in certain cases, the Sain and Scott method performs poorly even with relatively large sample sizes.We compare various bandwidth selection methods using standard mean integrated square error criteria to assess the quality of the density estimates. We study situations where the underlying density is assumed both known and unknown, and note that in practice, our method performs well when sample sizes are small. In addition, we also apply the methods to real data, and again we believe our methods perform at least as well as existing methods.

[1]  M. C. Jones,et al.  A Brief Survey of Bandwidth Selection for Density Estimation , 1996 .

[2]  D. W. Scott,et al.  On Locally Adaptive Density Estimation , 1996 .

[3]  A. Cuevas,et al.  A comparative study of several smoothing methods in density estimation , 1994 .

[4]  M. C. Jones,et al.  A reliable data-based bandwidth selection method for kernel density estimation , 1991 .

[5]  E. F. Schuster,et al.  On the Nonconsistency of Maximum Likelihood Nonparametric Density Estimators , 1981 .

[6]  A. Bowman,et al.  Adaptive Smoothing and Density-Based Tests of Multivariate Normality , 1993 .

[7]  B. Silverman Density estimation for statistics and data analysis , 1986 .

[8]  D. W. Scott,et al.  Variable Kernel Density Estimation , 1992 .

[9]  Robert P. W. Duin,et al.  On the Choice of Smoothing Parameters for Parzen Estimators of Probability Density Functions , 1976, IEEE Transactions on Computers.

[10]  A. Molli'e Bayesian mapping of disease , 1996 .

[11]  Bu. Park,et al.  Rejoinder to ``Practical performance of several data driven bandwidth selectors" , 1992 .

[12]  Mark J. Brewer,et al.  A comparison of hybrid strategies for Gibbs sampling in mixed graphical models , 1996 .

[13]  M. C. Jones,et al.  Simple boundary correction for kernel density estimation , 1993 .

[14]  Ian Abramson On Bandwidth Variation in Kernel Estimates-A Square Root Law , 1982 .

[15]  N. Wermuth,et al.  On Substantive Research Hypotheses, Conditional Independence Graphs and Graphical Chain Models , 1990 .

[16]  Sylvia Richardson,et al.  Markov Chain Monte Carlo in Practice , 1997 .

[17]  Mark J. Brewer A Modelling Approach for Bandwidth Selection in Kernel Density Estimation , 1998, COMPSTAT.