Kernel density estimation via diffusion

We present a new adaptive kernel density estimator based on linear diffusion processes. The proposed estimator builds on existing ideas for adaptive smoothing by incorporating information from a pilot density estimate. In addition, we propose a new plug-in bandwidth selection method that is free from the arbitrary normal reference rules used by existing methods. We present simulation examples in which the proposed approach outperforms existing methods in terms of accuracy and reliability.
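The link between kernel smoothing and diffusion can be illustrated with the simplest (non-adaptive) case. The sketch below is not the adaptive estimator or the plug-in bandwidth selector proposed here; it only checks numerically the standard fact that a Gaussian kernel density estimate with bandwidth sqrt(t) satisfies the heat equation df/dt = (1/2) d²f/dx². The function names, the grid, the sample, and the value of t are all illustrative assumptions.

```python
import numpy as np

def gaussian_kde_at_time(x, data, t):
    """Gaussian KDE with bandwidth sqrt(t): the heat kernel averaged over the data."""
    x = np.asarray(x, dtype=float)[:, None]
    data = np.asarray(data, dtype=float)[None, :]
    kernel = np.exp(-((x - data) ** 2) / (2.0 * t)) / np.sqrt(2.0 * np.pi * t)
    return kernel.mean(axis=1)

def heat_equation_residual(x, data, t, dt=1e-5):
    """Finite-difference estimate of df/dt - 0.5 * d2f/dx2 on the interior grid points."""
    dx = x[1] - x[0]
    # Central difference in the diffusion time t.
    f_t = (gaussian_kde_at_time(x, data, t + dt)
           - gaussian_kde_at_time(x, data, t - dt)) / (2.0 * dt)
    # Central second difference in space.
    f = gaussian_kde_at_time(x, data, t)
    f_xx = (f[2:] - 2.0 * f[1:-1] + f[:-2]) / dx ** 2
    return f_t[1:-1] - 0.5 * f_xx

# Illustrative data and settings (assumptions, not taken from the paper).
rng = np.random.default_rng(0)
data = rng.normal(size=200)
grid = np.linspace(-6.0, 6.0, 1201)
t = 0.09  # corresponds to bandwidth sqrt(t) = 0.3

density = gaussian_kde_at_time(grid, data, t)
residual = heat_equation_residual(grid, data, t)
mass = float(np.sum(density) * (grid[1] - grid[0]))
```

Here `mass` should be close to 1 and `residual` close to zero everywhere, up to discretization error. In this view the bandwidth plays the role of the diffusion time, so choosing a bandwidth amounts to choosing when to stop the diffusion.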
