Statistically Efficient Estimation for Non-Smooth Probability Densities

We investigate statistical efficiency of estimators for non-smooth density functions. The density estimation problem appears in various situations, and it is intensively used in statistics and machine learning. The statistical efficiencies of estimators, i.e., their convergence rates, play a central role in advanced statistical analysis. Although estimators and their convergence rates for smooth density functions are well investigated in the literature, those for non-smooth density functions remain elusive despite their importance in application fields. In this paper, we propose new estimators for non-smooth density functions by employing the notion of Szemerédi partitions from graph theory. We derive convergence rates of the proposed estimators. One of them has the optimal convergence rate in minimax sense, and the other has slightly worse convergence rate but runs in polynomial time. Experimental results support the theoretical performance of our estimators.

[1]  L. Zhao,et al.  Almost Sure $L_r$-Norm Convergence for Data-Based Histogram Density Estimates , 1991 .

[2]  W. T. Gowers,et al.  Hypergraph regularity and the multidimensional Szemerédi theorem , 2007, 0710.3032.

[3]  H. D. Brunk,et al.  Statistical inference under order restrictions : the theory and application of isotonic regression , 1973 .

[4]  B. Szegedy,et al.  Szemerédi’s Lemma for the Analyst , 2007 .

[5]  Dirk P. Kroese,et al.  Kernel density estimation via diffusion , 2010, 1011.2602.

[6]  D. W. Scott On optimal and data based histograms , 1979 .

[7]  T. Cai,et al.  Testing composite hypotheses, Hermite polynomials and optimal estimation of a nonsmooth functional , 2011, 1105.3039.

[8]  L. Devroye The Equivalence of Weak, Strong and Complete Convergence in $L_1$ for Kernel Density Estimates , 1983 .

[9]  A. Goldenshluger,et al.  Bandwidth selection in kernel density estimation: Oracle inequalities and adaptive minimax optimality , 2010, 1009.1016.

[10]  Rocco A. Servedio,et al.  Near-Optimal Density Estimation in Near-Linear Time Using Variable-Width Histograms , 2014, NIPS.

[11]  M. Simonovits,et al.  Szemeredi''s Regularity Lemma and its applications in graph theory , 1995 .

[12]  L. Zhao,et al.  Almost sure L 1 -norm convergence for data-based histogram density estimates , 1987 .

[13]  Eric R. Ziegel,et al.  New Perspectives in Theoretical and Applied Statistics , 1988 .

[14]  Prakasa Rao Nonparametric functional estimation , 1983 .

[15]  Atsuyuki Kogure,et al.  Asymptotically Optimal Cells for a Historgram , 1987 .

[16]  Luc Devroye,et al.  Combinatorial methods in density estimation , 2001, Springer series in statistics.

[17]  E. Szemerédi Regular Partitions of Graphs , 1975 .

[18]  D. W. Scott,et al.  Variable Kernel Density Estimation , 1992 .

[19]  G. Walter Properties of Hermite Series Estimation of Probability Density , 1977 .

[20]  S. Efromovich Adaptive estimation of and oracle inequalities for probability densities and characteristic functions , 2008, 0806.2909.

[21]  Taoufik Bouezmarni,et al.  L1-rate of convergence of smoothed histogram , 2007 .

[22]  O. Lepskii On a Problem of Adaptive Estimation in Gaussian White Noise , 1991 .

[23]  David A. Freedman,et al.  On the maximum deviation between the histogram and the underlying density , 1981 .

[24]  Yuichiro Kanazawa An optimal variable cell histogram , 1988 .

[25]  C. D. Kemp,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[26]  G. Lugosi,et al.  Consistency of Data-driven Histogram Methods for Density Estimation and Classification , 1996 .

[27]  Tom Leonard Density Estimation, Stochastic Processes and Prior Information , 1978 .

[28]  Edward J. Wegman,et al.  Maximum Likelihood Estimation of a Unimodal Density Function , 1970 .

[29]  Alexandre B. Tsybakov,et al.  Introduction to Nonparametric Estimation , 2008, Springer series in statistics.

[30]  Van Der Vaart,et al.  Rates of contraction of posterior distributions based on Gaussian process priors , 2008 .

[31]  Alan M. Frieze,et al.  A Simple Algorithm for Constructing Szemere'di's Regularity Partition , 1999, Electron. J. Comb..