Optimization of Barron density estimates

We investigate a nonparametric estimator of the probability density introduced by Barron (1988, 1989). Earlier papers established its consistency in a strong sense, e.g., in the expected information divergence or expected chi-square divergence. This paper pays main attention to the expected chi-square divergence criterion. We give a new motivation of the Barron estimator by showing that a maximum-likelihood estimator (MLE) of a density from a family important in practice is consistent in expected information divergence but not in expected chi-square divergence. We also present new and practically applicable conditions of consistency in the expected chi-square divergence. Main attention is paid to optimization (in the sense of the mentioned criterion) of the two objects specifying the Barron estimator: the dominating probability density and the decomposition of the observation space into finitely many bins. Both problems are explicitly solved under certain regularity assumptions about the estimated density. A simulation study illustrates the results in exponential, Rayleigh, and Weibull families.

[1]  Igor Vajda,et al.  The chi-square error of Barron estimator of regular density is asymptotically normal , 1998 .

[2]  László Györfi,et al.  A Probabilistic Theory of Pattern Recognition , 1996, Stochastic Modelling and Applied Probability.

[3]  I. Vajda,et al.  Convex Statistical Distances , 2018, Statistical Inference for Engineers and Data Scientists.

[4]  László Györfi,et al.  Distribution estimation consistent in total variation and in two types of information divergence , 1992, IEEE Trans. Inf. Theory.

[5]  P. Bickel,et al.  On Some Global Measures of the Deviations of Density Function Estimates , 1973 .

[6]  A. Barron Uniformly Powerful Goodness of Fit Tests , 1989 .

[7]  Andrew R. Barron,et al.  A bound on the financial value of information , 1988, IEEE Trans. Inf. Theory.

[8]  G. Lugosi,et al.  On the asymptotic normality of the L1‐ and L2‐errors in histogram density estimation , 1994 .

[9]  Jean C. Walrand,et al.  An introduction to queueing networks , 1989, Prentice Hall International editions.

[10]  Nico M. van Dijk Queueing networks and product forms - a systems approach , 1993, Wiley-Interscience series in systems and optimization.

[11]  久孝 久保木 Statistical information and inference in parametric models , 1986 .

[12]  L. Györfi,et al.  Distribution Estimates Consistent in χ2-Divergence , 1998 .

[13]  Edward C. van der Meulen,et al.  About the Asymptotic Accuracy of Barron Density Estimates , 1998, IEEE Trans. Inf. Theory.

[14]  L. Devroye,et al.  No Empirical Probability Measure can Converge in the Total Variation Sense for all Distributions , 1990 .

[15]  David W. Scott,et al.  Multivariate Density Estimation: Theory, Practice, and Visualization , 1992, Wiley Series in Probability and Statistics.

[16]  T. Ferguson A Course in Large Sample Theory , 1996 .

[17]  P. Hall On Kullback-Leibler loss and density estimation , 1987 .

[18]  Timothy R. C. Read,et al.  Goodness-Of-Fit Statistics for Discrete Multivariate Data , 1988 .