Least-squares estimation of a convex discrete distribution

The least squares estimator of a discrete distribution under the constraint of convexity is introduced. Its existence and uniqueness are shown and consistency and rate of convergence are established. Moreover it is shown that it always outperforms the classical empirical estimator in terms of the Euclidean distance. Results are given both in the well- and the mis-specified cases. The performance of the estimator is checked throughout a simulation study. An algorithm, based on the support reduction algorithm, is provided. Application to the estimation of species abundance distribution is discussed.

[1]  Anne Chao,et al.  Nonparametric prediction in species sampling , 2004 .

[2]  M. Cule,et al.  Maximum likelihood estimation of a multi‐dimensional log‐concave density , 2008, 0804.3989.

[3]  J. Kalbfleisch Statistical Inference Under Order Restrictions , 1975 .

[4]  J. Norris,et al.  Non-parametric MLE for Poisson species abundance models allowing for heterogeneity between species , 1998, Environmental and Ecological Statistics.

[5]  J. Wellner,et al.  The Support Reduction Algorithm for Computing Non‐Parametric Function Estimates in Mixture Models , 2008, Scandinavian journal of statistics, theory and applications.

[6]  J. Wellner,et al.  Estimation of a k-monotone density: limit distribution theory and the Spline connection , 2005, math/0509081.

[7]  Jon A Wellner,et al.  Estimation of a discrete monotone distribution. , 2009, Electronic journal of statistics.

[8]  J. Wellner,et al.  Estimation of a convex function: characterizations and asymptotic theory. , 2001 .

[9]  Ji-ping Wang SPECIES: An R Package for Species Richness Estimation , 2011 .

[10]  B. Lindsay,et al.  A Penalized Nonparametric Maximum Likelihood Approach to Species Richness Estimation , 2005 .

[11]  Dankmar Böhning,et al.  Equivalence of Truncated Count Mixture Distributions and Mixtures of Truncated Count Distributions , 2006, Biometrics.

[12]  Kaspar Rufibach,et al.  Active Set and EM Algorithms for Log-Concave Densities Based on Complete and Censored Data , 2007, 0707.4643.

[13]  Valentin Patilea,et al.  Convex models, MLE and misspecification , 2001 .

[14]  Dankmar Böhning,et al.  Mixture models for capture-recapture count data , 2005, Stat. Methods Appl..

[15]  Fadoua Balabdaoui,et al.  Asymptotics of the discrete log‐concave maximum likelihood estimator and related applications , 2011, 1107.3904.

[16]  Richard J. Samworth,et al.  Maximum likelihood estimation of a multi-dimensional log-concave density: Estimation of a Multi-dimensional Log-concave Density , 2010 .

[17]  Dankmar Böhning,et al.  An extension of Chao's estimator of population size based on the first three capture frequency counts , 2011, Comput. Stat. Data Anal..

[18]  L. Dümbgen,et al.  logcondens: Computations Related to Univariate Log-Concave Density Estimation , 2011 .

[19]  G. Walther Detecting the Presence of Mixing with Multiscale Maximum Likelihood , 2002 .

[20]  Kazuo Murota,et al.  MATHEMATICAL ENGINEERING TECHNICAL REPORTS Recent Developments in Discrete Convex Analysis , 2008 .