The EM algorithm with gradient function update for discrete mixtures with known (fixed) number of components

The paper is focussing on some recent developments in nonparametric mixture distributions. It discusses nonparametric maximum likelihood estimation of the mixing distribution and will emphasize gradient type results, especially in terms of global results and global convergence of algorithms such as vertex direction or vertex exchange method. However, the NPMLE (or the algorithms constructing it) provides also an estimate of the number of components of the mixing distribution which might be not desirable for theoretical reasons or might be not allowed from the physical interpretation of the mixture model. When the number of components is fixed in advance, the before mentioned algorithms can not be used and globally convergent algorithms do not exist up to now. Instead, the EM algorithm is often used to find maximum likelihood estimates. However, in this case multiple maxima are often occuring. An example from a meta-analyis of vitamin A and childhood mortality is used to illustrate the considerable, inferential importance of identifying the correct global likelihood. To improve the behavior of the EM algorithm we suggest a combination of gradient function steps and EM steps to achieve global convergence leading to the EM algorithm with gradient function update (EMGFU). This algorithms retains the number of components to be exactly k and typically converges to the global maximum. The behavior of the algorithm is highlighted at hand of several examples.

[1]  T. Louis,et al.  Bayes and Empirical Bayes Methods for Data Analysis. , 1997 .

[2]  Dankmar Böhning,et al.  Computer-Assisted Analysis of Mixtures and Applications: Meta-Analysis, Disease Mapping, and Others , 1999 .

[3]  F. Mosteller,et al.  Vitamin A supplementation and child mortality. A meta-analysis. , 1993, JAMA.

[4]  B. Lindsay The Geometry of Mixture Likelihoods, Part II: The Exponential Family , 1983 .

[5]  N. Laird Nonparametric Maximum Likelihood Estimation of a Mixing Distribution , 1978 .

[6]  B. Leroux Consistent estimation of a mixing distribution , 1992 .

[7]  A. F. Smith,et al.  Statistical analysis of finite mixture distributions , 1986 .

[8]  Maimunah A Hamid,et al.  Bimodality in blood glucose distribution: is it universal? , 2002, Diabetes care.

[9]  Karl Mosler,et al.  A Cautionary Note on Likelihood Ratio Tests in Mixture Models , 2000 .

[10]  Dankmar Bohning Convergence of Simar's Algorithm for Finding the Maximum Likelihood Estimate of a Compound Poisson Process , 1982 .

[11]  L. Simar Maximum Likelihood Estimation of a Compound Poisson Process , 1976 .

[12]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[13]  A. Sommer,et al.  VITAMIN A SUPPLEMENTATION AND CHILD MORTALITY , 1986, The Lancet.

[14]  V. Hasselblad Finite mixtures of distributions from the exponential family , 1969 .

[15]  Dankmar Böhning,et al.  Computer-Assisted Analysis of Mixtures and Applications , 2000, Technometrics.

[16]  Bradley P. Carlin,et al.  BAYES AND EMPIRICAL BAYES METHODS FOR DATA ANALYSIS , 1996, Stat. Comput..