An effective strategy for initializing the EM algorithm in finite mixture models

Finite mixture models are among the most popular tools for modeling heterogeneous data. The traditional approach to parameter estimation is to maximize the likelihood function, but direct optimization is often troublesome because of the complex likelihood structure. The expectation–maximization (EM) algorithm proves to be an effective remedy for this issue. However, the solution it produces is driven entirely by the choice of starting parameter values, which highlights the importance of an effective initialization strategy. Despite the efforts undertaken in this area, no uniformly best strategy has been found, and practitioners tend to ignore the issue, often obtaining misleading or erroneous results. In this paper, we propose a simple yet effective tool for initializing the EM algorithm in the mixture modeling setting. The idea is based on model averaging and proves efficient at detecting correct solutions even in cases where competing strategies perform poorly. The utility of the proposed methodology is demonstrated through a comprehensive simulation study and an application to a well-known classification dataset, with good results.
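
To make the general idea concrete, the following is a minimal sketch under stated assumptions, not the authors' exact procedure: several short EM runs are launched from random starts, their component labels are aligned, and their parameter estimates are combined with likelihood-based weights to seed one long EM run. The helper name `averaged_start`, the softmax-style weighting, and the mean-matching label alignment are illustrative choices, and scikit-learn's `GaussianMixture` is used here simply as an off-the-shelf EM engine.

```python
# Hypothetical model-averaging style initialization for EM in a Gaussian mixture.
# Illustrative only; the weighting and alignment steps are assumptions, not the
# paper's algorithm.
import numpy as np
from scipy.optimize import linear_sum_assignment
from sklearn.mixture import GaussianMixture

def averaged_start(X, n_components, n_candidates=10, short_iter=5, seed=0):
    """Build EM starting values by averaging several short-run estimates."""
    rng = np.random.RandomState(seed)
    runs = []
    for _ in range(n_candidates):
        gm = GaussianMixture(n_components=n_components, max_iter=short_iter,
                             n_init=1, init_params="random",
                             random_state=rng.randint(1_000_000))
        gm.fit(X)
        runs.append(gm)

    # Likelihood-based averaging weights (softmax over mean log-likelihoods).
    ll = np.array([gm.score(X) for gm in runs])
    w = np.exp(ll - ll.max())
    w /= w.sum()

    # Align component labels of each run to the best run to avoid label
    # switching, matching components by the distance between mean vectors.
    ref = runs[int(np.argmax(ll))]
    means = np.zeros_like(ref.means_)
    weights = np.zeros_like(ref.weights_)
    for gm, wk in zip(runs, w):
        cost = np.linalg.norm(ref.means_[:, None, :] - gm.means_[None, :, :],
                              axis=2)
        _, perm = linear_sum_assignment(cost)
        means += wk * gm.means_[perm]
        weights += wk * gm.weights_[perm]
    weights /= weights.sum()
    return means, weights

if __name__ == "__main__":
    # Toy two-component example.
    rng = np.random.RandomState(1)
    X = np.vstack([rng.normal(0, 1, size=(100, 2)),
                   rng.normal(4, 1, size=(100, 2))])
    means0, weights0 = averaged_start(X, n_components=2)
    final = GaussianMixture(n_components=2, means_init=means0,
                            weights_init=weights0, max_iter=500).fit(X)
    print(final.means_)
```

Averaging across candidate starts is intended to smooth out poor individual short-run solutions before the long EM run; in practice the number of candidates, the weighting scheme, and the alignment rule would follow the paper's actual recommendations rather than the choices sketched here.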
