mclust Version 4 for R : Normal Mixture Modeling for Model-Based Clustering , Classification , and Density Estimation

mclust is a contributed R package for model-based clustering, classification, and density estimation based on finite normal mixture modeling. It provides functions for parameter estimation via the EM algorithm for normal mixture models with a variety of covariance structures, and functions for simulation from these models. Also included are functions that combine model-based hierarchical clustering, EM for mixture estimation and the Bayesian Information Criterion (BIC) in comprehensive strategies for clustering, density estimation and discriminant analysis. There is additional functionality for displaying and visualizing the models along with clustering, classification, and density estimation results. Several features of the software have been changed in this version, in particular the functionality for discriminant analysis and density estimation has been largely expanded. mclust is licensed under the GPL and distributed through CRAN; see http://cran.r-project.org/web/packages/mclust/index.html.

[1]  Adrian E. Raftery,et al.  MCLUST: Software for Model-Based Cluster Analysis , 1999 .

[2]  C. Fraley,et al.  Nonparametric Maximum Likelihood Estimation of Features in Spatial Point Processes Using Voronoï Tessellation , 1997 .

[3]  A. Raftery,et al.  Detecting features in spatial point processes with clutter via model-based clustering , 1998 .

[4]  Adrian E. Raftery,et al.  Bayesian Regularization for Normal Mixture Estimation and Model-Based Clustering , 2007, J. Classif..

[5]  Guohua Pan,et al.  Local Regression and Likelihood , 1999, Technometrics.

[6]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[7]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .

[8]  Adrian E. Raftery,et al.  Model-Based Clustering for Image Segmentation and Large Datasets via Sampling , 2004, J. Classif..

[9]  Luca Scrucca,et al.  Dimension reduction for model-based clustering , 2015, Stat. Comput..

[10]  G. Celeux,et al.  Regularized Gaussian Discriminant Analysis through Eigenvalue Decomposition , 1996 .

[11]  Adrian E. Raftery,et al.  Linear flaw detection in woven textiles using model-based clustering , 1997, Pattern Recognit. Lett..

[12]  A. Raftery,et al.  Model-based Gaussian and non-Gaussian clustering , 1993 .

[13]  Robert H. Halstead,et al.  Matrix Computations , 2011, Encyclopedia of Parallel Computing.

[14]  A. Scott,et al.  Clustering methods based on likelihood ratio criteria. , 1971 .

[15]  Adrian E. Raftery,et al.  How Many Clusters? Which Clustering Method? Answers Via Model-Based Cluster Analysis , 1998, Comput. J..

[16]  Chris Fraley,et al.  Algorithms for Model-Based Gaussian Hierarchical Clustering , 1998, SIAM J. Sci. Comput..

[17]  Gérard Govaert,et al.  Gaussian parsimonious clustering models , 1995, Pattern Recognit..

[18]  Adrian E. Raftery,et al.  Model-Based Clustering, Discriminant Analysis, and Density Estimation , 2002 .

[19]  A. Raftery,et al.  Nearest-Neighbor Clutter Removal for Estimating Features in Spatial Point Processes , 1998 .

[20]  Adrian E. Raftery,et al.  Incremental Model-Based Clustering for Large Datasets With Small Clusters , 2005 .

[21]  P. Deb Finite Mixture Models , 2008 .

[22]  N. L. Johnson,et al.  Multivariate Analysis , 1958, Nature.

[23]  C. Fraley,et al.  Nonparametric Maximum Likelihood Estimation of Features in Spatial Point Processes Using Voronoï Tessellation , 1997 .

[24]  H. P. Friedman,et al.  On Some Invariant Criteria for Grouping Data , 1967 .

[25]  Gilles Celeux,et al.  Combining Mixture Components for Clustering , 2010, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America.

[26]  Nicole A. Lazar,et al.  Statistical Analysis With Missing Data , 2003, Technometrics.

[27]  Adrian E. Raftery,et al.  Fitting straight lines to point patterns , 1984, Pattern Recognit..

[28]  Adrian E. Raftery,et al.  Model-based Methods of Classification: Using the mclust Software in Chemometrics , 2007 .