Dimension reduction for model-based clustering via mixtures of shifted asymmetric Laplace distributions

A dimension reduction method for model-based clustering via a finite mixture of shifted asymmetric Laplace distributions is introduced. The approach is based on existing work within the Gaussian paradigm and relies on identification of a reduced subspace. This subspace contains linear combinations of the original data, ordered by importance using the associated eigenvalues. This clustering approach is illustrated on simulated and real data, where it performs favourably compared to its Gaussian analogue.

[1]  H. Riedwyl,et al.  Multivariate Statistics: A Practical Approach , 1988 .

[2]  Paul Horton,et al.  A Probabilistic Classification System for Predicting the Cellular Localization Sites of Proteins , 1996, ISMB.

[3]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[4]  Paul D. McNicholas,et al.  Model-based clustering, classification, and discriminant analysis via mixtures of multivariate t-distributions , 2011, Statistics and Computing.

[5]  A. Raftery,et al.  Variable Selection for Model-Based Clustering , 2006 .

[6]  S. Weisberg,et al.  An Introduction to Regression Graphics , 1994 .

[7]  Ker-Chau Li,et al.  Sliced Inverse Regression for Dimension Reduction , 1991 .

[8]  Adrian E. Raftery,et al.  MCLUST: Software for Model-Based Cluster Analysis , 1999 .

[9]  Ker-Chau Li Sliced inverse regression for dimension reduction (with discussion) , 1991 .

[10]  Gérard Govaert,et al.  Gaussian parsimonious clustering models , 1995, Pattern Recognit..

[11]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[12]  Luca Scrucca,et al.  Dimension reduction for model-based clustering , 2015, Stat. Comput..

[13]  Adrian E. Raftery,et al.  Normal Mixture Modelling for Model-Based Clustering,Classification, and Density Estimation , 2015 .

[14]  Ryan P. Browne,et al.  Mixtures of Shifted AsymmetricLaplace Distributions , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Ryan P. Browne,et al.  Estimating common principal components in high dimensions , 2013, Advances in Data Analysis and Classification.

[16]  Paul D. McNicholas,et al.  Parsimonious Gaussian mixture models , 2008, Stat. Comput..