Bayesian feature and model selection for Gaussian mixture models

We present a Bayesian method for mixture model training that addresses the feature selection and model selection problems simultaneously. The method integrates a mixture model formulation that accounts for the saliency of the features with a Bayesian approach to mixture learning that can estimate the number of mixture components. The proposed learning algorithm follows the variational framework and jointly optimizes over the number of components, the saliency of the features, and the parameters of the mixture model. Experimental results on high-dimensional artificial and real data illustrate the effectiveness of the method.
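
To make the notion of feature saliency concrete, the following is a minimal sketch of how a saliency-weighted mixture density might be evaluated. The abstract gives no equations, so the specific form is an assumption: each feature is drawn from a component-specific Gaussian with probability equal to its saliency, and otherwise from a background Gaussian shared by all components. All function and parameter names are hypothetical, and the sketch covers only density evaluation, not the variational learning algorithm.

```python
import math


def normal_pdf(x, mean, var):
    """Univariate Gaussian density with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def saliency_mixture_pdf(x, weights, means, variances,
                         saliency, bg_means, bg_variances):
    """Density of a diagonal Gaussian mixture with per-feature saliencies.

    Assumed form (not taken from the abstract): feature l follows the
    component-specific Gaussian with probability saliency[l], and a
    background Gaussian common to all components otherwise.
    """
    total = 0.0
    for j, weight in enumerate(weights):
        component = weight
        for l, x_l in enumerate(x):
            relevant = normal_pdf(x_l, means[j][l], variances[j][l])
            background = normal_pdf(x_l, bg_means[l], bg_variances[l])
            # Blend the two densities according to the feature's saliency.
            component *= saliency[l] * relevant + (1.0 - saliency[l]) * background
        total += component
    return total
```

Under this assumed form, a saliency of 1 for every feature recovers an ordinary diagonal Gaussian mixture, while a saliency of 0 makes the feature independent of the component label, which is the sense in which it carries no clustering information.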
