Learning of Multivariate Beta Mixture Models via Entropy-based component splitting

Finite mixture models are progressively employed in various fields of science due to their high potential as inference engines to model multimodal and complex data. To develop them, we face some crucial issues such as choosing proper distributions with enough flexibility to well-fit the data. To learn our model, two other significant challenges, namely, parameter estimation and defining model complexity have to be addressed. Some methods such as maximum likelihood and Bayesian inference have been widely considered to tackle the first problem and both have some drawbacks such as local maxima or high computational complexity. Simultaneously, the proper number of components was determined with some approaches such as minimum message length. In this work, multivariate Beta mixture models have been deployed thanks to their flexibility and we propose a novel variational inference via an entropy-based splitting method. The performance of this approach is evaluated on real-world applications, namely, breast tissue texture classification, cytological breast data analysis, cell image categorization and age estimation.

[1]  Prabhat,et al.  Celeste: Variational inference for a generative model of astronomical images , 2015, ICML.

[2]  Hoon Kim,et al.  Monte Carlo Statistical Methods , 2000, Technometrics.

[3]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[4]  Adrian Corduneanu,et al.  Variational Bayesian Model Selection for Mixture Distributions , 2001 .

[5]  Jacob Goldberger,et al.  ICA based on a Smooth Estimation of the Differential Entropy , 2008, NIPS.

[6]  Aristidis Likas,et al.  Unsupervised Learning of Gaussian Mixtures Based on Variational Component Splitting , 2007, IEEE Transactions on Neural Networks.

[7]  Arne Leijon,et al.  Bayesian Estimation of Beta Mixture Models with Variational Inference , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  D. Chandler,et al.  Introduction To Modern Statistical Mechanics , 1987 .

[9]  M. Evans,et al.  Methods for Approximating Integrals in Statistics with Special Emphasis on Bayesian Integration Problems , 1995 .

[10]  Bahram Parvin,et al.  Automated Histology Analysis: Opportunities for signal processing , 2015, IEEE Signal Processing Magazine.

[11]  Timothy F. Cootes,et al.  Toward Automatic Simulation of Aging Effects on Face Images , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Ingram Olkin,et al.  A bivariate beta distribution , 2003 .

[13]  Nizar Bouguila,et al.  A Novel Model-Based Approach for Medical Image Segmentation Using Spatially Constrained Inverted Dirichlet Mixture Models , 2017, Neural Processing Letters.

[14]  Francisco Escolano,et al.  Entropy-Based Incremental Variational Bayes Learning of Gaussian Mixtures , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[15]  Mark W. Woolrich,et al.  Variational bayes inference of spatial mixture models for segmentation , 2006, IEEE Transactions on Medical Imaging.

[16]  Nizar Bouguila,et al.  A novel statistical approach for clustering positive data based on finite inverted Beta-Liouville mixture models , 2019, Neurocomputing.

[17]  Nizar Bouguila,et al.  Spatial Finite Non-gaussian Mixture for Color Image Segmentation , 2011, ICONIP.

[18]  Nizar Bouguila,et al.  Variational Learning for Finite Dirichlet Mixture Models and Applications , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[19]  Nizar Bouguila,et al.  Proportional data modeling via entropy-based variational bayes learning of mixture models , 2017, Applied Intelligence.

[20]  Prabhat,et al.  Celeste : Scalable variational inference for a generative model of astronomical images , 2014 .

[21]  Nizar Bouguila,et al.  A hierarchical nonparametric Bayesian approach for medical images and gene expressions classification , 2014, Soft Computing.

[22]  Ingram Olkin,et al.  Constructions for a bivariate beta distribution , 2014, 1406.5881.

[23]  Gilles Celeux,et al.  EM procedures using mean field-like approximations for Markov model-based image segmentation , 2003, Pattern Recognit..

[24]  Kenji Fukumizu,et al.  Local minima and plateaus in hierarchical structures of multilayer perceptrons , 2000, Neural Networks.

[25]  D. Demetrick,et al.  BreCaHAD: a dataset for breast cancer histopathological annotation and diagnosis , 2019, BMC Research Notes.

[26]  Udi Makov,et al.  Mixture Models in Statistics , 2001 .

[27]  O. Mangasarian,et al.  Multisurface method of pattern separation for medical diagnosis applied to breast cytology. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[28]  Nizar Bouguila,et al.  High-Dimensional Unsupervised Selection and Estimation of a Finite Generalized Dirichlet Mixture Model Based on Minimum Message Length , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Nizar Bouguila,et al.  An Accelerated Variational Framework for Face Expression Recognition , 2018, 2018 IEEE International Black Sea Conference on Communications and Networking (BlackSeaCom).

[30]  Sami Bourouis,et al.  Entropy-based variational Bayes learning framework for data clustering , 2018, IET Image Process..

[31]  Nizar Bouguila,et al.  Variational Learning of Finite Inverted Dirichlet Mixture Models and Applications , 2015, Artificial Intelligence Applications in Information and Communication Technologies.

[32]  David M. Blei,et al.  Decomposing spatiotemporal brain patterns into topographic latent sources , 2014, NeuroImage.

[33]  R. Tibshirani,et al.  Discriminant Analysis by Gaussian Mixtures , 1996 .

[34]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[35]  Dimitris Karlis,et al.  Choosing Initial Values for the EM Algorithm for Finite Mixtures , 2003, Comput. Stat. Data Anal..