Unsupervised Learning of Finite Gaussian Mixture Models (GMMs): A Greedy Approach

In this work we propose a clustering algorithm that learns on-line a finite gaussian mixture model from multivariate data based on the expectation maximization approach. The convergence of the right number of components as well as their means and covariances is achieved without requiring any careful initialization. Our methodology starts from a single mixture component covering the whole data set and sequentially splits it incrementally during the expectation maximization steps. Once the stopping criteria has been reached, the classical EM algorithm with the best selected mixture is run in order to optimize the solution. We show the effectiveness of the method in a series of simulated experiments and compare in with a state-of-the-art alternative technique both with synthetic data and real images, including experiments with the iCub humanoid robot.

[1]  Ben J. A. Kröse,et al.  Efficient Greedy Learning of Gaussian Mixture Models , 2003, Neural Computation.

[2]  Markus Vincze,et al.  Robust tracking of ellipses at frame rate , 2001, Pattern Recognit..

[3]  Andrew R. Barron,et al.  Mixture Density Estimation , 1999, NIPS.

[4]  Geoffrey E. Hinton,et al.  SMEM Algorithm for Mixture Models , 1998, Neural Computation.

[5]  Manuel Lopes,et al.  Learning Object Affordances: From Sensory--Motor Coordination to Imitation , 2008, IEEE Transactions on Robotics.

[6]  Daniel P. W. Ellis,et al.  Evaluation of Distance Measures Between Gaussian Mixture Models of MFCCs , 2007, ISMIR.

[7]  Alexandre Bernardino,et al.  A Practical Method for Self-adapting Gaussian Expectation Maximization , 2010, ICINCO.

[8]  Emanuele Menegatti,et al.  Simulation of small humanoid robots for soccer domain , 2009, J. Frankl. Inst..

[9]  Peter Ahrendt,et al.  The Multivariate Gaussian Probability Distribution , 2005 .

[10]  Zhihua Zhang,et al.  EM algorithms for Gaussian mixtures with split-and-merge operation , 2003, Pattern Recognition.

[11]  Sang Uk Lee,et al.  Robust segmentation of cerebral arterial segments by a sequential Monte Carlo method: Particle filtering , 2006, Comput. Methods Programs Biomed..

[12]  Stefano Carpin,et al.  Bridging the Gap Between Simulation and Reality in Urban Search and Rescue , 2006, RoboCup.

[13]  Anil K. Jain,et al.  Unsupervised Learning of Finite Mixture Models , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  R. Okafor Maximum likelihood estimation from incomplete data , 1987 .

[15]  C A Grimbergen,et al.  Measurement of the distribution of red blood cell deformability using an automated rheoscope. , 2002, Cytometry.