论文信息 - Tech Report A Variational HEM Algorithm for Clustering Hidden Markov Models

Tech Report A Variational HEM Algorithm for Clustering Hidden Markov Models

The hidden Markov model (HMM) is a generative model that treats sequential data under the assumption that each observation is conditioned on the state of a discrete hidden variable that evolves in time as a Markov chain. In this paper, we derive a novel algorithm to cluster HMMs through their probability distributions. We propose a hierarchical EM algorithm that i) clusters a given collection of HMMs into groups of HMMs that are similar, in terms of the distributions they represent, and ii) characterizes each group by a "cluster center", i.e., a novel HMM that is representative for the group. We present several empirical studies that illustrate the benefits of the proposed algorithm.

Antoni B. Chan | Gert R. G. Lanckriet | Emanuele Coviello | E. Coviello | Antoni B. Chan

[1] D. Haussler,et al. Hidden Markov models in computational biology. Applications to protein modeling. , 1993, Journal of molecular biology.

[2] John R. Hershey,et al. Variational Kullback-Leibler divergence for Hidden Markov models , 2007, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU).

[3] Antoni B. Chan,et al. Automatic Music Tagging With Time Series Models , 2010, ISMIR.

[4] Tony Jebara,et al. Spectral Clustering and Embedding with Hidden Markov Models , 2007, ECML.

[5] Gert R. G. Lanckriet,et al. Semantic Annotation and Retrieval of Music and Sound Effects , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[6] Nuno Vasconcelos,et al. Learning Mixture Hierarchies , 1998, NIPS.

[7] Biing-Hwang Juang,et al. Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[8] Padhraic Smyth,et al. Clustering Sequences with Hidden Markov Models , 1996, NIPS.

[9] Nuno Vasconcelos. Image indexing with mixture hierarchies , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[10] Gustavo Carneiro,et al. Supervised Learning of Semantic Classes for Image Annotation and Retrieval , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11] Tommi S. Jaakkola,et al. Tutorial on variational approximation methods , 2000 .

[12] Michael I. Jordan,et al. An Introduction to Variational Methods for Graphical Models , 1999, Machine Learning.

[13] Tony Jebara,et al. Probability Product Kernels , 2004, J. Mach. Learn. Res..

[14] Antoni B. Chan. Derivation of the Hierarchical EM algorithm for Dynamic Textures , 2010 .

[15] Lawrence Carin,et al. Music Analysis Using Hidden Markov Mixture Models , 2007, IEEE Transactions on Signal Processing.

[16] Kin Hong Wong,et al. Script recognition using hidden Markov models , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[17] Marc Toussaint,et al. Extracting Motion Primitives from Natural Handwriting Data , 2006, ICANN.

[18] John R. Hershey,et al. Approximating the Kullback Leibler Divergence Between Gaussian Mixture Models , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.