Learning Hidden Markov Models with Hidden Markov Trees as Observation Distributions

Hidden Markov models have been found very useful for a wide range of applications in artificial intelligence. The wavelet transform arises as a new tool for signal and image analysis, with a special emphasis on nonlinearities and nonstationarities. However, learning models for wavelet coecients have been mainly based on fixed-length sequences. We propose a novel learning architecture for sequences analyzed on a short-term basis, but not assuming stationarity within each frame. Long-term dependencies are modeled with a hidden Markov model which, in each internal state, deals with the local dynamics in the wavelet domain using a hidden Markov tree. The training algorithms for all the parameters in the composite model are developed using the expectation-maximization framework. This novel learning architecture can be useful for a wide range of applications. We detail experiments with real data for speech recognition. In the results, recognition rates were better than the state of the art technologies for this task.

[1]  Miquel Barceló,et al.  Inteligencia Artificial , 2001 .

[2]  A. Willsky Multiresolution Markov models for signal and image processing , 2002, Proc. IEEE.

[3]  Frederick Jelinek,et al.  Statistical methods for speech recognition , 1997 .

[4]  J. R. Rohlicek,et al.  Parameter estimation of dependence tree models using the EM algorithm , 1995, IEEE Signal Processing Letters.

[5]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[6]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[7]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Kevin P. Murphy,et al.  Linear-time inference in Hierarchical HMMs , 2001, NIPS.

[9]  Robert D. Nowak,et al.  Wavelet-based statistical signal processing using hidden Markov models , 1998, IEEE Trans. Signal Process..

[10]  Ali Mohammad-Djafari,et al.  Hidden Markov models for wavelet-based blind source separation , 2006, IEEE Transactions on Image Processing.

[11]  Xiang-Gen Xia,et al.  Improved hidden Markov models in the wavelet-domain , 2001, IEEE Trans. Signal Process..

[12]  Samy Bengio,et al.  An EM Algorithm for HMMs with Emission Distributions Represented by HMMs , 2000 .

[13]  Lawrence Carin,et al.  Infrared-Image Classification Using Hidden Markov Trees , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Yoshua Bengio,et al.  Markovian Models for Sequential Data , 2004 .

[15]  Yoram Singer,et al.  The Hierarchical Hidden Markov Model: Analysis and Applications , 1998, Machine Learning.

[16]  Padhraic Smyth,et al.  Segmental Hidden Markov Models with Random Effects for Waveform Modeling , 2006, J. Mach. Learn. Res..

[17]  Samy Bengio,et al.  Robust speech recognition and feature extraction using HMM2 , 2003, Comput. Speech Lang..

[18]  Gregory R. Grant,et al.  Bioinformatics - The Machine Learning Approach , 2000, Comput. Chem..

[19]  Coarticulation • Suprasegmentals,et al.  Acoustic Phonetics , 2019, The SAGE Encyclopedia of Human Communication Sciences and Disorders.

[20]  Lawrence Carin,et al.  Dual hidden Markov model for characterizing wavelet coefficients from multi-aspect scattering data , 2001, Signal Process..

[21]  Ingrid Daubechies,et al.  Ten Lectures on Wavelets , 1992 .