Nonstationary hidden Markov model

Abstract The standard hidden Markov model (HMM) has often been pointed out for its inappropriateness in capturing state duration behavior. Explicit state duration modeling in the HMM has been developed but it is not sufficient for modeling the intrinsically dynamic, or nonstationary, transition process. Nevertheless, most research efforts have been concerned with only within-state nonstationarity, e.g., variable state duration and regional symbol correlation. In this paper we explore the nonstationarity of Markov chains and propose a nonstationary HMM that is defined with a set of dynamic transition probability parameters A ( τ ) = { a ij ( τ )}, a function of time duration τ. The model, when compared to the traditional models, is defined as a generalization of the standard HMM and the state duration HMM, with the description being given for discrete observation distributions. Through a set of experiments, it has been shown that the proposed model is more capable of capturing the dynamic nature of signals with higher discrimination power in on-line character recognition.

[1]  Biing-Hwang Juang,et al.  Hidden Markov Models for Speech Recognition , 1991 .

[2]  Stephen E. Levinson,et al.  Continuously variable duration hidden Markov models for speech analysis , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Mari Ostendorf,et al.  A stochastic segment model for phoneme-based continuous speech recognition , 1989, IEEE Trans. Acoust. Speech Signal Process..

[4]  S. Roucos,et al.  A stochastic segment model for phoneme-based continuous speech recognition , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  Louis A. Liporace,et al.  Maximum likelihood estimation for multivariate observations of Markov sources , 1982, IEEE Trans. Inf. Theory.

[6]  Biing-Hwang Juang,et al.  Maximum likelihood estimation for multivariate mixture observations of markov chains , 1986, IEEE Trans. Inf. Theory.

[7]  Oded Ghitza,et al.  Hidden Markov models with templates as non-stationary states: an application to speech recognition , 1993, Comput. Speech Lang..

[8]  L. Baum,et al.  An inequality with applications to statistical estimation for probabilistic functions of Markov processes and to a model for ecology , 1967 .

[9]  C. J. Wellekens,et al.  Explicit time correlation in hidden Markov models for speech recognition , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  Biing-Hwang Juang,et al.  Mixture autoregressive hidden Markov models for speech signals , 1985, IEEE Trans. Acoust. Speech Signal Process..

[11]  A. B. Poritz,et al.  Linear predictive hidden Markov models and the speech signal , 1982, ICASSP.

[12]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[13]  Patrick Kenny,et al.  A linear predictive HMM for vector-valued observations with applications to speech recognition , 1990, IEEE Trans. Acoust. Speech Signal Process..

[14]  Li Deng,et al.  A stochastic model of speech incorporating hierarchical nonstationarity , 1993, IEEE Trans. Speech Audio Process..