This paper describes the implementation of fast hidden Markov model (HMM) match algorithm in a phoneme-based Malay continuous speech recognition system. The decoding algorithm decouples the computations of state-likelihoods of phone HMM's from the main search which is bounded by syntactical and lexical constraints. This avoids the redundant state-likelihood computations of identical phone HMM's for different word models in the tightly integrated search and thus substantially reduce the decoding time. The algorithm is implemented in the framework of one pass dynamic programming search. For a 541-word task, the fast HMM match reduces the real-time factor (RTF) by a factor of 31.8 from 286.45 × RT to 9.02 × RT, compared to without decoupling. The word accuracy is maintained at 91.6% without loss for a test set perplexity of 15.45 in speaker-dependent mode.
[1]
Hermann Ney,et al.
Dynamic programming search for continuous speech recognition
,
1999,
IEEE Signal Process. Mag..
[2]
Biing-Hwang Juang,et al.
Fundamentals of speech recognition
,
1993,
Prentice Hall signal processing series.
[3]
Alexander Seward,et al.
A fast HMM match algorithm for very large vocabulary speech recognition
,
2004,
Speech Commun..
[4]
Mark J. F. Gales,et al.
State-based Gaussian selection in large vocabulary continuous speech recognition using HMMs
,
1999,
IEEE Trans. Speech Audio Process..
[5]
Sh-Hussain Salleh,et al.
Speaker dependent Malay phone recognition using continuous density hidden markov model
,
2007
.