A hybrid coder for hidden Markov models using a recurrent neural networks

A hybrid coder is introduced for obtaining descriptions of speech patterns. This coder uses vector quantization (VQ) techniques on mel-scale cepstral coefficients and their derivatives together with a recurrent network (RN) for describing suprasegmental features of speech. The purpose of these features is to focus the search when hidden Markov models (HMMs) are used for speech unit or word models. Preliminary experiments of speaker-independent connected digit recognition show that using a hybrid coder based on a RN improves recognition performance.<<ETX>>

[1]  John S. Bridle,et al.  Training Stochastic Model Recognition Algorithms as Networks can Lead to Maximum Mutual Information Estimation of Parameters , 1989, NIPS.

[2]  Jerome R. Bellegarda,et al.  Tied mixture continuous parameter models for large vocabulary isolated speech recognition , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[3]  Pietro Laface,et al.  Parallel Algorithms for Syllable Recognition in Continuous Speech , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  R. Gray,et al.  Vector quantization , 1984, IEEE ASSP Magazine.

[5]  R. G. Leonard,et al.  A database for speaker-independent digit recognition , 1984, ICASSP.

[6]  Michael Witbrock,et al.  A connectionist approach to continuous speech recognition , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[7]  Mei-Yuh Hwang,et al.  The SPHINX speech recognition system , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[8]  Biing-Hwang Juang,et al.  HMM clustering for connected word recognition , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[9]  Michael Picheny,et al.  Large vocabulary natural language continuous speech recognition , 1989, International Conference on Acoustics, Speech, and Signal Processing,.