A new speech recognition method based on VQ-distortion measure and HMM

A speech recognition method which integrates a VQ (vector quantization)-distortion measure and a discrete HMM (hidden Markov model) is proposed. This VQ-distortion-based HMM uses a VQ-distortion measure at each state instead of the discrete output probability used by a discrete HMM. Although this method can be regarded as a refined version of the VQ-distortion-based recognition method proposed by D.K. Burton et al. (IEEE Trans., vol. ASSP-33, no. 4, pp. 837-849, 1985), it can also be considered a special case of a mixture density HMM. The authors describe the relationship between the VQ-distortion-based HMM and conventional HMMs, and compare their speech recognition performance through experiments on speaker-dependent digit recognition. The new method achieved a recognition accuracy of 100%.
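The idea of scoring each state by a VQ distortion rather than by a discrete output probability can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions the abstract does not state: squared Euclidean distance as the distortion measure, a left-to-right topology with only self-loop and next-state transitions, and transition costs ignored. The names `vq_distortion`, `accumulated_distortion` and `recognize` are hypothetical.

```python
import math

def vq_distortion(frame, codebook):
    """Smallest squared Euclidean distance from a frame to any codeword.
    (Assumed distortion measure; the abstract does not specify one.)"""
    return min(
        sum((x - c) ** 2 for x, c in zip(frame, codeword))
        for codeword in codebook
    )

def accumulated_distortion(frames, state_codebooks):
    """Viterbi-style search over a left-to-right model in which the local
    cost of a state is its VQ distortion, not -log of an output probability.
    Transition costs are ignored here (an assumption)."""
    n_states = len(state_codebooks)
    # The path is forced to start in the first state.
    prev = [math.inf] * n_states
    prev[0] = vq_distortion(frames[0], state_codebooks[0])
    for frame in frames[1:]:
        cur = [math.inf] * n_states
        for s in range(n_states):
            # Either stay in state s or arrive from state s - 1.
            best = prev[s] if s == 0 else min(prev[s], prev[s - 1])
            if best < math.inf:
                cur[s] = best + vq_distortion(frame, state_codebooks[s])
        prev = cur
    # The path is forced to end in the last state.
    return prev[-1]

def recognize(frames, models):
    """Return the label of the model with the smallest accumulated distortion."""
    return min(models, key=lambda label: accumulated_distortion(frames, models[label]))

# Toy data: two 2-state word models, each state holding a one-codeword codebook.
models = {
    "one": [[(0.0, 0.0)], [(1.0, 1.0)]],
    "two": [[(1.0, 1.0)], [(0.0, 0.0)]],
}
frames = [(0.1, 0.0), (0.0, 0.1), (0.9, 1.0), (1.0, 1.1)]
best_label = recognize(frames, models)
```

In this toy example the frames move from near (0, 0) to near (1, 1), so the model labelled "one" accumulates the smaller distortion. A conventional discrete HMM would instead quantize each frame with a single shared codebook and look up a per-state probability for the resulting codeword index.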

[1]  John E. Shore, et al. Discrete utterance speech recognition without time alignment, 1983, IEEE Trans. Inf. Theory.

[2]  D. Burton, et al. Isolated-word speech recognition using multisection vector quantization codebooks, 1984, IEEE Trans. Acoust. Speech Signal Process.

[3]  Seiichi Nakagawa, et al. Speaker-Independent English Consonant and Japanese Word Recognition by a Stochastic Dynamic Time Warping Method, 1988.

[4]  Frank K. Soong, et al. On the use of instantaneous and transitional spectral information in speaker recognition, 1988, IEEE Trans. Acoust. Speech Signal Process.

[5]  Xuedong Huang, et al. Semi-continuous hidden Markov models for speech signals, 1990.

[6]  Hervé Bourlard, et al. Probability estimation by feed-forward networks in continuous speech recognition, 1991, Neural Networks for Signal Processing Proceedings of the 1991 IEEE Workshop.

[7]  M. Sugiyama, et al. Automatic language recognition using acoustic features, 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[8]  Seiichi Nakagawa, et al. Speaker-independent, text-independent language identification by HMM, 1992, ICSLP.

[9]  Sadaoki Furui, et al. Comparison of text-independent speaker recognition methods using VQ-distortion and discrete/continuous HMMs, 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.