Paper information - A new speech recognition method based on VQ-distortion measure and HMM

A new speech recognition method based on VQ-distortion measure and HMM

A speech recognition method that integrates a VQ (vector quantization) distortion measure with a discrete HMM (hidden Markov model) is proposed. This VQ-distortion-based HMM uses a VQ-distortion measure at each state in place of the discrete output probability used by a discrete HMM. Although the method can be regarded as a refined version of the VQ-distortion-based recognition method proposed by D.K. Burton et al. (IEEE Trans., vol. ASSP-33, no. 4, pp. 837-849, 1985), it can also be considered a special case of a mixture density HMM. The authors describe the relationship between the VQ-distortion-based HMM and conventional HMMs, and compare their speech recognition performance through experiments on speaker-dependent digit recognition. The new method achieved a recognition accuracy of 100%.
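The idea of scoring each state by VQ distortion rather than by a discrete output probability can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a left-to-right model with one small codebook per state, squared Euclidean distance as the distortion measure, and no transition costs. The names `vq_distortion`, `score_utterance`, and `recognize` are hypothetical. The word whose model gives the lowest accumulated distortion along the best state path is chosen.

```python
from typing import Dict, List, Sequence

Vector = Sequence[float]
Codebook = List[Vector]  # one codebook per HMM state (assumption)


def vq_distortion(frame: Vector, codebook: Codebook) -> float:
    """Squared Euclidean distance from a frame to its nearest codeword."""
    return min(sum((a - b) ** 2 for a, b in zip(frame, cw)) for cw in codebook)


def score_utterance(frames: List[Vector], state_codebooks: List[Codebook]) -> float:
    """Minimum accumulated distortion over left-to-right state paths.

    The path must start in the first state and end in the last one.
    Transition costs are omitted (an assumption of this sketch).
    """
    inf = float("inf")
    n_states = len(state_codebooks)
    # prev[s]: best accumulated distortion ending in state s at the previous frame
    prev = [inf] * n_states
    prev[0] = vq_distortion(frames[0], state_codebooks[0])
    for frame in frames[1:]:
        cur = [inf] * n_states
        for s in range(n_states):
            # Either stay in state s or advance from state s - 1.
            best_prev = prev[s] if s == 0 else min(prev[s], prev[s - 1])
            if best_prev < inf:
                cur[s] = best_prev + vq_distortion(frame, state_codebooks[s])
        prev = cur
    return prev[-1]


def recognize(frames: List[Vector], models: Dict[str, List[Codebook]]) -> str:
    """Return the label of the model with the lowest accumulated distortion."""
    return min(models, key=lambda word: score_utterance(frames, models[word]))


# Toy example with made-up 2-D features and two-state word models.
models = {
    "one": [[(0.0, 0.0)], [(1.0, 1.0)]],
    "two": [[(5.0, 5.0)], [(6.0, 6.0)]],
}
frames = [(0.1, 0.0), (0.0, 0.1), (1.0, 0.9)]
best = recognize(frames, models)
```

The link to mixture density HMMs can be seen from this form: if each codeword is treated as the mean of an equally weighted Gaussian with a shared spherical covariance, and the mixture sum is approximated by its largest term, the negative log output probability equals the minimum squared distance up to a scale factor and an additive constant.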

Hideyuki Suzuki | Seiichi Nakagawa

[1] John E. Shore, et al. Discrete utterance speech recognition without time alignment, 1983, IEEE Trans. Inf. Theory.

[2] D. Burton, et al. Isolated-word speech recognition using multisection vector quantization codebooks, 1984, IEEE Trans. Acoust. Speech Signal Process.

[3] Seiichi Nakagawa, et al. Speaker-Independent English Consonant and Japanese Word Recognition by a Stochastic Dynamic Time Warping Method, 1988.

[4] Frank K. Soong, et al. On the use of instantaneous and transitional spectral information in speaker recognition, 1988, IEEE Trans. Acoust. Speech Signal Process.

[5] Xuedong Huang, et al. Semi-continuous hidden Markov models for speech signals, 1990.

[6] Hervé Bourlard, et al. Probability estimation by feed-forward networks in continuous speech recognition, 1991, Neural Networks for Signal Processing Proceedings of the 1991 IEEE Workshop.

[7] M. Sugiyama, et al. Automatic language recognition using acoustic features, 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[8] Seiichi Nakagawa, et al. Speaker-independent, text-independent language identification by HMM, 1992, ICSLP.

[9] Sadaoki Furui, et al. Comparison of text-independent speaker recognition methods using VQ-distortion and discrete/continuous HMMs, 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.