Matrix quantization with vector quantization error compensation for robust speech recognition

This paper proposes a robust, speaker-independent IWSR system which combines dual fuzzy matrix quantization (FMQ) and fuzzy vector quantization (FVQ) pairs, or dual MQ/VQ quantization pair with a discrete HMM to efficiently utilize processing resources and improve speech recognition performance. This system exploits the "evolution" of the speech short-term spectral envelopes with error compensation from FVQ/HMM, or VQ/HMM processes to target noise-affected input signal parameters and minimize noise influence. The enhanced processing technology employs a weighted LSP distance measure in the LBG algorithm. Computer simulation using gender-dependent HMMs clearly indicates the superiority over conventional FVQ/HMM and FMQ/HMM systems with 96.48% and 92.8% recognition accuracy at 20 dB and 5 dB SNR levels, respectively in a car noise environment, based on database TIDIGITS.

[1]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[2]  Lin Cong,et al.  Robust speech recognition using fuzzy matrix quantisation and neural networks , 1996, Proceedings of International Conference on Communication Technology. ICCT '96.

[3]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[4]  L. Cong,et al.  Combining neural network classification with fuzzy vector quantization and hidden Markov models for robust isolated word speech recognition , 1995, Proceedings of 1995 IEEE International Symposium on Information Theory.

[5]  Lin Cong,et al.  Robust speech recognition algorithms in a car noise environment , 1998, 9th European Signal Processing Conference (EUSIPCO 1998).

[6]  Thomas W. Parsons,et al.  Voice and Speech Processing , 1986 .

[7]  L. Cong,et al.  Combining fuzzy vector quantization and neural network classification for robust isolated word speech recognition , 1994, Proceedings of ICCS '94.

[8]  Sadaoki Furui,et al.  Advances in Speech Signal Processing , 1991 .