Context dependent vector quantization for continuous speech recognition

The authors present a method for designing a vector quantizer for speech recognition that uses decision networks constructed by examining the phonetic context to obtain models for classes in the quantizer. Diagonal Gaussian models are constructed for the vector quantizer classes at each terminal node of the network and are used to label speech parameter vectors during recognition. Experimental results indicate that this method leads to superior vector quantizers for continuous speech.<<ETX>>

[1]  John Makhoul,et al.  Context-dependent modeling for acoustic-phonetic recognition of continuous speech , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[2]  R. Gray,et al.  Applications of information theory to pattern recognition and the design of decision trees and trellises , 1988 .

[3]  Michael Picheny,et al.  Large vocabulary natural language continuous speech recognition , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[4]  Hsiao-Wuen Hon,et al.  Allophone clustering for continuous speech recognition , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[5]  Michael Picheny,et al.  Decision trees for phonological rules in continuous speech , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[6]  Victor Zue,et al.  Modelling Context Dependency in Acoustic-Phonetic and Lexical Representations , 1991, HLT.