Variable rate spectral quantization for phonetically classified CELP coding

Variable rate quantization of the linear predictive coding (LPC) parameters based on phonetic classification of the speech frame results in substantial performance gain. Speech frames are classified as unvoiced or voiced and are separately quantized with VQ codebooks designed for each class. Performance results, including listening tests, show that for transparent quality roughly 9 bits is sufficient for unvoiced frames and 24 bits for voiced frames. Test results of LPC quantization are described for a variable rate phonetically segmented CELP coder and for the synthesis of speech from the prediction residual.

[1]  Allen Gersho,et al.  Phonetically-based vector excitation coding of speech at 3.6 kbps , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[2]  Allen Gersho,et al.  Variable rate speech coding with phonetic segmentation , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Allen Gersho,et al.  Variable bit-rate CELP coding of speech with phonetic classification , 1994, Eur. Trans. Telecommun..

[4]  K. Paliwal,et al.  Efficient vector quantization of LPC parameters at 24 bits/frame , 1990 .

[5]  Gernot Kubin,et al.  Performance of noise excitation for unvoiced speech , 1993, Proceedings., IEEE Workshop on Speech Coding for Telecommunications,.

[6]  T. M. Liu,et al.  Phonetically-based LPC vector quantization of high quality speech , 1989, EUROSPEECH.