Speech coding with time-varying bit allocations to excitation and LPC parameters

The authors explore the benefits of time-varying bit allocation to excitation and LPC (linear predictive coding) parameters for the case of codebook-excited LPC. The overall bit rate in the experiment was 4.8, 6.4, or 8.0 kb/s. In each case, the permissible bit allocations for the LPC component were 0, 24, 36, or 48 bits per frame, one of which was selected for each speech frame using a brute-force search for maximum performance. Average SNR gains over conventional time-invariant methods were modest, on the order of 1 to 2 dB, but gains for certain speech segments were as high as 3 to 5 dB. Perceptually, gains due to variable bit allocation were most noticeable in the 6.4 kb/s system, especially with female speakers. However, even in this case, the benefits of flexible bit allocation were somewhat offset by distortions due to other inadequacies in the coding algorithm.
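
The per-frame selection described above can be illustrated with a minimal sketch. It is not the authors' implementation: the 20 ms frame duration, the function names, and the `score` callback (a stand-in for encoding a frame with a given split and measuring its SNR) are all assumptions introduced here. Only the LPC options of 0, 24, 36, and 48 bits and the overall bit rates come from the abstract.

```python
from typing import Any, Callable, Optional, Sequence, Tuple

# LPC bit options per frame, as listed in the abstract.
LPC_BIT_OPTIONS: Tuple[int, ...] = (0, 24, 36, 48)


def bits_per_frame(rate_bps: int, frame_ms: int) -> int:
    """Total bit budget of one frame at a fixed overall bit rate.

    The frame duration is an assumption; the abstract does not state it.
    """
    return rate_bps * frame_ms // 1000


def choose_allocation(
    frame: Any,
    total_bits: int,
    score: Callable[[Any, int, int], float],
    options: Sequence[int] = LPC_BIT_OPTIONS,
) -> Optional[Tuple[int, int, float]]:
    """Brute-force search: try every LPC option and keep the best score.

    `score(frame, lpc_bits, excitation_bits)` is a hypothetical callback
    that would encode the frame with the given split and return a
    quality measure such as SNR in dB (higher is better).
    Returns (lpc_bits, excitation_bits, score), or None if no option fits.
    """
    best: Optional[Tuple[int, int, float]] = None
    for lpc_bits in options:
        excitation_bits = total_bits - lpc_bits  # remainder goes to excitation
        if excitation_bits < 0:
            continue  # this option does not fit in the frame budget
        s = score(frame, lpc_bits, excitation_bits)
        if best is None or s > best[2]:
            best = (lpc_bits, excitation_bits, s)
    return best


if __name__ == "__main__":
    budget = bits_per_frame(4800, 20)  # assumed 20 ms frame at 4.8 kb/s

    # Toy scoring function for demonstration only; a real coder would
    # encode the frame and measure the SNR for each split.
    def toy_score(frame: Any, lpc_bits: int, excitation_bits: int) -> float:
        return -abs(lpc_bits - 36)

    print(budget, choose_allocation(None, budget, toy_score))
```

Because only four options are tried per frame, the exhaustive search costs four trial encodings per frame. Any side information needed to signal the chosen option to the decoder is not modeled in this sketch.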