Harmonic and noise coding of LPC residuals with classified vector quantization

An efficient coding scheme for linear predictive coding (LPC) residuals is proposed based on harmonic and noise representation. New features of the scheme include classified vector quantization of the spectral envelope of LPC residuals with a weighted distortion measure. The improvement in performance obtained by classifying codebooks based on a voiced/unvoiced (V/UV) decision is shown. Sequences of the short-term RMS power of the time domain waveforms are also vector quantized and transmitted for unvoiced signals. A fast synthesis algorithm for voiced signals using an FFT is also presented, which reduces the high complexity of the direct sinusoidal synthesis method with interpolated magnitudes and phases. Informal listening tests indicate that, in combination with a known LSP quantization technique, this residual coding scheme provides good communication quality at a total bit rate of less than 2.0 kbps.

[1]  Thomas F. Quatieri,et al.  Speech analysis/Synthesis based on a sinusoidal representation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[2]  Luís B. Almeida,et al.  Harmonic coding at 4.8 kb/s , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[3]  Takehiro Moriya Two-Channel Conjugate Vector Quantizer for Noisy Channel Speech Coding , 1992, IEEE J. Sel. Areas Commun..

[4]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[5]  Jun Matsumoto,et al.  Vector quantized MBE with simplified V/UV division at 3.0 kbit/s , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  Jae S. Lim,et al.  Multiband excitation vocoder , 1988, IEEE Transactions on Acoustics, Speech, and Signal Processing.

[7]  Yair Shoham High-quality speech coding at 2.4 to 4.0 kbit/s based on time-frequency interpolation , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[8]  Manfred R. Schroeder,et al.  Code-excited linear prediction(CELP): High-quality speech at very low bit rates , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.