Short-time synthesis procedures in vector adaptive transform coding of speech

The authors discuss various possible synthesis procedures for frequency-domain vector-excited coding (VXC). The algorithm, called vector adaptive transform coding (VATC), is a two-stage vector quantizer for the short-time Fourier transform (STFT) of speech. One stage represented by an adaptive codebook based on long-term prediction and the other based by a random codebook. The authors focus on the differences between time and frequency VXC, stressing the importance of the long-term prediction. A fast algorithm for long-term prediction is introduced, and different strategies for noise reduction are proposed. VATC is presented as a particular case of a general vector linear prediction (VLP) scheme. Following this approach, a coder based on a self-spectral shaping procedure is proposed. This coder has the potential of producing telephonic-quality speech at rates lower than 4.8 kb/s.<<ETX>>

[1]  Allen Gersho,et al.  Encoding of LPC spectral parameters using switched-adaptive interframe vector prediction (speech coding) , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[2]  Willem Bastiaan Kleijn,et al.  Improved speech quality and efficient vector quantization in SELP , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[3]  Luis A. Hernández Gómez,et al.  High-quality vector adaptive transform coding at 4.8 kb/s , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[4]  Allen Gersho,et al.  Real-time vector excitation coding of speech at 4800 bps , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  Allen Gersho,et al.  s9.9 ENCODING OF LPC SPECTRAL PARAMETERS USING SWITCHED-ADAPTIVE INTERFRAME VECTOR PREDICTION? , 1988 .

[6]  Jack May,et al.  Fourier Transform Vector Quantization for Speech Coding , 1987, IEEE Trans. Commun..

[7]  B. Atal,et al.  Strategies for improving the performance of CELP coders at low bit rates (speech analysis) , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.