Low-rate quantization of spectrum parameters

In this paper, we generalize the standard blockwise linear predictive (LP) coding by introducing low-pass filtering and downsampling of the LPC vectors at the encoder side, accompanied by interpolation at the decoder. Several concepts in LP coding, such as overlapping frames, interpolation and long analysis frames, can be described in the proposed framework. We also note that the proposed methods are in agreement with previous work on spectral dynamics, which indicates that spectral dynamics is (at least) as important as spectral distortion for speech quality. We have applied the proposed method to low-rate quantization of spectrum parameters. Objective and subjective performance of the new approach is compared to a standard blockwise LP coding, at various rates. We show that both the subjective and objective quality of the new method compare favorably with standard methods. Even at as low rate as 500 bit/s, the proposed method has a good quality.

[1]  P. Hedelin,et al.  Recursive coding of spectrum parameters , 1999, 1999 IEEE Workshop on Speech Coding Proceedings. Model, Coders, and Error Criteria (Cat. No.99EX351).

[2]  K. Paliwal,et al.  Efficient vector quantization of LPC parameters at 24 bits/frame , 1990 .

[3]  Jonas Samuelsson,et al.  Recursive coding of spectrum parameters , 2001, IEEE Trans. Speech Audio Process..

[4]  Thomas P. Barnwell,et al.  A 2.4 kbit/s MELP coder candidate for the new U.S. Federal Standard , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[5]  R. Hagen,et al.  On memoryless quantization in speech coding , 1996, IEEE Signal Processing Letters.

[6]  W. Bastiaan Kleijn,et al.  Spectral dynamics is more important than spectral distortion , 1995, ICASSP.

[7]  Costas S. Xydeas,et al.  Segmental prototype interpolation coding , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[8]  Yair Shoham Very low complexity interpolative speech coding at 1.2 to 2.4 kbps , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  Thomas Eriksson,et al.  Interframe LSF quantization for noisy channels , 1999, IEEE Trans. Speech Audio Process..

[10]  Yair Shoham,et al.  A low-complexity waveform interpolation coder , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[11]  K. Paliwal,et al.  Quantization of LPC Parameters , 2022 .