Scelp: Lowdelay audio coding with noise shaping based on spherical vector quantization

In this contribution a new wideband audio coding concept is presented that provides good audio quality at bit rates below 3 bits per sample with an algorithmic delay of less than 10 ms. The new concept is based on the principle of Linear Predictive Coding (LPC) in an analysis-by-synthesis framework, as known from speech coding. A spherical codebook is used for quantization at bit rates which are higher in comparison to low bit rate speech coding for improved performance for audio signals. For superior audio quality, noise shaping is employed to mask the coding noise. In order to reduce the computational complexity of the encoder, the analysis-by-synthesis framework has been adapted for the spherical codebook to enable a very efficient excitation vector search procedure. The codec principle can be adapted to a large variety of application scenarios. In terms of audio quality, the new codec outperforms ITU-T G.722 [4] at the same bit rate of 48 kbit/sec and a sample rate of 16 kHz.

[1]  Bernd Matschkal,et al.  Spherical logarithmic quantization and its application for DPCM , 2004 .

[2]  K. H. Barratt Digital Coding of Waveforms , 1985 .

[3]  Lane A. Hemaspaandra,et al.  Using simulated annealing to design good codes , 1987, IEEE Trans. Inf. Theory.

[4]  Manfred R. Schroeder,et al.  Code-excited linear prediction(CELP): High-quality speech at very low bit rates , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  Jon Hamkins,et al.  Design and analysis of spherical codes , 1996 .

[6]  Kuldip K. Paliwal,et al.  Efficient vector quantization of LPC parameters at 24 bits/frame , 1993, IEEE Trans. Speech Audio Process..

[7]  Peter No,et al.  Digital Coding of Waveforms , 1986 .

[8]  Xavier Maitre,et al.  7 kHz audio coding within 64 kbit/s , 1988, IEEE J. Sel. Areas Commun..

[9]  Claude Lamblin,et al.  Baseband speech coding at 2400 bps using "Spherical vector quantization" , 1984, ICASSP.

[10]  A. Spanias,et al.  Perceptual coding of digital audio , 2000, Proceedings of the IEEE.