Spectral amplitude warping (SAW) for noise spectrum shaping in audio coding

In this paper, we present a new approach to shaping the coding noise in speech and audio coders. The approach, called spectral amplitude warping (SAW), consists essentially of a pre-processing and a post-processing stage that apply a nonlinear transformation to the short-term spectrum of the signal before encoding and after decoding, respectively. Since SAW can be viewed as an entity separate from the coder, the noise shaping capability of an existing coder can be improved without modifying the coder itself. With SAW applied as a pre- and post-process to the G.722 wideband speech coding standard, an informal listening test found that the quality of the 64 kb/s operating mode can be achieved at only 48 kb/s. The price to be paid is an additional delay.
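
The pre-/post-processing idea can be illustrated with a minimal sketch. The abstract does not specify the warping function, so the power-law compression of spectral magnitudes used here (exponent `alpha`, phase left unchanged) is purely an assumption for illustration, as are the function names `saw_pre` and `saw_post`. Windowing and overlap-add between frames are omitted; the sketch processes a single frame with a naive DFT.

```python
import cmath
import math


def dft(frame):
    """Naive discrete Fourier transform of one real-valued frame."""
    n = len(frame)
    return [
        sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame))
        for k in range(n)
    ]


def idft(spectrum):
    """Naive inverse DFT; returns the real part of each time sample."""
    n = len(spectrum)
    return [
        (sum(c * cmath.exp(2j * math.pi * k * t / n)
             for k, c in enumerate(spectrum)) / n).real
        for t in range(n)
    ]


def warp_amplitudes(spectrum, exponent):
    """Raise each bin magnitude to `exponent` while keeping its phase."""
    return [cmath.rect(abs(c) ** exponent, cmath.phase(c)) for c in spectrum]


def saw_pre(frame, alpha=0.5):
    """Hypothetical pre-processor: compress magnitudes (assumed power law)."""
    return idft(warp_amplitudes(dft(frame), alpha))


def saw_post(frame, alpha=0.5):
    """Hypothetical post-processor: expand magnitudes with the inverse exponent."""
    return idft(warp_amplitudes(dft(frame), 1.0 / alpha))


# Round trip without any coder in between: the frame should be recovered.
frame = [0.0, 1.0, 0.5, -0.3, 0.8, -1.0, 0.2, 0.1]
restored = saw_post(saw_pre(frame))
```

In an actual system the chain would be `saw_pre`, encoder, decoder, `saw_post`, so that noise added by the coder in the warped domain is reshaped by the expansion stage. Buffering a frame for the transform is one plausible source of the additional delay mentioned above.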
