MDCT-based coder for highly adaptive speech and audio coding

Coding audio material at low bit rates with a consistent quality over a wide range of signals is a current and challenging problem. The high-granularity switched speech and audio coder AMR-WB+ performs especially well for speech and mixed content by promptly adapting its coding model scheme to the signal. However, the high adaptation rate is done at the price of limited performance for non-speech signals. The aim of the paper is to enhance the coding efficiency of AMR-WB+ while maintaining its high flexibility. For this purpose, the original DFT was replaced by the state-of-art transformation MDCT, and the vector quantization by the combination of a scalar quantization and an evolved context-adaptive arithmetic coder. The improvements were measured by both objective and subjective evaluations.

[1]  Sean A. Ramprashad The multimode transform predictive coding paradigm , 2003, IEEE Trans. Speech Audio Process..

[2]  Roch Lefebvre,et al.  8 kbit/s coding of speech with 6 ms frame-length , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Heiko Purnhagen,et al.  A Closer Look into MPEG-4 High Efficiency AAC , 2003 .

[4]  Juin-Hwey Chen,et al.  Transform predictive coding of wideband speech signals , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[5]  Marc Antonini,et al.  Transform Audio Coding with Arithmetic-Coded Scalar Quantization and Model-Based Bit Allocation , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[6]  Philippe Gournay,et al.  Unified speech and audio coding scheme for high quality at low bitrates , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[7]  Pasi Ojala,et al.  AMR-WB+: a new audio coding standard for 3rd generation mobile audio services , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[8]  Bernd Edler,et al.  Improved Quantization and Lossless Coding for Subband Audio Coding , 2005 .

[9]  Roch Lefebvre,et al.  The adaptive multirate wideband speech codec (AMR-WB) , 2002, IEEE Trans. Speech Audio Process..