Low-complexity Bandwidth Extension in MDCT domain for low-bitrate speech coding

We propose a low-complexity Bandwidth Extension (BWE) method operating in the Modified Discrete Cosine Transform (MDCT) domain to reduce the bitrate of wideband and super-wideband speech codecs. The proposed method generates a high-frequency signal by copying the MDCT spectrum from the low frequency part, and then adjusts tonality to improve the subjective quality of the generated high-frequency signal. In combination with an MDCT-based transform codec, it requires only 64.9% of the computational complexity of MPEG-4 Spectral Band Replication (SBR). It also achieves subjective quality better than SBR for many speech samples.

[1]  Ronaldus Maria Aarts,et al.  Bandwidth Extension for Speech , 2005 .

[2]  Sugato Chakravarty,et al.  Method for the subjective assessment of intermedi-ate quality levels of coding systems , 2001 .

[3]  Peter Jax,et al.  Bandwidth Extension for Hierarchical Speech and Audio Coding in ITU-T Rec. G.729.1 , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[4]  Per Ekstrand BANDWIDTH EXTENSION OF AUDIO SIGNALS BY SPECTRAL BAND REPLICATION , 2002 .

[5]  Kristofer Kjörling,et al.  Spectral Band Replication, a Novel Approach in Audio Coding , 2002 .

[6]  Yuichiro Takamizawa,et al.  Low power spectral band replication technology for the MPEG-4 audio standard , 2003, Fourth International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint.

[7]  Jürgen Herre,et al.  Enhanced Mpeg-4 Low Delay AAC - Low Bitrate High Quality Communication , 2007 .

[8]  Pasi Ojala,et al.  AMR-WB+: a new audio coding standard for 3rd generation mobile audio services , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..