Efficient transform coding of two-channel audio signals by means of complex-valued stereo prediction

Traditional MDCT-based perceptual audio coding schemes employ mid/side and intensity stereo techniques to allow efficient joint coding of the two channels of a stereophonic signal. These techniques, however, provide only little coding gain for critical stereo signals characterized by spectral components with a distinct level or phase difference between the channels. To overcome this deficiency, we propose an extension to the mid/side coding paradigm that utilizes complex-valued inter-channel linear prediction in the MDCT spectral domain. The required imaginary spectrum (MDST) is calculated in a computationally efficient manner without additional algorithmic delay. A formal listening test conducted in the course of the ISO/MPEG standardization of the unified speech and audio codec USAC illustrates that the proposed stereo prediction approach provides significant improvements in coding efficiency and shows that at 96 kb/s, excellent quality can be obtained even for critical signals.

[1]  J. D. Johnston,et al.  Sum-difference stereo transform coding , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[2]  Raymond N. J. Veldhuis,et al.  Subband coding of stereophonic digital audio signals , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[3]  Philippe Gournay,et al.  Unified speech and audio coding scheme for high quality at low bitrates , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[4]  Ruimin Hu,et al.  Estimating spatial cues for audio coding in MDCT domain , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[5]  J. Hilpert,et al.  The MPEG Surround Audio Coding Standard [Standards in a Nutshell] , 2009, IEEE Signal Processing Magazine.

[6]  RECOMMENDATION ITU-R BS.1534-1 - Method for the subjective assessment of intermediate quality level of coding systems , 2003 .

[7]  Corey I. Cheng,et al.  Method for Estimating Magnitude and Phase in the MDCT Domain , 2004 .

[8]  Jeroen Breebaart,et al.  Low Complexity Parametric Stereo Coding , 2004 .

[9]  Henrique S. Malvar,et al.  Signal processing with lapped transforms , 1992 .

[10]  Bernd Edler,et al.  Aliasing Reduction for Modified Discrete Cosine Transform Domain Filtering and its Application to Speech Enhancement , 2007, 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.