Nonuniform Orthogonal Filterbanks Based on MDCT Analysis/Synthesis and Time-Domain Aliasing Reduction

In this letter, we describe nonuniform orthogonal filterbanks based on modified discrete cosine transform (MDCT) analysis/synthesis and time-domain aliasing reduction (TDAR). By adding a postprocessing step to the MDCT, our method allows for arbitrary nonuniform frequency resolutions using subband merging with smooth windowing and overlap in frequency. This overlap improves the temporal compactness of the impulse response, which is especially useful for audio coders. The postprocessing step comprises another lapped MDCT along the frequency axis and TDAR along each subband signal.
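For orientation, the uniform MDCT analysis/synthesis stage that the method builds on can be sketched as follows. This is a minimal illustration only, not the letter's implementation: it covers the plain MDCT with time-domain aliasing cancellation and does not attempt the subband merging or TDAR postprocessing. The sine window, the orthonormal scaling, the zero padding at the signal edges, and all function names are assumptions chosen for the example.

```python
import math


def sine_window(N):
    # Assumed window: the sine window satisfies w[n]^2 + w[n+N]^2 = 1.
    return [math.sin(math.pi * (n + 0.5) / (2 * N)) for n in range(2 * N)]


def mdct(frame, window):
    # Maps 2N windowed samples to N coefficients (orthonormal scaling).
    N = len(frame) // 2
    scale = math.sqrt(2.0 / N)
    return [
        scale * sum(
            window[n] * frame[n]
            * math.cos(math.pi / N * (n + 0.5 + N / 2) * (k + 0.5))
            for n in range(2 * N)
        )
        for k in range(N)
    ]


def imdct(coeffs, window):
    # Maps N coefficients back to 2N windowed, time-aliased samples.
    N = len(coeffs)
    scale = math.sqrt(2.0 / N)
    return [
        window[n] * scale * sum(
            coeffs[k]
            * math.cos(math.pi / N * (n + 0.5 + N / 2) * (k + 0.5))
            for k in range(N)
        )
        for n in range(2 * N)
    ]


def analysis(signal, N):
    # Frames of length 2N with hop size N; len(signal) must be a multiple of N.
    # N zeros are padded on each side so every sample lies in two frames.
    window = sine_window(N)
    padded = [0.0] * N + list(signal) + [0.0] * N
    return [
        mdct(padded[start:start + 2 * N], window)
        for start in range(0, len(padded) - N, N)
    ]


def synthesis(frames, N):
    # Overlap-add of the inverse-transformed frames cancels the aliasing.
    window = sine_window(N)
    out = [0.0] * (N * (len(frames) + 1))
    for i, coeffs in enumerate(frames):
        block = imdct(coeffs, window)
        for n in range(2 * N):
            out[i * N + n] += block[n]
    return out[N:-N]  # strip the zero padding
```

Calling `synthesis(analysis(x, N), N)` on a signal `x` whose length is a multiple of `N` should return `x` up to floating-point rounding. The direct O(N^2) sums are chosen for readability; a practical coder would use a fast algorithm instead.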
