Integer Approximate Cosine/Sine-Modulated Filter Banks

Digital audio signals to be processed are integer-valued. But the perfect reconstruction cosine/sine-modulated filter banks and cosine-modulated QMF banks are real-valued transforms which map integer signal into real-valued spectral coefficients. Although their fast algorithms reduce the computational complexity, due to floating-point finite-length representation and corresponding rounding-off errors they cannot be used for lossless audio coding. Actually, almost all modern perceptual audio coding schemes developed so far operate in floating-point arithmetic and therefore, are lossy in nature. However, some audio coding applications require completely lossless preservation of the audio signal. An enabling technology for transform-based lossless audio coding is the integer transform. Integer transform is a transform which maps integers to integers by a reversible (invertible) way so that it preserves all mathematical properties of the original real-valued transform such as perfect reconstruction, energy compaction property and fast algorithm. Indeed, the integer modified discrete cosine transform (IntMDCT) or integer modulated lapped transform (IntMLT) enabled to design and implement this innovative coding technology for scalable lossy to lossless audio coding. The local and global methods to integer approximation of perfect reconstruction cosine/sine-modulated filter banks and cosine-modulated QMF banks are discussed in detail. They are based on computational methods of linear algebra, matrix theory and matrix computations, and in particular, on the matrix decompositions. In fact, the scalar and block matrix decompositions are powerful mathematical tools to construct the reversible (invertible) integer transforms.

[1]  Yong Zhang,et al.  A Scalable and Lossless Audio Coding System based on Integer Transform , 2006, 2006 International Symposium on Communications and Information Technologies.

[2]  Miriam Primbs Worst-case error analysis of lifting-based fast DCT-algorithms , 2005, IEEE Transactions on Signal Processing.

[3]  K.R. Rao,et al.  A comparison of integer fast Fourier transforms for lossless coding , 2004, IEEE International Symposium on Communications and Information Technology, 2004. ISCIT 2004..

[4]  I. Daubechies,et al.  Wavelet Transforms That Map Integers to Integers , 1998 .

[5]  K.R. Rao,et al.  Improved lossless audio coding using the noise-shaped IntMDCT , 2004, 3rd IEEE Signal Processing Education Workshop. 2004 IEEE 11th Digital Signal Processing Workshop, 2004..

[6]  Soontorn Oraintara,et al.  The integer MDCT and its application in the MPEG layer III audio , 2003, Proceedings of the 2003 International Symposium on Circuits and Systems, 2003. ISCAS '03..

[7]  Susanto Rahardja,et al.  A fast algorithm of integer MDCT for lossless audio coding , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[8]  R. W. Schafer,et al.  Lossless compression of digital audio , 2001, IEEE Signal Process. Mag..

[9]  Fábio M. Bayer,et al.  Low-complexity 8-point DCT approximations based on integer functions , 2013, Signal Process..

[10]  Susanto Rahardja,et al.  Integer MDCT with enhanced approximation of the DCT-IV , 2006, IEEE Transactions on Signal Processing.

[11]  Susanto Rahardja,et al.  On Integer MDCT for Perceptual Audio Coding , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[12]  Henrique S. Malvar Lossless and Near-Lossless Audio Compression Using Integer-Reversible Modulated Lapped Transforms , 2007, 2007 Data Compression Conference (DCC'07).

[13]  Michael T. Orchard,et al.  On implementing transforms from integers to integers , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[14]  Pierre Duhamel,et al.  On computing the inverse DFT , 1988, IEEE Trans. Acoust. Speech Signal Process..

[16]  Trac D. Tran The LiftLT: fast-lapped transforms via lifting steps , 2000, IEEE Signal Process. Lett..

[17]  Susanto Rahardja,et al.  A scalable lossy to lossless audio coder for MPEG-4 lossless audio coding , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[18]  Manfred Tasche,et al.  Invertible integer DCT algorithms , 2003 .

[19]  Susanto Rahardja,et al.  MPEG-4 Scalable to Lossless Audio Coding - Emerging International Standard for Digital Audio Compression , 2005, 2005 IEEE 7th Workshop on Multimedia Signal Processing.

[20]  Susanto Rahardja,et al.  Perceptually Enhanced Bit-Plane Coding for Scalable Audio , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[21]  RuiMin Hu,et al.  Scalable Audio Coding Based on Integer Transform , 2006, 2006 First International Conference on Communications and Networking in China.

[22]  Songyu Yu,et al.  1-D and 2-D transforms from integers to integers , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[23]  Gerald Schuller,et al.  Fine grain scalable perceptual and lossless audio coding based on IntMDCT , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[24]  Susanto Rahardja,et al.  MPEG-4 Scalable to Lossless Audio Coding , 2004 .

[25]  Ralf Geiger,et al.  Audio Coding based on Integer Transforms , 2001 .

[26]  G. Plonka,et al.  Fast and numerically stable algorithms for discrete cosine transforms , 2005 .

[27]  Soontorn Oraintara,et al.  Approximation error analysis for transform-based lossless audio coding , 2004, IEEE Global Telecommunications Conference, 2004. GLOBECOM '04..

[28]  Marcus Purat,et al.  Lossless Transform Coding of Audio Signals , 1997 .

[29]  Sridhar Srinivasan,et al.  Modulo transforms - an alternative to lifting , 2006, IEEE Transactions on Signal Processing.

[30]  Vivek K. Goyal Transform coding with integer-to-integer transforms , 2000, IEEE Trans. Inf. Theory.

[31]  Liang Zhao,et al.  The application of integer MDCT in MP3 audio , 2011, Proceedings of 2011 International Conference on Computer Science and Network Technology.

[32]  Pengwei Hao,et al.  Matrix factorizations for reversible integer mapping , 2001, IEEE Trans. Signal Process..

[33]  Tilman Liebchen,et al.  MPEG-4 ALS: an emerging standard for lossless audio coding , 2004, Data Compression Conference, 2004. Proceedings. DCC 2004.

[34]  Gerald Schuller,et al.  Improved integer transforms for lossless audio coding , 2003, The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003.

[35]  Truong Q. Nguyen,et al.  Integer fast Fourier transform , 2002, IEEE Trans. Signal Process..

[36]  Soo Ngee Koh,et al.  Study on rounding errors of IntMDCT in perceptual audio coding , 2005, Seventh IEEE International Symposium on Multimedia (ISM'05).

[37]  Jin Li Reversible FFT and MDCT via matrix lifting , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[38]  Takehiro Moriya,et al.  A design of lossy and lossless scalable audio coding , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[39]  Takehiro Moriya,et al.  Lossless scalable audio coder and quality enhancement , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[40]  Kaoru Sezaki,et al.  Design of lossless LOT and its performance evaluation , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[41]  Jin Li A progressive to lossless embedded audio coder (PLEAC) with reversible modulated lapped transform , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[42]  Gerlind Plonka,et al.  A global method for invertible integer DCT and integer wavelet algorithms , 2004 .

[43]  Jin Li Low noise reversible MDCT (RMDCT) and its application in progressive-to-lossless embedded audio coding , 2005, IEEE Transactions on Signal Processing.

[44]  Jiasong Wu,et al.  Mixed-Radix Algorithm for the Computation of Forward and Inverse MDCTs , 2009, IEEE Transactions on Circuits and Systems I: Regular Papers.

[45]  Thomas Sporer,et al.  Improving Lossless Audio Coding , 1999 .

[46]  Jürgen Herre,et al.  IntMDCT - A link between perceptual and lossless audio coding , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[47]  Soontorn Oraintara,et al.  Fast and lossless implementation of the forward and inverse MDCT computation in MPEG audio coding , 2002, 2002 IEEE International Symposium on Circuits and Systems. Proceedings (Cat. No.02CH37353).

[48]  Soontorn Oraintara,et al.  Lossless Audio Coding Using the IntMDCT and Rounding Error Shaping , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[49]  Susanto Rahardja,et al.  Method for realising reversible integer type-IV discrete cosine transform , 2004 .