Lossless and Near-Lossless Audio Compression Using Integer-Reversible Modulated Lapped Transforms

We present a simple lossless audio codec, composed of an integer-reversible modulated lapped transform (MLT) followed by a backward-adaptive run-length/Golomb-Rice (RLGR) encoder. Its compression performance matches that of state-of-the-art predictive codecs, and it has the advantage that its compressed bitstream contains frequency-domain data that can be used for applications such as search, identification, and visualization. Its compression gain can be improved through a novel data model based on cross-block smoothed spectral magnitude estimates. Its bitstream can be transcoded into a lossy format, for transfer to portable players, at about twice the speed of other codecs. The codec also supports a near-lossless mode, which allows an extra factor of two in compression without noticeable distortion.
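To illustrate what "integer-reversible" means here, the following is a minimal sketch of a plane rotation built from three lifting (shear) steps with rounding, a common building block for reversible lapped transforms. It is not taken from the paper: the function names `lift_rotate` and `lift_unrotate` are hypothetical, and the codec's actual MLT factorization may differ. The point is only that each rounded step can be undone exactly by subtracting the same rounded quantity, so integer inputs map to integer outputs with no information lost.

```python
import math

def lift_rotate(x, y, theta):
    """Approximate a rotation of the integer pair (x, y) by theta.

    Uses three lifting steps with rounding, so the result is an integer pair.
    theta must not be a multiple of pi (sin(theta) would be zero).
    Hypothetical helper for illustration only.
    """
    p = (math.cos(theta) - 1.0) / math.sin(theta)
    s = math.sin(theta)
    x = x + round(p * y)   # first shear
    y = y + round(s * x)   # second shear
    x = x + round(p * y)   # third shear
    return x, y

def lift_unrotate(x, y, theta):
    """Exact inverse of lift_rotate: undo the shears in reverse order."""
    p = (math.cos(theta) - 1.0) / math.sin(theta)
    s = math.sin(theta)
    x = x - round(p * y)
    y = y - round(s * x)
    x = x - round(p * y)
    return x, y

if __name__ == "__main__":
    theta = math.pi / 5
    a, b = lift_rotate(1234, -567, theta)
    print((a, b), lift_unrotate(a, b, theta))
```

Because every step adds a rounded function of the other coordinate, the inverse recomputes the same rounded value and subtracts it, which is why the round trip is exact even though the forward result only approximates the true rotation.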
