Low delay perceptually lossless coding of audio signals

A novel predictive lossless coding scheme is proposed. The prediction is based on a new weighted cascaded least mean squares (WCLMS) method. To obtain both a high compression ratio and a very low encoding and decoding delay, the prediction residuals are encoded using either a variant of adaptive Huffman coding or a version of adaptive arithmetic coding. WCLMS is designed specifically for music and speech signals. It can be used either in combination with psycho-acoustically pre-filtered signals to obtain perceptually lossless coding, or as a stand-alone lossless coder. Experiments on a database of moderate size and a variety of pre-filtered mono signals show that the proposed lossless coder (which needs about 2 bits/sample for pre-filtered signals) achieves higher compression ratios than competing lossless coders such as ppmz, bzip2, Shorten, and LPAC. Combining WCLMS with either of the adaptive coding schemes is also shown to achieve better compression ratios and lower delay than an earlier scheme that combined WCLMS with Huffman coding over blocks of 4096 samples.
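To make the idea of sample-by-sample cascaded prediction concrete, the following is a minimal sketch and not the WCLMS method itself. It assumes a cascade of two normalized LMS (NLMS) stages whose predictions are simply summed; the weighting rule of WCLMS, the filter orders, and the step size are not given in this abstract, so the values and all names here (`NLMSStage`, `CascadePredictor`, `encode`, `decode`) are hypothetical. The entropy coding of the residuals (adaptive Huffman or adaptive arithmetic coding) is omitted.

```python
import math


class NLMSStage:
    """One normalized-LMS linear predictor (illustrative, not from the paper)."""

    def __init__(self, order, mu):
        self.w = [0.0] * order     # adaptive filter coefficients
        self.hist = [0.0] * order  # past inputs, most recent first
        self.mu = mu               # step size (assumed value)

    def predict(self):
        return sum(w * h for w, h in zip(self.w, self.hist))

    def update(self, actual, predicted):
        err = actual - predicted
        norm = 1e-6 + sum(h * h for h in self.hist)
        gain = self.mu * err / norm
        self.w = [w + gain * h for w, h in zip(self.w, self.hist)]
        self.hist = [actual] + self.hist[:-1]
        return err  # this error is the input of the next stage


class CascadePredictor:
    """Each stage predicts the error left by the stage before it."""

    def __init__(self, orders=(16, 8), mu=0.5):
        self.stages = [NLMSStage(n, mu) for n in orders]
        self.preds = [0.0] * len(self.stages)

    def predict(self):
        self.preds = [s.predict() for s in self.stages]
        return sum(self.preds)  # assumption: equal weights, unlike WCLMS

    def update(self, sample):
        target = float(sample)
        for stage, p in zip(self.stages, self.preds):
            target = stage.update(target, p)


def encode(samples):
    """Map integer samples to integer prediction residuals, one at a time."""
    predictor = CascadePredictor()
    residuals = []
    for x in samples:
        p = round(predictor.predict())  # integer prediction keeps coding lossless
        residuals.append(x - p)
        predictor.update(x)
    return residuals


def decode(residuals):
    """Mirror the encoder: same predictor state, so samples are recovered exactly."""
    predictor = CascadePredictor()
    samples = []
    for r in residuals:
        x = r + round(predictor.predict())
        samples.append(x)
        predictor.update(x)
    return samples


if __name__ == "__main__":
    signal = [int(1000 * math.sin(0.05 * n)) for n in range(2000)]
    residuals = encode(signal)
    print("lossless round trip:", decode(residuals) == signal)
    print("mean |sample|  :", sum(map(abs, signal)) / len(signal))
    print("mean |residual|:", sum(map(abs, residuals)) / len(residuals))
```

Because the predictor adapts backward from already decoded samples, no coefficients need to be transmitted and no block of samples has to be buffered, which is the property that keeps the delay low in a scheme of this kind. In a complete coder the residuals would then be passed to an adaptive entropy coder.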
