Extending Fine-Grain Scalable Audio Coding to Very Low Bitrates using Overcomplete Dictionaries

Signal representations in overcomplete dictionaries are considered here as an alternative to traditional transform representations for fine-grain scalable audio coding. Such representations produce sparser decompositions and thus allow better coding efficiency than transform coding at very low bitrates. Moreover, the decomposition algorithms are intrinsically progressive and flexible enough to allow efficient transient modeling. In this paper, we propose a fine-grain scalable audio coder that operates over a wide range of bitrates (2 kbit/s to 128 kbit/s). Objective measures and informal subjective evaluation show that this coder outperforms a comparable transform-based coder at very low bitrates.
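To illustrate what "intrinsically progressive" can mean for a decomposition in an overcomplete dictionary, the following is a minimal sketch of a greedy decomposition in the style of matching pursuit. The abstract does not name the algorithm or the dictionary used by the coder, so both are assumptions here: the dictionary is a toy union of a DCT basis and a Dirac basis, and all function names (`build_dictionary`, `matching_pursuit`, `reconstruct`) are hypothetical.

```python
import math

def dct_atom(k, n):
    # Unit-norm cosine atom of frequency index k (assumed toy "tonal" atom).
    atom = [math.cos(math.pi * (i + 0.5) * k / n) for i in range(n)]
    norm = math.sqrt(sum(a * a for a in atom))
    return [a / norm for a in atom]

def dirac_atom(k, n):
    # Unit-norm impulse at sample k (assumed toy "transient" atom).
    return [1.0 if i == k else 0.0 for i in range(n)]

def build_dictionary(n):
    # 2n atoms for an n-sample signal: overcomplete by a factor of two.
    return [dct_atom(k, n) for k in range(n)] + [dirac_atom(k, n) for k in range(n)]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def matching_pursuit(signal, dictionary, n_iter):
    # Greedy loop: pick the atom most correlated with the residual,
    # subtract its contribution, and repeat.
    residual = list(signal)
    picks = []                            # ordered list of (atom index, coefficient)
    energies = [dot(residual, residual)]  # residual energy after each step
    for _ in range(n_iter):
        corr = [dot(residual, atom) for atom in dictionary]
        best = max(range(len(dictionary)), key=lambda j: abs(corr[j]))
        coeff = corr[best]
        residual = [r - coeff * a for r, a in zip(residual, dictionary[best])]
        picks.append((best, coeff))
        energies.append(dot(residual, residual))
    return picks, energies

def reconstruct(picks, dictionary, n):
    # Any prefix of `picks` yields a valid, coarser approximation.
    out = [0.0] * n
    for idx, coeff in picks:
        out = [o + coeff * a for o, a in zip(out, dictionary[idx])]
    return out
```

Because the atoms are selected in order of decreasing contribution, truncating `picks` at any point gives a usable approximation whose error energy never increases as more atoms are added. This is the property that makes such a decomposition a natural fit for fine-grain scalability in this sketch; how the actual coder quantizes and orders its atoms is not specified in the abstract.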
