Perceptually Enhanced Bit-Plane Coding for Scalable Audio

The MPEG-4 scalable to lossless (SLS) audio coding is recently being developed to provide a unified solution for high-compression perceptual audio coding and high-quality lossless audio coding. SLS provides efficient fine granular scalable (FGS) coding from AAC core layer to lossless, and achieves reasonable perceptual quality at its scalable coding range using a sequential bit-plane scanning method, which minimizes the audio distortion according to the spectral shape of the core layer quantization errors. In this paper, it is shown that the perceptual quality performance of SLS at intermediate rates can be further improved by incorporating psycho acoustic model into the bit-plane coding process. In addition, it is also found that such an improvement can be achieved by slightly tweaking the original bit-plane coding process of SLS and hence preserving its nice features such as compatibility to lossless coding and low complexity

[1]  A. Spanias,et al.  Perceptual coding of digital audio , 2000, Proceedings of the IEEE.

[2]  Susanto Rahardja,et al.  Bit-plane Golomb coding for sources with Laplacian distributions , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[3]  RuiMin Hu,et al.  Scalable Audio Coding Based on Integer Transform , 2006, 2006 First International Conference on Communications and Networking in China.

[4]  Susanto Rahardja,et al.  MPEG-4 Scalable to Lossless Audio Coding , 2004 .