HIGH QUALITY SCALABLE STEREO AUDIO CODING

This paper proposes an efficient, low-complexity, scalable audio coder based on a combination of two embedded coding algorithms: the SPIHT (set partitioning in hierarchical trees) coding algorithm [1] and an embedded, nested binary set partitioning (NBSP) algorithm. The SPIHT algorithm, widely regarded as state of the art in still image compression, is used for the low frequency subbands of a wavelet packet decomposition of the audio signal, while the NBSP algorithm encodes the high frequency subbands. The left and right channels are encoded together to form a single embedded stereo audio bitstream that can be truncated at any point to produce an optimal bitstream of lower rate and quality for delivery to lower quality user services. Using standard MPEG test materials, we compare the proposed encoder with the MPEG II standard audio coder through informal listening tests at bit rates of 48 kbit/s and 64 kbit/s per channel. We conclude that our coder is comparable with MPEG II at 48 kbit/s and better at 64 kbit/s per channel. The algorithm also features exact bit rate control, progressive transmission, and low complexity for both the encoder and decoder. These features show its potential for interactive audio transmission over networks.
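To illustrate the embedded-bitstream property described above, the following is a minimal sketch of plain bit-plane coding on integer coefficients. It is not SPIHT or NBSP, and the function names `encode_bitplanes` and `decode_bitplanes` are hypothetical; it assumes only that the most significant information is written first, so that any prefix of the stream decodes to a coarser approximation.

```python
def encode_bitplanes(coeffs, num_planes):
    """Emit sign bits, then magnitude bits from the most to the least
    significant bit plane. Illustrative only; not SPIHT or NBSP."""
    bits = [1 if c < 0 else 0 for c in coeffs]
    for plane in range(num_planes - 1, -1, -1):
        bits.extend((abs(c) >> plane) & 1 for c in coeffs)
    return bits


def decode_bitplanes(bits, count, num_planes):
    """Reconstruct `count` coefficients from any prefix of the stream.
    Bits missing because of truncation are treated as zero."""
    signs = list(bits[:count]) + [0] * max(0, count - len(bits))
    mags = [0] * count
    pos = count
    for plane in range(num_planes - 1, -1, -1):
        for i in range(count):
            if pos >= len(bits):
                break  # stream was truncated here
            mags[i] |= bits[pos] << plane
            pos += 1
    return [-m if s else m for m, s in zip(mags, signs)]


coeffs = [13, -6, 3, 0]            # hypothetical quantized subband coefficients
stream = encode_bitplanes(coeffs, num_planes=4)
full = decode_bitplanes(stream, len(coeffs), 4)         # [13, -6, 3, 0]
coarse = decode_bitplanes(stream[:12], len(coeffs), 4)  # [12, -4, 0, 0]
```

Cutting the stream at a chosen bit count is also what makes exact rate control straightforward in an embedded coder: the encoder or a network node simply stops at the bit budget.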

[1]  I. Daubechies. Orthonormal bases of compactly supported wavelets, 1988.

[2]  Deepen Sinha, et al. Low bit rate transparent audio compression using adapted wavelets, 1993, IEEE Trans. Signal Process.

[3]  William A. Pearlman, et al. A new, fast, and efficient image codec based on set partitioning in hierarchical trees, 1996, IEEE Trans. Circuits Syst. Video Technol.

[4]  Ahmed H. Tewfik, et al. Low bit rate high quality audio coding with combined harmonic and wavelet representations, 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[5]  Marcus Purat, et al. Audio coding with a dynamic wavelet packet decomposition based on frequency-varying modulated lapped transforms, 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[6]  William A. Pearlman, et al. An efficient, low-complexity audio coder delivering multiple levels of quality for interactive applications, 1998, 1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175).