Modified EZW and SPIHT algorithms for perceptually audio and high quality speech coding

This paper evaluates the problems of implementing two well-known zero-tree-based re-encoding schemes of Embedded Zero-tree Wavelet (EZW) and the set partitioning in hierarchical trees (SPIHT) for perceptually audio and high quality speech coding. Since the original EZW and SPIHT algorithms are designed for image compression, some new modifications have been implemented in these schemes for their better matching with audio signals. The performances of these two re-encoders are compared in terms of average output bit rate and computation time of a same codec. It is concluded that the proposed modifications can improve the performance of both algorithms by about 20–30%. Furthermore, it is shown that the modified EZW algorithm achieves relatively better average bit rates although it has lower speed in comparison to the modified SPIHT algorithm.

[1]  Jerome M. Shapiro,et al.  Embedded image coding using zerotrees of wavelet coefficients , 1993, IEEE Trans. Signal Process..

[2]  Mislav Grgic,et al.  Modified SPIHT algorithm for wavelet packet image coding , 2005, Real Time Imaging.

[3]  Ingrid Daubechies,et al.  Ten Lectures on Wavelets , 1992 .

[4]  Ronald R. Coifman,et al.  Adaptive wavelet packet basis selection for zerotree image coding , 2003, IEEE Trans. Image Process..

[5]  William A. Pearlman,et al.  Quantifying the Coding Performance of Zerotrees of Wavelet Coefficients: Degree-$k$ Zerotree , 2007, IEEE Transactions on Signal Processing.

[6]  Bryan Usevitch,et al.  A tutorial on modern lossy wavelet image compression: foundations of JPEG 2000 , 2001, IEEE Signal Process. Mag..

[7]  William A. Pearlman,et al.  A new, fast, and efficient image codec based on set partitioning in hierarchical trees , 1996, IEEE Trans. Circuits Syst. Video Technol..

[8]  Pao-Chi Chang,et al.  Scalable embedded zero tree wavelet packet audio coding , 2001, 2001 IEEE Third Workshop on Signal Processing Advances in Wireless Communications (SPAWC'01). Workshop Proceedings (Cat. No.01EX471).

[9]  William A. Pearlman,et al.  An efficient, low-complexity audio coder delivering multiple levels of quality for interactive applications , 1998, 1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175).

[10]  M. Savoji,et al.  Adaptive Wavelet Coding of Audio and High Quality Speech at 32 Kb / s Using PsychoAcoustic Noise Masking Effects , 2004 .

[11]  K. Ashok Babu,et al.  Modified SPIHT Algorithm for Wavelet Packet Image Coding , 2009 .

[12]  Michael T. Orchard,et al.  Smooth wavelets, transform coding, and Markov-1 processes , 1993, IEEE Trans. Signal Process..

[13]  A. Gersho,et al.  Perceptual zerotrees for scalable wavelet coding of wideband audio , 1999, 1999 IEEE Workshop on Speech Coding Proceedings. Model, Coders, and Error Criteria (Cat. No.99EX351).

[14]  Philipos C. Loizou,et al.  Speech Enhancement: Theory and Practice , 2007 .

[15]  S. Mallat A wavelet tour of signal processing , 1998 .

[16]  Stéphane Mallat,et al.  Analysis of low bit rate image transform coding , 1998, IEEE Trans. Signal Process..

[17]  Andy C. Downton,et al.  Reduced bit rate uniform quantisation for SPIHT encoding , 2003 .