Adaptive Variable Degree-k Zero-Trees for Re-Encoding of Perceptually Quantized Wavelet-Packet Transformed Audio and High Quality Speech

This paper proposes a fast, efficient, and scalable algorithm, called "adaptive variable degree-k zero-trees" (AVDZ), for re-encoding perceptually quantized wavelet-packet transform (WPT) coefficients of audio and high quality speech. The quantization step takes some basic perceptual considerations into account and achieves good subjective quality with low complexity. The performance of AVDZ is compared with two other zero-tree-based schemes: (1) the embedded zero-tree wavelet (EZW) algorithm and (2) set partitioning in hierarchical trees (SPIHT). Because EZW and SPIHT were designed for image compression, both are modified here to better match audio signals. These modifications are shown to improve their performance by about 15–25%. Furthermore, the proposed AVDZ algorithm outperforms the modified versions in terms of both average output bit-rate and computation time.
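As a rough illustration of the "degree-k" idea in the algorithm's name, the sketch below tests whether a coefficient tree is a degree-k zero-tree, assuming the definition commonly used in the zero-tree literature: every coefficient is insignificant with respect to a threshold, except possibly those in the top k levels of the tree. Under that definition, degree 0 corresponds to the classic EZW zero-tree. The heap-style binary tree layout (children of node i at 2i+1 and 2i+2) and the function names are assumptions made for this example only; the abstract does not specify the tree structure or the adaptation rule that AVDZ uses.

```python
from typing import Optional, Sequence


def is_degree_k_zerotree(coeffs: Sequence[float], root: int, k: int,
                         threshold: float) -> bool:
    """Return True if every node at depth >= k below `root` is insignificant.

    Assumed layout (hypothetical): a binary tree stored heap-style, so the
    children of node i are 2*i + 1 and 2*i + 2. The root has depth 0.
    A coefficient is "significant" when |c| >= threshold.
    """
    stack = [(root, 0)]
    while stack:
        node, depth = stack.pop()
        if node >= len(coeffs):
            continue  # past the leaves of the tree
        if depth >= k and abs(coeffs[node]) >= threshold:
            return False
        stack.append((2 * node + 1, depth + 1))
        stack.append((2 * node + 2, depth + 1))
    return True


def smallest_zerotree_degree(coeffs: Sequence[float], root: int,
                             threshold: float, max_k: int) -> Optional[int]:
    """Smallest k in [0, max_k] for which the tree is a degree-k zero-tree."""
    for k in range(max_k + 1):
        if is_degree_k_zerotree(coeffs, root, k, threshold):
            return k
    return None


# Toy example: 7 coefficients forming a 3-level binary tree.
coeffs = [9, 5, 1, 0, 1, 0, 0]
print(smallest_zerotree_degree(coeffs, root=0, threshold=4, max_k=3))
print(smallest_zerotree_degree(coeffs, root=2, threshold=4, max_k=3))
```

A coder that lets the degree vary per tree could signal the smallest applicable k instead of committing to one fixed zero-tree type, which is one plausible reading of "variable degree-k"; the actual AVDZ symbol set and adaptation are described in the paper body, not here.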
