TOWARDS A HYBRID AUDIO CODER

The main features of a novel approach for audio signal encoding are described. The approach combines non-linear transform coding and structured approximation techniques, together with hybrid modeling of the signal class under consideration. Essentially, several different components of the signal are estimated and transform coded using an appropriately chosen orthonormal basis. Different models and estimation procedures are discussed, and numerical results are provided.

[1]  Jelena Kovacevic,et al.  Wavelets and Subband Coding , 2013, Prentice Hall Signal Processing Series.

[2]  Bruno Torrésani,et al.  Determining local transientness of audio signals , 2004, IEEE Signal Processing Letters.

[3]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[4]  Robert D. Nowak,et al.  Wavelet-based statistical signal processing using hidden Markov models , 1998, IEEE Trans. Signal Process..

[5]  M. Victor Wickerhauser,et al.  Adapted wavelet analysis from theory to software , 1994 .

[6]  Xiaoming Huo,et al.  Uncertainty principles and ideal atomic decomposition , 2001, IEEE Trans. Inf. Theory.

[7]  Stéphane Mallat,et al.  Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..

[8]  Michael Elad,et al.  A generalized uncertainty principle and sparse representation in pairs of bases , 2002, IEEE Trans. Inf. Theory.

[9]  Jerome M. Shapiro,et al.  Embedded image coding using zerotrees of wavelet coefficients , 1993, IEEE Trans. Signal Process..

[10]  I. Daubechies,et al.  Tree Approximation and Optimal Encoding , 2001 .

[11]  Peter No,et al.  Digital Coding of Waveforms , 1986 .

[12]  Ingrid Daubechies,et al.  Ten Lectures on Wavelets , 1992 .

[13]  K. H. Barratt Digital Coding of Waveforms , 1985 .

[14]  Teresa H. Meng,et al.  A perceptually based audio signal model with application to scalable audio compression , 1999 .

[15]  Maxim J. Goldberg,et al.  Removing noise from music using local trigonometric bases and wavelet packets , 1994 .

[16]  Mark B. Sandler,et al.  MDCT analysis of sinusoids: exact results and applications to coding artifacts reduction , 2004, IEEE Transactions on Speech and Audio Processing.

[17]  Ronald R. Coifman,et al.  Multilayered image representation: application to image compression , 2002, IEEE Trans. Image Process..

[18]  Julius O. Smith,et al.  Audio representations for data compression and compressed domain processing , 1998 .

[19]  Xavier Serra,et al.  A system for sound analysis/transformation/synthesis based on a deterministic plus stochastic decomposition , 1989 .