Hybrid representations for audiophonic signal encoding

In this paper, we discuss a new approach to signal modeling in the context of audio signal encoding. The method is based on hybrid models that simultaneously feature transient, tonal, and stochastic components of the signal. Contrary to several existing approaches, our method does not rely on any prior segmentation of the signal. The three components are estimated and encoded using a strategy very much in the spirit of transform coding. While the details of the method described here are tailored to audio signals, the general strategy should also apply to other types of signals exhibiting significantly different features, such as images.
