Audio encoding based on the Empirical Mode Decomposition

This paper deals with a new approach for perceptual audio encoding, based on the Empirical Mode Decomposition (EMD). The audio signal is decomposed adaptively into intrinsic oscillatory components by EMD called Intrinsic Mode Functions (IMFs), which can be fully described by their extrema. These extrema are encoded after an appropriate thresholding scheme controlled by a psycho-acoustic model. The decoder recovers the original signal after IMFs reconstruction by means of spline interpolation and their summation. The proposed approach is applied to different audio signals and results are compared to wavelets and to MPEG1-layer3 (MP3) approaches. Relying on exhaustive simulations, the obtained results show that the proposed compression scheme performs better than the MP3 and the wavelet approach in terms of bit rate and audio quality.

[1]  Gabriel Rilling,et al.  Empirical mode decomposition as a filter bank , 2004, IEEE Signal Processing Letters.

[2]  W. Robertson,et al.  Comparing audio compression using wavelets with other audio compression schemes , 1999, Engineering Solutions for the Next Millennium. 1999 IEEE Canadian Conference on Electrical and Computer Engineering (Cat. No.99TH8411).

[3]  Terry A. Welch,et al.  A Technique for High-Performance Data Compression , 1984, Computer.

[4]  N. Huang,et al.  The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis , 1998, Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[5]  Gerhard Stoll,et al.  ISO-MPEG-1 Audio: A Generic Standard for Coding of High-: Quality Digital Audio , 1994 .

[6]  Alfred Mertins,et al.  Signal Analysis: Wavelets, Filter Banks, Time-Frequency Transforms and Applications , 1999 .

[7]  Jean-Christophe Cexus,et al.  Denoising via empirical mode decomposition , 2006 .

[8]  P. Noll,et al.  MPEG digital audio coding , 1997, IEEE Signal Process. Mag..

[9]  James D. Johnston,et al.  Transform coding of audio signals using perceptual noise criteria , 1988, IEEE J. Sel. Areas Commun..

[10]  Abdel-Ouahab Boudraa,et al.  Speech Enhancement via EMD , 2008, EURASIP J. Adv. Signal Process..

[11]  Leah H. Jamieson,et al.  High-quality audio compression using an adaptive wavelet packet decomposition and psychoacoustic modeling , 1998, IEEE Trans. Signal Process..

[12]  John Wiley Signal Analysis: Wavelets, Filter Banks, Time-frequency Transforms and Filter Banks , 2001 .

[13]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[14]  Birger Kollmeier,et al.  PEMO-Q—A New Method for Objective Audio Quality Assessment Using a Model of Auditory Perception , 2006, IEEE Transactions on Audio, Speech, and Language Processing.