HHT-based audio coding

In this paper, a new audio coding scheme combining the Hilbert transform and the empirical mode decomposition (EMD) is introduced. Because it is built on the EMD, the coding is fully data-driven. The audio signal is first decomposed adaptively, by EMD, into intrinsic oscillatory components called intrinsic mode functions (IMFs). The key idea of this work is to code both the instantaneous amplitude (IA) and the instantaneous frequency (IF) of the extracted IMFs, each computed using the Hilbert transform. Since successive samples of the IA (resp. IF) are strongly correlated, it is encoded via a linear prediction technique. The decoder recovers the original signal by superposition of the demodulated IMFs. The proposed approach is applied to audio signals, and the results are compared to those obtained by the advanced audio coding (AAC) and MP3 codecs and by wavelet-based compression. Coding performance is evaluated using the bit rate, the objective difference grade (ODG) and the noise-to-mask ratio (NMR). On the audio signals analyzed, our coding scheme performs better overall than wavelet compression and the AAC and MP3 codecs. The results also show that the new scheme achieves good coding performance without significant perceptual distortion, with an ODG in the range $$[-1,0]$$ and large negative NMR values.
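The IA/IF step and the decoder-side demodulation described above can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the EMD stage is omitted and replaced by a synthetic amplitude-modulated tone standing in for a single IMF, the predictor is a plain least-squares linear predictor of assumed order 2, and all function names (`analytic_signal`, `ia_and_if`, `demodulate`, `lp_residual`) are hypothetical. Quantization and entropy coding are also left out.

```python
import numpy as np

def analytic_signal(x):
    """Analytic signal via the FFT (keep DC, double positive frequencies)."""
    n = len(x)
    weights = np.zeros(n)
    weights[0] = 1.0
    if n % 2 == 0:
        weights[n // 2] = 1.0
        weights[1:n // 2] = 2.0
    else:
        weights[1:(n + 1) // 2] = 2.0
    return np.fft.ifft(np.fft.fft(x) * weights)

def ia_and_if(component, fs):
    """Instantaneous amplitude, initial phase, and instantaneous frequency (Hz)."""
    z = analytic_signal(component)
    ia = np.abs(z)
    phase = np.unwrap(np.angle(z))
    inst_freq = np.diff(phase) * fs / (2.0 * np.pi)  # one sample shorter than ia
    return ia, phase[0], inst_freq

def demodulate(ia, phase0, inst_freq, fs):
    """Decoder side: integrate the IF back to a phase, then rebuild ia * cos(phase)."""
    increments = np.concatenate(([0.0], np.cumsum(inst_freq)))
    phase = phase0 + 2.0 * np.pi * increments / fs
    return ia * np.cos(phase)

def lp_residual(x, order=2):
    """Residual of a least-squares linear predictor (order is an assumption)."""
    n = len(x)
    past = np.column_stack([x[order - k - 1:n - k - 1] for k in range(order)])
    target = x[order:]
    coeffs, *_ = np.linalg.lstsq(past, target, rcond=None)
    return target - past @ coeffs

# Synthetic stand-in for one IMF: a 440 Hz tone with slow amplitude modulation.
fs = 8000
t = np.arange(fs) / fs
imf_like = (1.0 + 0.3 * np.cos(2 * np.pi * 3 * t)) * np.cos(2 * np.pi * 440 * t)

ia, phase0, inst_freq = ia_and_if(imf_like, fs)
rebuilt = demodulate(ia, phase0, inst_freq, fs)
residual = lp_residual(ia)

print("max reconstruction error:", np.max(np.abs(rebuilt - imf_like)))
print("residual variance / IA variance:", np.var(residual) / np.var(ia))
```

For this slowly varying envelope, the prediction residual carries far less energy than the IA itself, which is the property that makes predictive coding of the IA and IF attractive. In a full codec the same steps would be applied to every IMF and the demodulated components summed.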
