Spectral Analysis of Audio Signals with Noise Assisted Empirical Mode Decomposition

A data adaptive approach to spectral analysis of audio signals is implemented in this paper. The audio signals are non-stationary as well as non-linear in nature and the traditional Fourier based spectral representation is not effective. The Hilbert spectral analysis implemented by noise assisted bivariate empirical mode decomposition (NABEMD) is introduced here as an efficient spectral representation scheme of audio signals. In BEMD, the fractional Gaussian noise (fGn) and analyzing speech signal are used as two separate variables. Both signals are decomposed together yielding a finite set of intrinsic mode functions (IMFs) for individual variables (signals). The use of fGn implements BEMD with dyadic filterbank characteristics. The instantaneous frequencies of individual IMFs are computed by applying Hilbert transform and then the timefrequency representation is achieved by arranging the energy with respect to time and frequency simultaneously. Such representation is called Hilbert spectrum (HS) which is analogous to spectrogram. The marginal HS derived from HS corresponds the total energy at each frequency component. The experimental results show that the Hilbert spectral analysis provides better representation of audio signal contents compared to the Fourier based approach.

[1]  N. Huang,et al.  A new view of nonlinear water waves: the Hilbert spectrum , 1999 .

[2]  Sylvain Marchand,et al.  Informed spectral analysis for isolated audio source parameters estimation , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[3]  Hong Qian,et al.  Fractional Brownian Motion and Fractional Gaussian Noise , 2003 .

[4]  Toshihisa Tanaka,et al.  Complex Empirical Mode Decomposition , 2007, IEEE Signal Processing Letters.

[5]  N. Huang,et al.  A New Spectral Representation of Earthquake Data: Hilbert Spectral Analysis of Station TCU129, Chi-Chi, Taiwan, 21 September 1999 , 2004 .

[6]  Wolfgang Effelsberg,et al.  Automatic audio content analysis , 1997, MULTIMEDIA '96.

[7]  N. Huang,et al.  The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis , 1998, Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[8]  Richard G. Baraniuk,et al.  Empirical Mode Decomposition Based Frequency Attributes , 1999 .

[9]  Norden E. Huang,et al.  A review on Hilbert‐Huang transform: Method and its applications to geophysical studies , 2008 .

[10]  Jieun Kim,et al.  Speech/Audio Signal Classification Using Spectral Flux Pattern Recognition , 2012, 2012 IEEE Workshop on Signal Processing Systems.

[11]  Muhammad Altaf,et al.  Rotation Invariant Complex Empirical Mode Decomposition , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[12]  Danilo P. Mandic,et al.  Emd via mEMD: multivariate noise-Aided Computation of Standard EMD , 2013, Adv. Data Sci. Adapt. Anal..

[13]  N. Huang,et al.  A study of the characteristics of white noise using the empirical mode decomposition method , 2004, Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[14]  Gabriel Rilling,et al.  Empirical mode decomposition as a filter bank , 2004, IEEE Signal Processing Letters.

[15]  Demetris Koutsoyiannis The Hurst phenomenon and fractional Gaussian noise made easy , 2002 .

[16]  S. S. Shen,et al.  A confidence limit for the empirical mode decomposition and Hilbert spectral analysis , 2003, Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[17]  Jürgen Herre,et al.  Robust matching of audio signals using spectral flatness features , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[18]  Philippe-Aubert Gauthier,et al.  Spectral and Spatial Multichannel Analysis/Synthesis of Interior Aircraft Sounds , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[19]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[20]  Gabriel Rilling,et al.  Bivariate Empirical Mode Decomposition , 2007, IEEE Signal Processing Letters.

[21]  Norden E. Huang,et al.  Ensemble Empirical Mode Decomposition: a Noise-Assisted Data Analysis Method , 2009, Adv. Data Sci. Adapt. Anal..

[22]  Keikichi Hirose,et al.  Separation of Mixed Audio Signals by Decomposing Hilbert Spectrum with Modified EMD , 2006, IEICE Trans. Fundam. Electron. Commun. Comput. Sci..