Hilbert Spectrum in Time-Frequency Representation of Audio Signals Considering Disjoint Orthogonality

The performance of Hilbert spectrum (HS) in time-frequency representation (TFR) of audio signals is investigated in this paper. HS offers a fine-resolution TFR of time domain signals. It is derived by applying empirical mode decomposition (EMD), a newly developed data adaptive method for nonlinear and non-stationary signal analysis together with Hilbert transform. EMD represents any time domain signal as a sum of a finite number of bases called intrinsic mode functions (IMFs). The instantaneous frequency responses of the IMFs derived through Hilbert transform are arranged to obtain the TFR of the analyzing signal yielding the HS. The disjoint orthogonal property of audio signals is used as the decisive factor to measure the efficiency in TFR. Several audio signals are considered as disjoint orthogonal if not more than one source is active at any time-frequency cell. The performance of HS is compared with well known and widely used short-time Fourier transform technique for TFR. The experimental results show that HS based method performs better in time-frequency representation of the audio signals with the consideration of disjoint orthogonality.

[1]  Scott Rickard,et al.  Blind separation of speech mixtures via time-frequency masking , 2004, IEEE Transactions on Signal Processing.

[2]  Miguel Arjona Ramírez,et al.  Split-order linear prediction for segmentation and harmonic spectral modeling , 2006, IEEE Signal Processing Letters.

[3]  Gabriel Rilling,et al.  Empirical mode decomposition as a filter bank , 2004, IEEE Signal Processing Letters.

[4]  Guy J. Brown,et al.  Speech segregation based on sound localization , 2003 .

[5]  Alfred Mertins,et al.  Signal Analysis: Wavelets, Filter Banks, Time-Frequency Transforms and Applications , 1999 .

[6]  Keikichi Hirose,et al.  Separation of Mixed Audio Signals by Decomposing Hilbert Spectrum with Modified EMD , 2006, IEICE Trans. Fundam. Electron. Commun. Comput. Sci..

[7]  N. Huang,et al.  The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis , 1998, Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[8]  S. S. Shen,et al.  Applications of Hilbert–Huang transform to non‐stationary financial time series analysis , 2003 .

[9]  David G. Long Comments on Hilbert Transform Based Signal Analysis , 2004 .

[10]  N. Huang,et al.  A study of the characteristics of white noise using the empirical mode decomposition method , 2004, Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[11]  Richard G. Baraniuk,et al.  Empirical Mode Decomposition Based Frequency Attributes , 1999 .

[12]  Kyle Forinash,et al.  Time-frequency analysis with the continuous wavelet transform , 1998 .

[13]  C. Marchesi,et al.  Estimation of the power spectral density in nonstationary cardiovascular time series: assessing the role of the time-frequency representations (TFR) , 1996, IEEE Transactions on Biomedical Engineering.

[14]  Matthias Baeck,et al.  REAL-TIME IMPLEMENTATION OF A SOURCE SEPARATION ALGORITHM , 2003 .

[15]  Ronald L. Allen,et al.  Signal Analysis: Time, Frequency, Scale and Structure , 2003 .

[16]  Keikichi Hirose,et al.  Audio source separation from the mixture using empirical mode decomposition with independent subspace analysis , 2004, INTERSPEECH.

[17]  Keikichi Hirose,et al.  Multiband linear prediction of speech signals with adaptive order using empirical mode decomposition (Special issue on nonlinear circuits and signal processing) , 2007 .

[18]  Norden E. Huang,et al.  On Instantaneous Frequency , 2009, Adv. Data Sci. Adapt. Anal..

[19]  DeLiang Wang,et al.  Speech segregation based on sound localization , 2001, IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222).

[20]  Keikichi Hirose,et al.  Audio source separation by source localization with Hilbert spectrum , 2005, 2005 IEEE International Symposium on Circuits and Systems.

[21]  Martin Cooke,et al.  Modelling auditory processing and organisation , 1993, Distinguished dissertations in computer science.