Perceptual Speech Enhancement Using Hilbert Transform

A new speech enhancement algorithm is proposed that uses a Hilbert transform (HT) based time-frequency (TF) representation of the speech signal, designed with respect to human perception. The TF representation is obtained by analytic decomposition of the speech signal in the critical bands (CB) of hearing, using the envelope and phase components of the analytic signals. For enhancement, the envelope in each CB is modified, following the conventional spectral subtraction method, by a time-varying gain function that takes the threshold of hearing into account. This threshold is calculated from the masking effects of all bands using a perception model. The signal is reconstructed from the modified envelopes and the original phases of the noisy signal in the critical bands. Experimental results show that using a threshold of hearing that includes temporal masking can effectively eliminate musical noise without a significant decrease in intelligibility.
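The envelope-and-phase processing chain described above can be illustrated with a minimal sketch. It is not the authors' implementation, and several parts are assumptions made only for illustration: the ideal FFT-mask band split stands in for a critical-band filterbank, the band edges and the per-band noise envelope estimate `noise_env` are hypothetical inputs, and the gain is a plain subtractive rule with a fixed `floor` rather than one driven by a masking threshold from a perception model.

```python
import numpy as np

def analytic_signal(x):
    """Analytic signal of a real 1-D array, computed with the FFT."""
    n = len(x)
    spectrum = np.fft.fft(x)
    weights = np.zeros(n)
    weights[0] = 1.0                      # keep DC once
    if n % 2 == 0:
        weights[n // 2] = 1.0             # keep Nyquist once
        weights[1:n // 2] = 2.0           # double positive frequencies
    else:
        weights[1:(n + 1) // 2] = 2.0
    return np.fft.ifft(spectrum * weights)

def split_bands(x, fs, edges_hz):
    """Split x into bands with ideal FFT masks (assumed stand-in for
    a critical-band filterbank; edges_hz is a hypothetical edge list)."""
    spectrum = np.fft.rfft(x)
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
    bands = []
    for lo, hi in zip(edges_hz[:-1], edges_hz[1:]):
        mask = (freqs >= lo) & (freqs < hi)
        bands.append(np.fft.irfft(spectrum * mask, n=len(x)))
    return bands

def enhance(x, fs, edges_hz, noise_env, floor=0.1):
    """Modify each band envelope with a subtractive gain and rebuild the
    signal from the modified envelopes and the original band phases."""
    out = np.zeros(len(x))
    for band, n_env in zip(split_bands(x, fs, edges_hz), noise_env):
        z = analytic_signal(band)
        env = np.abs(z)                   # envelope of the band
        phase = np.angle(z)               # phase of the noisy band, left untouched
        # Assumed gain rule: subtract the noise envelope, never below `floor`.
        gain = np.maximum(1.0 - n_env / np.maximum(env, 1e-12), floor)
        out += gain * env * np.cos(phase)
    return out

if __name__ == "__main__":
    fs = 8000
    t = np.arange(1024) / fs
    noisy = np.sin(2 * np.pi * 440 * t) + 0.1 * np.random.default_rng(0).standard_normal(len(t))
    edges = [0, 500, 1000, 2000, 4001]    # hypothetical band edges in Hz
    cleaned = enhance(noisy, fs, edges, noise_env=[0.05] * 4)
    print(cleaned.shape)
```

With a zero noise estimate the gain is 1 in every band and the sketch returns its input unchanged, which is a convenient sanity check that the envelope and phase split is lossless. A perceptual variant would replace the fixed `floor` with a time-varying bound derived from the masking threshold.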
