Noisy speech enhancement based on an adaptive threshold and a modified hard thresholding function in wavelet packet domain

This paper proposes a speech enhancement approach, which statistically determines an adaptive threshold using the Teager energy operated WP coefficients of noisy speech. The obtained threshold is employed upon the WP coefficients of the noisy speech by employing a modified hard thresholding function. Extensive simulations in the presence of different noises indicate that this new method is very effective for both white noise and color noise reduction from speech, resulting in enhanced speech with better speech quality. Several standard objective measures and subjective observations show that the proposed method outperforms recent state-of-the-art thresholding based approaches from high to low level SNRs.

[1]  Michael T. Johnson,et al.  Speech signal enhancement through adaptive wavelet thresholding , 2007, Speech Commun..

[2]  Philipos C. Loizou,et al.  Speech enhancement based on perceptually motivated bayesian estimators of the magnitude spectrum , 2005, IEEE Transactions on Speech and Audio Processing.

[3]  Eric Plourde,et al.  Bayesian short-time spectral amplitude estimators for single-channel speech enhancement , 2009 .

[4]  Yang Lu,et al.  Estimators of the Magnitude-Squared Spectrum and Methods for Incorporating SNR Uncertainty , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[5]  David L. Donoho,et al.  De-noising by soft-thresholding , 1995, IEEE Trans. Inf. Theory.

[6]  Chip-Hong Chang,et al.  A Generalized Time–Frequency Subtraction Method for Robust Speech Enhancement Based on Wavelet Filter Banks Modeling of Human Auditory System , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[7]  John H. L. Hansen,et al.  Speech Enhancement Based on Generalized Minimum Mean Square Error Estimators and Masking Properties of the Auditory System , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  Philipos C. Loizou,et al.  A multi-band spectral subtraction method for enhancing speech corrupted by colored noise , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  Sofia Ben Jebara A Perceptual Approach to Reduce Musical Noise Phenomenon with Wiener Denoising Technique , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[10]  Philipos C. Loizou,et al.  Speech Enhancement: Theory and Practice , 2007 .

[11]  Bin Chen,et al.  A Laplacian-based MMSE estimator for speech enhancement , 2007, Speech Commun..

[12]  Herman J. M. Steeneken,et al.  Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems , 1993, Speech Commun..

[13]  K. Yamashita,et al.  Nonstationary noise estimation using low-frequency regions for spectral subtraction , 2005, IEEE Signal Processing Letters.

[14]  I. Johnstone,et al.  Adapting to Unknown Smoothness via Wavelet Shrinkage , 1995 .

[15]  Athanasios Papoulis,et al.  Probability, Random Variables and Stochastic Processes , 1965 .

[16]  Jesper Jensen,et al.  Minimum Mean-Square Error Estimation of Discrete Fourier Coefficients With Generalized Gamma Priors , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[17]  Younghun Kwon,et al.  Speech enhancement for non-stationary noise environment by adaptive wavelet packet , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[18]  Ben P. Milner,et al.  Visually Derived Wiener Filters for Speech Enhancement , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[19]  Jean Rouat,et al.  A new approach for wavelet speech enhancement , 2001, INTERSPEECH.

[20]  Wei Zhang,et al.  Speech enhancement employing Laplacian-Gaussian mixture , 2005, IEEE Transactions on Speech and Audio Processing.

[21]  Jean Rouat,et al.  Wavelet speech enhancement based on time-scale adaptation , 2006, Speech Commun..

[22]  Hamid Sheikhzadeh,et al.  HMM-based strategies for enhancement of speech signals embedded in nonstationary noise , 1998, IEEE Trans. Speech Audio Process..

[23]  Hamid Sheikhzadeh,et al.  An improved wavelet-based speech enhancement system , 2001, INTERSPEECH.

[24]  Yasser Ghanbari,et al.  A new approach for speech enhancement based on the adaptive thresholding of the wavelet packets , 2006, Speech Commun..

[25]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[26]  Yi Hu,et al.  Subjective comparison and evaluation of speech enhancement algorithms , 2007, Speech Commun..

[27]  Yi Hu,et al.  Speech enhancement based on wavelet thresholding the multitaper spectrum , 2004, IEEE Transactions on Speech and Audio Processing.

[28]  Ahmad Akbari,et al.  A new wavelet thresholding method for speech enhancement based on symmetric Kullback-Leibler divergence , 2009, 2009 14th International CSI Computer Conference.

[29]  Bruno O. Shubert,et al.  Random variables and stochastic processes , 1979 .

[30]  Benoît Champagne,et al.  Incorporating the human hearing properties in the signal subspace approach for speech enhancement , 2003, IEEE Trans. Speech Audio Process..

[31]  Jhing-Fa Wang,et al.  Speech Enhancement Using Perceptual Wavelet Packet Decomposition and Teager Energy Operator , 2004, J. VLSI Signal Process..

[32]  Martin Vetterli,et al.  Adaptive wavelet thresholding for image denoising and compression , 2000, IEEE Trans. Image Process..

[33]  Sven Nordholm,et al.  Spectral subtraction using reduced delay convolution and adaptive averaging , 2001, IEEE Trans. Speech Audio Process..

[34]  James F. Kaiser,et al.  Some useful properties of Teager's energy operators , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[35]  J. Rouat,et al.  Wavelet speech enhancement based on the Teager energy operator , 2001, IEEE Signal Processing Letters.

[36]  David Malah,et al.  Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..