A Novel Wavelet Packet Speech Enhancement Algorithm Based On Time-Frequency Threshold

Threshold choosing is critical in wavelet speech enhancement. In the paper, a novel wavelet packet speech enhancement algorithm is presented based on time-frequency (TF) threshold. Different from the conventional methods in threshold choosing, e.g. invariant threshold and time-variant threshold, the proposed threshold is modulated according to speech TF details other than rough envelops adopted in the recent algorithms based on eager energy operator (TEO) and adaptive noise estimation (ANE). In the new algorithm, the speech TF information is obtained from the frequency-based pre-estimate, and the threshold is modulated with TF characteristic of the pre-estimate. Then via thresholding the wavelet packet coefficients, the contaminated speech can be denoised adaptively. Compared with the former wavelet based algorithms, the proposed algorithm offers more pleasant enhanced speech with less distortion and residual noise in additive Gaussian noise case. Experimental results show its better performance in subjective test, input and output SNR test and modified bark spectral distortion measurement (MBSD).

[1]  David L. Donoho,et al.  De-noising by soft-thresholding , 1995, IEEE Trans. Inf. Theory.

[2]  Sheau-Fang Lei,et al.  Speech enhancement for nonstationary noises by wavelet packet transform and adaptive noise estimation , 2005, 2005 International Symposium on Intelligent Signal Processing and Communication Systems.

[3]  David Malah,et al.  Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..

[4]  Nathalie Virag,et al.  Single channel speech enhancement based on masking properties of the human auditory system , 1999, IEEE Trans. Speech Audio Process..

[5]  Wonho Yang,et al.  A modified bark spectral distortion measure which uses noise masking threshold , 1997, 1997 IEEE Workshop on Speech Coding for Telecommunications Proceedings. Back to Basics: Attacking Fundamental Problems in Speech Coding.

[6]  Jhing-Fa Wang,et al.  Speech Enhancement Using Perceptual Wavelet Packet Decomposition and Teager Energy Operator , 2004, J. VLSI Signal Process..

[7]  I. Johnstone,et al.  Ideal spatial adaptation by wavelet shrinkage , 1994 .