A new wavelet thresholding method for speech enhancement based on symmetric Kullback-Leibler divergence

Performance of wavelet thresholding methods for speech enhancement is dependent on estimating an exact threshold value in the wavelet sub-bands. In this paper, we propose a new method for more exact estimating the threshold value. We proposed to determine the threshold value based on the symmetric Kullback-Leibler divergence between the probability distributions of noisy speech and noise wavelet coefficients. In the next step, we improved this value using segmental SNR. We used some of TIMIT utterances to assess the performance of the proposed threshold. The algorithm is evaluated using the PESQ score and the SNR improvement. In average, we obtain 2db SNR improvement and a PESQ score increase up to 0.7 in comparison to the conventional wavelet thresholding approaches.

[1]  S. Tabibian,et al.  Noise reduction from speech signal based on wavelet transform and Kullback-Leibler divergence , 2008, 2008 International Symposium on Telecommunications.

[2]  Jesper Jensen,et al.  Speech enhancement based on Rayleigh mixture modeling of speech spectral amplitude distributions , 2007, 2007 15th European Signal Processing Conference.

[3]  Inder Jeet Taneja,et al.  On Generalized Information Measures and Their Applications , 1989 .

[4]  Javier Ramírez,et al.  A new Kullback-Leibler VAD for speech recognition in noise , 2004, IEEE Signal Processing Letters.

[5]  Bin Chen,et al.  A Laplacian-based MMSE estimator for speech enhancement , 2007, Speech Commun..

[6]  I. Johnstone,et al.  Adapting to Unknown Smoothness via Wavelet Shrinkage , 1995 .

[7]  Yi Hu,et al.  Speech enhancement based on wavelet thresholding the multitaper spectrum , 2004, IEEE Transactions on Speech and Audio Processing.

[8]  Maarten Jansen,et al.  Noise Reduction by Wavelet Thresholding , 2001 .

[9]  L. Goddard Information Theory , 1962, Nature.

[10]  Joon-Hyuk Chang Complex laplacian probability density function for noisy speech enhancement , 2007, IEICE Electron. Express.

[11]  S. Ayat,et al.  Wavelet based speech enhancement using a new thresholding algorithm , 2004, Proceedings of 2004 International Symposium on Intelligent Multimedia, Video and Speech Processing, 2004..

[12]  M. Gabrea,et al.  Wavelet based speech enhancement using two different threshold-based denoising algorithms , 2004, Canadian Conference on Electrical and Computer Engineering 2004 (IEEE Cat. No.04CH37513).

[13]  David L. Donoho,et al.  De-noising by soft-thresholding , 1995, IEEE Trans. Inf. Theory.

[14]  Sung-Il Yang,et al.  Speech enhancement using adaptive wavelet shrinkage , 2001, ISIE 2001. 2001 IEEE International Symposium on Industrial Electronics Proceedings (Cat. No.01TH8570).

[15]  Martin Vetterli,et al.  Adaptive wavelet thresholding for image denoising and compression , 2000, IEEE Trans. Image Process..

[16]  John S. Baras,et al.  Optimal wavelet basis selection for signal representation , 1994, Defense, Security, and Sensing.