Adaptive Speech Enhancement Based on Classification of Voiced/Unvoiced Signal

At low SNR's, the conventional wavelet enhancement algorithm may lose some useful ingredients of the speech, resulting in a low enhancement performance. To solve the problem, this paper presents an improved wavelet enhancement algorithm based on the classification of the voiced/unvoiced signal. First, we eliminate part of the noise through spectral subtraction algorithm and separate voiced signal from unvoiced signal according to its short-time energy. Second, the Wavelet Packet Transform (WPT) is made on unvoiced speech to prevent signal distortion. Then dynamic thresholds are applied to different wavelet analytical scales to avoid a much smoother signal waveform. Finally, a new adaptive threshold function is used to make up the disadvantages of soft and hard thresholding algorithm. Experiments show that the proposed method can remove much noise while keeping intelligibility of the reconstructed speech.

[1]  Hugo Van hamme,et al.  A Review of Signal Subspace Speech Enhancement and Its Application to Noise Robust Speech Recognition , 2007, EURASIP J. Adv. Signal Process..

[2]  M.A. Akhaee,et al.  Speech Enhancement by Adaptive Noise Cancellation in the Wavelet Domain , 2005, 2005 5th International Conference on Information Communications & Signal Processing.

[3]  Chip-Hong Chang,et al.  A versatile speech enhancement system based on perceptual wavelet denoising , 2005, 2005 IEEE International Symposium on Circuits and Systems.

[4]  Nathalie Virag,et al.  Single channel speech enhancement based on masking properties of the human auditory system , 1999, IEEE Trans. Speech Audio Process..

[5]  Hon Keung Kwan,et al.  Adaptive wavelet denoising system for speech enhancement , 2008, 2008 IEEE International Symposium on Circuits and Systems.

[6]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[7]  S. Ayat,et al.  Wavelet based speech enhancement using a new thresholding algorithm , 2004, Proceedings of 2004 International Symposium on Intelligent Multimedia, Video and Speech Processing, 2004..

[8]  David L. Donoho,et al.  De-noising by soft-thresholding , 1995, IEEE Trans. Inf. Theory.

[9]  Tai-Chiu Hsung,et al.  Speech enhancement based on adaptive wavelet denoising on multitaper spectrum , 2008, 2008 IEEE International Symposium on Circuits and Systems.

[10]  Lutfa Akter,et al.  Crosscorrelation Compensated Wiener Filter for Speech Enhancement , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.