An Improved Speech Enhancement Method based on Teager Energy Operator and Perceptual Wavelet Packet Decomposition

Based on how noise and clean speech are distributed in the frequency domain, a new speech enhancement method using the Teager energy operator (TEO) and perceptual wavelet packet decomposition (PWPD) is proposed. First, a modified mask construction method is used to protect the acoustic cues at low frequencies. Then, a level-dependent parameter is introduced to further adjust the thresholds according to the noise distribution. Finally, the sub-bands that have very little influence are set directly to zero, which improves the signal-to-noise ratio (SNR) and reduces the computational load. Simulation results show that, under different kinds of noise, the new method improves both the SNR and the perceptual evaluation of speech quality (PESQ) score while also reducing the computational load, which is advantageous for real-time implementation.
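
As an illustration of the building blocks named in the abstract, the following is a minimal Python sketch, not the authors' implementation. It shows the standard discrete Teager energy operator, psi[n] = x[n]^2 - x[n-1]*x[n+1], and a soft-thresholding step applied to one sub-band. The function names, the `level_scale` parameter (standing in for the level-dependent adjustment), and the `keep` flag (standing in for zeroing low-influence sub-bands) are assumptions made for this example; the abstract does not specify how the mask or the thresholds are actually computed.

```python
import math


def teager_energy(x):
    """Discrete Teager energy: psi[n] = x[n]^2 - x[n-1] * x[n+1].

    The two edge samples have no neighbour on one side, so the nearest
    interior value is replicated there (an assumption of this sketch).
    """
    n = len(x)
    if n < 3:
        return [0.0] * n
    interior = [x[i] * x[i] - x[i - 1] * x[i + 1] for i in range(1, n - 1)]
    return [interior[0]] + interior + [interior[-1]]


def soft_threshold(coeffs, thresholds):
    """Shrink each coefficient toward zero by its own threshold."""
    out = []
    for c, t in zip(coeffs, thresholds):
        mag = abs(c) - t
        out.append(math.copysign(mag, c) if mag > 0 else 0.0)
    return out


def denoise_subband(coeffs, base_threshold, level_scale=1.0, keep=True):
    """Hypothetical per-sub-band step.

    level_scale: illustrative level-dependent multiplier on the threshold.
    keep: if False, the sub-band is treated as low-influence and zeroed.
    """
    if not keep:
        return [0.0] * len(coeffs)
    thresholds = [base_threshold * level_scale] * len(coeffs)
    return soft_threshold(coeffs, thresholds)


# For a pure tone A*cos(w*n), the Teager energy is constant: A^2 * sin(w)^2.
tone = [2.0 * math.cos(0.3 * n) for n in range(50)]
psi = teager_energy(tone)
```

The tone example reflects a known property of the operator: its output tracks both the amplitude and the frequency of the signal, which is why it is useful for separating speech-dominated wavelet coefficients from noise-dominated ones.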
