论文信息 - An Improved Speech Enhancement Method based on Teager Energy Operator and Perceptual Wavelet Packet Decomposition

An Improved Speech Enhancement Method based on Teager Energy Operator and Perceptual Wavelet Packet Decomposition

According to the distribution characteristic of noise and clean speech signal in the frequency domain, a new speech enhancement method based on teager energy operator (TEO) and perceptual wavelet packet decomposition (PWPD) is proposed. Firstly, a modified Mask construction method is made to protect the acoustic cues at the low frequencies. Then a level-dependent parameter is introduced to further adjust the thresholds in light of the noise distribution feature. At last the sub-bands which have very little influence are set directly 0 to improve the signal-to-noise ratio (SNR) and reduce the computation load. Simulation results show that, under different kinds of noise environments, this new method not only enhances the signal-to-noise ratio (SNR) and perceptual evaluation of speech quality (PESQ), but also reduces the computation load, which is very advantageous for real-time realizing.

[1] C. Turner,et al. Combining acoustic and electrical hearing , 2003 .

[2] J. F. Kaiser,et al. On a simple algorithm to calculate the 'energy' of a signal , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[3] Philipos C. Loizou,et al. A multi-band spectral subtraction method for enhancing speech corrupted by colored noise , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4] I. Johnstone,et al. Wavelet Threshold Estimators for Data with Correlated Noise , 1997 .

[5] Yi Hu,et al. Evaluation of Objective Quality Measures for Speech Enhancement , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[6] Bruce J Gantz,et al. Combining acoustic and electrical hearing. , 2003, The Laryngoscope.

[7] David L. Donoho,et al. De-noising by soft-thresholding , 1995, IEEE Trans. Inf. Theory.

[8] J. Rouat,et al. Wavelet speech enhancement based on the Teager energy operator , 2001, IEEE Signal Processing Letters.

[9] Bin Zhou,et al. An improved wavelet-based speech enhancement method using adaptive block thresholding , 2010, WCSP.

[10] Simona Halunga,et al. Nonlinear spectral subtraction method for colored noise reduction using multi-band Bark scale , 2008, Signal Process..

[11] Philipos C. Loizou,et al. Speech enhancement using a frequency-specific composite Wiener function , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[12] B. Vidakovic,et al. On time-dependent wavelet denoising , 1998, IEEE Trans. Signal Process..

[13] Yi Hu,et al. Speech enhancement based on wavelet thresholding the multitaper spectrum , 2004, IEEE Transactions on Speech and Audio Processing.

[14] Yasser Ghanbari,et al. A new approach for speech enhancement based on the adaptive thresholding of the wavelet packets , 2006, Speech Commun..

[15] Jhing-Fa Wang,et al. Speech Enhancement Using Perceptual Wavelet Packet Decomposition and Teager Energy Operator , 2004, J. VLSI Signal Process..

[16] Hamid Sheikhzadeh,et al. An improved wavelet-based speech enhancement system , 2001, INTERSPEECH.

[17] Istvan Pintér,et al. Perceptual wavelet-representation of speech signals and its application to speech enhancement , 1996, Comput. Speech Lang..