Speech enhancement using perceptual Wiener filter combined with unvoiced speech — A new Scheme

A new speech enhancement technique is proposed that reduces musical noise and improves the intelligibility of speech. The proposed system has two stages. The first stage consists of a Wiener filter that uses a self-adaptive averaging factor to estimate the a priori SNR. The second stage reduces the residual musical noise that remains in the enhanced speech by applying a perceptual weighting filter based on the simultaneous and temporal masking effects of the human auditory system. An unvoiced speech enhancement algorithm is also integrated into the scheme to improve the intelligibility of speech. Rigorous objective and subjective evaluations show that the proposed scheme reduces noise with little speech distortion in adverse noise environments, and that its overall performance is superior to several other methods available in the literature.
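To make the first stage concrete, the following is a minimal sketch of a Wiener gain driven by a recursively averaged a priori SNR estimate, in the style of the standard decision-directed approach. It is not the authors' algorithm: the abstract does not give the self-adaptive rule, so the averaging factor `alpha` is held fixed here. The function name `wiener_gains`, the floor `xi_min`, and the assumption of a known, stationary noise power for a single frequency bin are all hypothetical choices made for illustration.

```python
def wiener_gains(noisy_power, noise_power, alpha=0.98, xi_min=1e-3):
    """Per-frame Wiener gains for one frequency bin (illustrative sketch).

    noisy_power: sequence of noisy spectral powers |Y|^2, one per frame
    noise_power: noise power for this bin (assumed known and stationary)
    alpha:       averaging factor, fixed here; the paper adapts it, but the
                 adaptation rule is not stated in the abstract
    xi_min:      floor on the a priori SNR (assumed value)
    """
    gains = []
    prev_clean_power = 0.0  # estimated clean-speech power of the previous frame
    for y2 in noisy_power:
        gamma = y2 / noise_power              # a posteriori SNR
        ml_term = max(gamma - 1.0, 0.0)       # instantaneous SNR estimate
        # Weighted average of the previous frame's estimate and the current one
        xi = alpha * prev_clean_power / noise_power + (1.0 - alpha) * ml_term
        xi = max(xi, xi_min)                  # a priori SNR, floored
        gain = xi / (1.0 + xi)                # Wiener gain
        gains.append(gain)
        prev_clean_power = (gain ** 2) * y2   # feeds the next frame's estimate
    return gains


if __name__ == "__main__":
    # Three frames 20 dB above the noise floor: the gain rises toward 1.
    print(wiener_gains([100.0, 100.0, 100.0], 1.0))
    # Two noise-only frames: the gain stays near the floor.
    print(wiener_gains([1.0, 1.0], 1.0))
```

A large fixed `alpha` smooths the SNR estimate across frames, which suppresses musical noise but slows the response to speech onsets. That trade-off is the usual motivation for making the averaging factor adaptive.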
