Speech enhancement using modified magnitude and phase spectra

Degrading the quality and intelligibility of the speech signals, background noise is a severe problem in communication and other speech related systems. In order to get rid of this problem, it is important to enhance the noisy speech signal mainly through noise reduction. All most all the speech enhancement methods modify the frequency domain spectrum of the noise-corrupted speech to suppress the noise. Although both magnitude and phase spectra together contain the frequency domain information, the traditional speech enhancement procedures either work with magnitude or phase spectrum. This paper presents a speech enhancement method that exploits both magnitude and phase spectra. Experimental studies of the proposed method exhibits better PESQ (Perceptual Estimation of Speech Quality) score than that of other existing methods. We also found better speech quality with our proposed method in several subjective experiments.

[1]  Yi Hu,et al.  Subjective Comparison of Speech Enhancement Algorithms , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[2]  Jae S. Lim,et al.  The unimportance of phase in speech enhancement , 1982 .

[3]  J.B. Allen,et al.  A unified approach to short-time Fourier analysis and synthesis , 1977, Proceedings of the IEEE.

[4]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[5]  Kuldip K. Paliwal,et al.  Noise driven short-time phase spectrum compensation procedure for speech enhancement , 2008, INTERSPEECH.

[6]  Kuldip K. Paliwal,et al.  The importance of phase in speech enhancement , 2011, Speech Commun..

[7]  Kuldip K. Paliwal,et al.  Exploiting Conjugate Symmetry of the Short-Time Fourier Spectrum for Speech Enhancement , 2008, IEEE Signal Processing Letters.

[8]  C. K. Yuen,et al.  Theory and Application of Digital Signal Processing , 1978, IEEE Transactions on Systems, Man, and Cybernetics.

[9]  Kuldip K. Paliwal,et al.  Usefulness of phase in speech processing , 2003 .

[10]  Norbert Wiener,et al.  Extrapolation, Interpolation, and Smoothing of Stationary Time Series, with Engineering Applications , 1949 .

[11]  Andries P. Hekstra,et al.  Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[12]  Jae Lim,et al.  Signal estimation from modified short-time Fourier transform , 1984 .

[13]  David Malah,et al.  Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..