Speech enhancement by combining statistical estimators of speech and noise

This paper presents a novel speech enhancement algorithm that can substantially improve the signal-to-residual spectrum ratio by combining statistical estimators of the spectral magnitude of the speech and noise. The noise spectral magnitude estimator is derived from the speech magnitude estimator, by appropriately transforming the a priori and the a posteriori SNR values. By expressing the signal-to-residual spectrum ratio as a function of the estimator's gain function, we derive a hybrid strategy that can improve the signal-to-residual spectrum ratio when the a priori and the a posteriori SNR are detected to be lower than 0 dB. Experimental results showed that the signal-to-residual spectrum ratio as well as the PESQ scores can be improved substantially in stationary and quasi-stationary noise conditions with the proposed hybrid estimators. Informal listening tests revealed improved speech quality and no musical noise.

[1]  Simon J. Godsill,et al.  Efficient Alternatives to the Ephraim and Malah Suppression Rule for Audio Signal Enhancement , 2003, EURASIP J. Adv. Signal Process..

[2]  Philipos C. Loizou,et al.  Speech Enhancement: Theory and Practice , 2007 .

[3]  Yi Hu,et al.  Objective measures for predicting speech intelligibility in noisy conditions based on new band-importance functions. , 2009, The Journal of the Acoustical Society of America.

[4]  Rongshan Yu A low-complexity noise estimation algorithm based on smoothing of noise power estimation and estimation bias correction , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[5]  Ronald E. Crochiere,et al.  A study of complexity and quality of speech waveform coders , 1978, ICASSP.

[6]  Ephraim Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .

[7]  Yi Hu,et al.  Evaluation of Objective Quality Measures for Speech Enhancement , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  David Malah,et al.  Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..

[9]  Schuyler Quackenbush,et al.  Objective measures of speech quality , 1995 .

[10]  Yi Hu,et al.  Subjective comparison and evaluation of speech enhancement algorithms , 2007, Speech Commun..

[11]  I. Cohen,et al.  Noise estimation by minima controlled recursive averaging for robust speech enhancement , 2002, IEEE Signal Processing Letters.