Speech enhancement with adaptive spectral estimators

Common statistical estimators for speech enhancement rely on several assumptions about statistical properties of speech and noise processes. In real applications, these assumptions may not be always satisfied due to the effects of a nonstationary environment. In this work, we propose new robust spectral estimators for speech enhancement by incorporation of calculation of rank-order statistics to existing speech enhancement estimators. The proposed estimators are better adapted to nonstationary characteristics of speech signals and noise processes in real environments. By means of computer simulations, we show that the proposed estimators outperform the known estimators in terms of objective criteria of quality.

[1]  Peter Vary,et al.  Speech Enhancement by MAP Spectral Amplitude Estimation Using a Super-Gaussian Speech Model , 2005, EURASIP J. Adv. Signal Process..

[2]  Tae-Sun Choi,et al.  Improved motion stereo matching based on a modified dynamic programming , 2001 .

[3]  Murray Eden,et al.  Fundamentals of Digital Optics , 1996 .

[4]  Vitaly Kober,et al.  Robust speech processing using local adaptive non-linear filtering , 2013, IET Signal Process..

[5]  Joon-Hyuk Chang,et al.  Spectral enhancement based on global soft decision , 2000, IEEE Signal Process. Lett..

[6]  Rémi Gribonval,et al.  Performance measurement in blind audio source separation , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  R. McAulay,et al.  Speech enhancement using a soft-decision noise suppression filter , 1980 .

[8]  Jesper Jensen,et al.  An Algorithm for Intelligibility Prediction of Time–Frequency Weighted Noisy Speech , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[9]  Guo-Hong Ding,et al.  Suppression of additive noise using a power spectral density MMSE estimator , 2004, IEEE Signal Processing Letters.

[10]  Andries P. Hekstra,et al.  Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[11]  Rainer Martin,et al.  Speech enhancement based on minimum mean-square error estimation and supergaussian priors , 2005, IEEE Transactions on Speech and Audio Processing.

[12]  Vitaly Kober,et al.  Nonlinear filters with spatially-connected neighborhoods , 2001 .

[13]  David Malah,et al.  Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..

[14]  Philipos C. Loizou,et al.  Speech Enhancement: Theory and Practice , 2007 .