Joint Soft Threshold and Statistical Estimation for Speech Enhancement

This paper presents a novel speech enhancement method that combines sigmoid shrinkage with a Bayesian estimator. The main idea is to apply joint detection and estimation to the noisy speech before a standard minimum mean-squared error (MMSE) estimator. The proposed method can therefore take advantage of both basic approaches to improving the quality of noisy speech. Experiments on speech corrupted by stationary and non-stationary noise show that the proposed approach is promising compared with classical methods in terms of objective and pseudo-subjective measures.
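The two-stage idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the sigmoid-shaped gain, the Wiener gain used as a stand-in for the MMSE stage, and all names and parameters (`threshold`, `slope`, `noise_power`, `enhance_frame`) are assumptions chosen for illustration. The noise power is assumed to be known, whereas in practice it would have to be estimated.

```python
import numpy as np

def sigmoid_shrinkage_gain(amplitude, threshold, slope):
    """Assumed smooth gain in (0, 1): close to 0 well below the
    threshold and close to 1 well above it."""
    return 1.0 / (1.0 + np.exp(-slope * (amplitude - threshold)))

def wiener_gain(amplitude, noise_power):
    """Wiener gain, used here as a simple stand-in for an MMSE estimator.
    The a priori SNR is approximated by max(a posteriori SNR - 1, 0)."""
    snr_post = amplitude ** 2 / noise_power
    snr_prio = np.maximum(snr_post - 1.0, 0.0)
    return snr_prio / (1.0 + snr_prio)

def enhance_frame(noisy_spectrum, noise_power, threshold, slope):
    """Apply shrinkage first, then the MMSE-type gain, keeping the noisy phase."""
    amplitude = np.abs(noisy_spectrum)
    # Stage 1: soft detection, attenuating bins that look noise-only.
    shrunk = sigmoid_shrinkage_gain(amplitude, threshold, slope) * amplitude
    # Stage 2: statistical estimation on the pre-processed amplitudes.
    estimate = wiener_gain(shrunk, noise_power) * shrunk
    return estimate * np.exp(1j * np.angle(noisy_spectrum))

# One hypothetical frame of three DFT coefficients with unit noise power.
noisy = np.array([0.1 + 0.0j, 10.0 + 0.0j, 0.0 + 5.0j])
enhanced = enhance_frame(noisy, noise_power=1.0, threshold=2.0, slope=4.0)
```

In this sketch the low-amplitude bin is suppressed almost entirely, while the high-amplitude bins pass through the first stage nearly unchanged and are only mildly attenuated by the second stage.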
