论文信息 - Joint Soft Threshold and Statistical Estimation for Speech Enhancement

Joint Soft Threshold and Statistical Estimation for Speech Enhancement

This paper presents a novel method for speech enhancement based on the combination of sigmoid shrinkage and bayesian estimator. The main idea is to apply a joint detection and estimation to noisy speech before using a standard minimum-mean-squared-error (MMSE) estimator. Hence, the proposed method can take advantage of two basic approaches for improving the quality of noisy speech. Experiments performed on stationary and non-stationary noisy speech signals show that the proposed approach is promising when compared to classical methods, in terms of objective and pseudo-subjective measurements.

[1] Jesper Jensen,et al. DFT-Domain Based Single-Microphone Noise Reduction for Speech Enhancement , 2013, DFT-Domain Based Single-Microphone Noise Reduction for Speech Enhancement.

[2] DeLiang Wang,et al. Speech intelligibility in background noise with ideal binary time-frequency masking. , 2009, The Journal of the Acoustical Society of America.

[3] Abdourrahmane M. Atto,et al. Detection threshold for non-parametric estimation , 2008, Signal Image Video Process..

[4] Yang Lu,et al. Estimators of the Magnitude-Squared Spectrum and Methods for Incorporating SNR Uncertainty , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[5] Jesper Jensen,et al. An Algorithm for Intelligibility Prediction of Time–Frequency Weighted Noisy Speech , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[6] Jesper Jensen,et al. Spectral Magnitude Minimum Mean-Square Error Estimation Using Binary and Continuous Gain Functions , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[7] Eric Plourde,et al. Generalized Bayesian Estimators of the Spectral Amplitude for Speech Enhancement , 2009, IEEE Signal Processing Letters.

[8] Philipos C. Loizou,et al. Speech enhancement based on perceptually motivated bayesian estimators of the magnitude spectrum , 2005, IEEE Transactions on Speech and Audio Processing.

[9] Abdeldjalil Aïssa-El-Bey,et al. Robust Estimation of Non-Stationary Noise Power Spectrum for Speech Enhancement , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[10] Dominique Pastor,et al. Random Distortion Testing and Optimality of Thresholding Tests , 2013, IEEE Transactions on Signal Processing.

[11] Aliakbar Tadaion,et al. Joint Detection and Estimation of Speech Spectral Amplitude Using Noncontinuous Gain Functions , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[12] Philipos C. Loizou,et al. Speech Enhancement: Theory and Practice , 2007 .

[13] Ephraim. Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .

[14] Israel Cohen,et al. Simultaneous Detection and Estimation Approach for Speech Enhancement , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[15] Rosângela Coelho,et al. Speech Enhancement with Nonstationary Acoustic Noise Detection in Time Domain , 2016, IEEE Signal Processing Letters.

[16] David Malah,et al. Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..

[17] Susanto Rahardja,et al. /spl beta/-order MMSE spectral amplitude estimation for speech enhancement , 2005, IEEE Transactions on Speech and Audio Processing.

[18] Philipos C. Loizou,et al. Improving Speech Intelligibility in Noise Using a Binary Mask That Is Based on Magnitude Spectrum Constraints , 2010, IEEE Signal Processing Letters.

[19] David L. Donoho,et al. De-noising by soft-thresholding , 1995, IEEE Trans. Inf. Theory.

[20] Philipos C. Loizou,et al. Reasons why Current Speech-Enhancement Algorithms do not Improve Speech Intelligibility and Suggested Solutions , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[21] Abdourrahmane M. Atto,et al. Smooth sigmoid wavelet shrinkage for non-parametric estimation , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[22] Yi Hu,et al. Evaluation of Objective Quality Measures for Speech Enhancement , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[23] I. Johnstone,et al. Ideal spatial adaptation by wavelet shrinkage , 1994 .