Noise Estimation Using Mean Square Cross Prediction Error for Speech Enhancement

This paper demonstrates the feasibility of extracting noise from noisy speech and presents a two-stage approach to speech enhancement. In the first stage, a previously proposed blind source extraction algorithm based on the mean square cross prediction error (MSCPE) extracts the additive noise from the noisy speech signal. In the second stage, a modified spectral subtraction method and a modified Wiener filter are proposed to recover the speech signal, both of which use all the frequency spectra of the extracted noise. A theoretical justification shows that the MSCPE-based algorithm can extract the desired signal from mixed sources. Experimental results show that, at SNR = 0 dB, the average correlation coefficient between the extracted noise and the original additive noise exceeds 85% for Gaussian noise and 75% for real-world noise, and that the proposed speech enhancement approaches outperform conventional methods such as spectral subtraction and the Wiener filter.
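To make the second stage concrete, the following is a minimal sketch of how an extracted noise signal could drive frame-by-frame spectral subtraction or a Wiener-type gain. It is not the paper's modified method, whose details the abstract does not give: it uses the textbook power spectral subtraction rule and a textbook Wiener gain. The MSCPE extraction stage is not implemented; `make_demo_signals` is a hypothetical helper that simulates its output by perturbing the true noise. The frame length, hop size, spectral floor, and all function names are assumptions chosen for illustration.

```python
import numpy as np

FRAME_LEN = 256  # assumed frame length in samples
HOP = 128        # assumed hop size (50% overlap)

def stft(x):
    """Short-time Fourier transform with a periodic Hann analysis window."""
    window = np.hanning(FRAME_LEN + 1)[:-1]
    n_frames = 1 + (len(x) - FRAME_LEN) // HOP
    frames = np.stack([x[i * HOP:i * HOP + FRAME_LEN] * window
                       for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)

def istft(spec):
    """Overlap-add inverse. With a periodic Hann window and 50% overlap the
    shifted windows sum to one, so interior samples are reconstructed."""
    frames = np.fft.irfft(spec, n=FRAME_LEN, axis=1)
    out = np.zeros(HOP * (len(frames) - 1) + FRAME_LEN)
    for i, frame in enumerate(frames):
        out[i * HOP:i * HOP + FRAME_LEN] += frame
    return out

def enhance(noisy, noise_est, mode="subtract", floor=0.01):
    """Apply a per-frame, per-bin gain computed from the noise estimate.

    mode="subtract": power spectral subtraction with a spectral floor.
    mode="wiener":   Wiener-type gain from the subtracted power estimate.
    Both are textbook rules, not the paper's modified versions.
    """
    Y = stft(noisy)
    noisy_pow = np.abs(Y) ** 2
    noise_pow = np.abs(stft(noise_est)) ** 2  # noise spectrum of every frame
    if mode == "subtract":
        clean_pow = np.maximum(noisy_pow - noise_pow, floor * noisy_pow)
        gain = np.sqrt(clean_pow / (noisy_pow + 1e-12))
    elif mode == "wiener":
        clean_pow = np.maximum(noisy_pow - noise_pow, 0.0)
        gain = clean_pow / (clean_pow + noise_pow + 1e-12)
    else:
        raise ValueError("mode must be 'subtract' or 'wiener'")
    return istft(gain * Y)  # keep the noisy phase

def snr_db(clean, estimate):
    """Signal-to-noise ratio of an estimate relative to the clean signal."""
    err = clean - estimate
    return 10.0 * np.log10(np.sum(clean ** 2) / np.sum(err ** 2))

def make_demo_signals(seed=0, n=8192):
    """Hypothetical test data: two tones stand in for speech, Gaussian noise
    is added at 0 dB SNR, and a perturbed copy of the noise stands in for
    the output of the noise extraction stage."""
    rng = np.random.default_rng(seed)
    t = np.arange(n)
    clean = np.sin(2 * np.pi * 0.05 * t) + 0.5 * np.sin(2 * np.pi * 0.11 * t)
    noise = rng.standard_normal(n)
    noise *= np.sqrt(np.sum(clean ** 2) / np.sum(noise ** 2))  # 0 dB SNR
    noise_est = noise + 0.5 * np.std(noise) * rng.standard_normal(n)
    return clean, clean + noise, noise, noise_est
```

A typical use would be `clean, noisy, noise, noise_est = make_demo_signals()` followed by `enhance(noisy, noise_est, mode="wiener")`. The quality of the noise estimate can be checked with `np.corrcoef(noise, noise_est)[0, 1]`, which is the kind of correlation coefficient the abstract reports. Because the first and last half-frames are attenuated by the window, comparisons are best made on the interior samples.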
