Paper information - Subspace based speech enhancement using Gaussian mixture model

Subspace based speech enhancement using Gaussian mixture model

Traditional subspace based speech enhancement (SSE) methods use linear minimum mean square error (LMMSE) estimation, which is optimal if the Karhunen-Loeve transform (KLT) coefficients of speech and noise are Gaussian distributed. In this paper, we investigate the use of a Gaussian mixture (GM) density for modeling the non-Gaussian statistics of the clean speech KLT coefficients. Using a Gaussian mixture model (GMM), the optimum minimum mean square error (MMSE) estimator is found to be nonlinear, and the traditional LMMSE estimator is shown to be a special case. Experimental results show that the proposed method provides better enhancement performance than the traditional subspace based methods.

Index Terms: Subspace based speech enhancement, Gaussian mixture density, MMSE estimation.
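The abstract's statement that the GMM-based MMSE estimator is nonlinear, with the LMMSE estimator as a special case, can be illustrated with a minimal scalar sketch. It is not the paper's implementation. It assumes a single transform coefficient observed as y = x + n, where x has a Gaussian mixture prior and n is zero-mean Gaussian noise independent of x. The function name `gmm_mmse_estimate` and all parameter values are hypothetical. Under these assumptions the conditional mean is a posterior-weighted sum of per-component linear (Wiener-type) estimates.

```python
import numpy as np

def gmm_mmse_estimate(y, weights, means, variances, noise_var):
    """Scalar MMSE estimate E[x | y] for y = x + n (illustrative sketch).

    Assumes x ~ sum_k weights[k] * N(means[k], variances[k]) and
    n ~ N(0, noise_var), independent of x.
    """
    y = np.atleast_1d(np.asarray(y, dtype=float))[:, None]   # shape (N, 1)
    w = np.asarray(weights, dtype=float)[None, :]            # shape (1, K)
    mu = np.asarray(means, dtype=float)[None, :]
    var = np.asarray(variances, dtype=float)[None, :]

    # Under component k, y is Gaussian with mean mu_k and variance var_k + noise_var.
    total_var = var + noise_var

    # Log of weights[k] * N(y; mu_k, total_var_k), normalized stably into posteriors.
    log_post = (np.log(w)
                - 0.5 * np.log(2.0 * np.pi * total_var)
                - 0.5 * (y - mu) ** 2 / total_var)
    log_post -= log_post.max(axis=1, keepdims=True)
    post = np.exp(log_post)
    post /= post.sum(axis=1, keepdims=True)

    # Per-component linear MMSE estimate: mu_k + gain_k * (y - mu_k).
    per_component = mu + (var / total_var) * (y - mu)

    # The posterior weights depend on y, which makes the overall estimator nonlinear.
    return (post * per_component).sum(axis=1)


y = np.array([-2.0, 0.5, 3.0])

# One zero-mean component: reduces to the linear gain var / (var + noise_var).
single = gmm_mmse_estimate(y, [1.0], [0.0], [4.0], noise_var=1.0)

# Two zero-mean components with different variances: the effective gain varies with y.
mixture = gmm_mmse_estimate(y, [0.7, 0.3], [0.0, 0.0], [0.1, 9.0], noise_var=1.0)
effective_gain = mixture / y
```

With one component the estimate is the observation scaled by a fixed gain, which is the linear special case. With two components the effective gain changes with the observation, so small observations are attenuated more strongly than large ones.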

Thippur V. Sreenivas | Saikat Chatterjee | Achintya Kundu

[1] S. Gazor, et al. Speech probability distribution, 2003, IEEE Signal Processing Letters.

[2] Søren Holdt Jensen, et al. Reduction of broad-band noise in speech by truncated QSVD, 1995, IEEE Trans. Speech Audio Process.

[3] H. Sorenson, et al. Recursive Bayesian estimation using Gaussian sums, 1971.

[4] Jesper Jensen, et al. Improved Subspace-Based Single-Channel Speech Enhancement Using Generalized Super-Gaussian Priors, 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[5] Michelle Effros, et al. Suboptimality of the Karhunen-Loeve transform for transform coding, 2003, IEEE Transactions on Information Theory.

[6] George Carayannis, et al. Speech enhancement from noise: A regenerative approach, 1991, Speech Commun.

[7] Y. Ephraim, et al. Extension of the signal subspace speech enhancement approach to colored noise, 2003, IEEE Signal Processing Letters.

[8] Lin-Shan Lee, et al. A Perceptually Constrained GSVD-Based Approach for Enhancing Speech Corrupted by Colored Noise, 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[9] I. Cohen, et al. Noise estimation by minima controlled recursive averaging for robust speech enhancement, 2002, IEEE Signal Processing Letters.

[10] Saeed Gazor, et al. An adaptive KLT approach for speech enhancement, 2001, IEEE Trans. Speech Audio Process.

[11] Philipos C. Loizou, et al. Speech Enhancement: Theory and Practice, 2007.

[12] Yi Hu, et al. A generalized subspace approach for enhancing speech corrupted by colored noise, 2003, IEEE Trans. Speech Audio Process.

[13] Thippur V. Sreenivas, et al. GMM based Bayesian approach to speech enhancement in signal / transform domain, 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[14] Nam C. Phamdo, et al. Signal/noise KLT based approach for enhancing speech degraded by colored noise, 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[15] Yariv Ephraim, et al. A signal subspace approach for speech enhancement, 1995, IEEE Trans. Speech Audio Process.