Improved embedded pre-whitening subspace approach for enhancing speech contaminated by colored noise

Abstract An improved embedded pre-whitening subspace approach based on spectral domain constraints is proposed for enhancement of speech contaminated by colored noise. The main contribution of this work is to propose a particular non-unitary spectral transformation of the residual noise. This non-unitary transform is based on simultaneous diagonalization of the clean speech and noise covariance matrices. With this particular transformation, the optimization problem results from subspace approach based on spectral domain constraints can be solved without any restrictions on the form of contributed matrices. Under some theoretical assumptions the proposed method is reduced to the Hu and Loizou method, which is similar to the proposed method except for the gain matrix. Comparing with the Hu and Loizou method, speech enhancement measures for our approach show significant improvement in enhancing the TIMIT sentences corrupted by brown, pink and multi-talker babble noises. Also, comparisons with two non embedded pre-whitening subspace methods, PKLT and PKLT–VRE, show that the proposed method is comparable with these two methods.

[1]  Perceptual subspace speech enhancement using variance of the reconstruction error , 2014, Digit. Signal Process..

[2]  George Carayannis,et al.  Speech enhancement from noise: A regenerative approach , 1991, Speech Commun..

[3]  Nam C. Phamdo,et al.  Signal/noise KLT based approach for enhancing speech degraded by colored noise , 2000, IEEE Trans. Speech Audio Process..

[4]  Saeed Gazor,et al.  An adaptive KLT approach for speech enhancement , 2001, IEEE Trans. Speech Audio Process..

[5]  Ronald W. Schafer,et al.  Theory and Applications of Digital Speech Processing , 2010 .

[6]  O. Nelles,et al.  An Introduction to Optimization , 1996, IEEE Antennas and Propagation Magazine.

[7]  Yariv Ephraim,et al.  A signal subspace approach for speech enhancement , 1995, IEEE Trans. Speech Audio Process..

[8]  Philipos C. Loizou,et al.  Speech Enhancement: Theory and Practice , 2007 .

[9]  Richard H. Bartels,et al.  Algorithm 432 [C2]: Solution of the matrix equation AX + XB = C [F4] , 1972, Commun. ACM.

[10]  Yi Hu,et al.  A generalized subspace approach for enhancing speech corrupted by colored noise , 2003, IEEE Trans. Speech Audio Process..

[11]  Søren Holdt Jensen,et al.  Reduction of broad-band noise in speech by truncated QSVD , 1995, IEEE Trans. Speech Audio Process..

[12]  Yi Hu,et al.  Evaluation of Objective Quality Measures for Speech Enhancement , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[13]  Y. Ephraim,et al.  Extension of the signal subspace speech enhancement approach to colored noise , 2003, IEEE Signal Processing Letters.

[14]  Benoît Champagne,et al.  Incorporating the human hearing properties in the signal subspace approach for speech enhancement , 2003, IEEE Trans. Speech Audio Process..

[15]  Susanto Rahardja,et al.  Audible Noise Reduction in Eigendomain for Speech Enhancement , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[16]  Zhongfu Ye,et al.  A Higher Order Subspace Algorithm for Multichannel Speech Enhancement , 2015, IEEE Signal Processing Letters.

[17]  Yi Hu,et al.  A perceptually motivated approach for speech enhancement , 2003, IEEE Trans. Speech Audio Process..

[18]  Kiyohiro Shikano,et al.  Musical-Noise-Free Speech Enhancement Based on Optimized Iterative Spectral Subtraction , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[19]  Edwin K. P. Chong,et al.  An Introduction to Optimization: Chong/An Introduction , 2008 .

[20]  P. Hansen,et al.  Prewhitening for rank-deficient noise in subspace methods for noise reduction , 2005, IEEE Transactions on Signal Processing.

[21]  Carl D. Meyer,et al.  Matrix Analysis and Applied Linear Algebra , 2000 .