Blind Speech Separation in Presence of Correlated Noise with Generalized Eigenvector Beamforming

This paper considers convolutive blind source separation of speech sources in the presence of spatially correlated noise. We introduce a method for estimating the scaled mixing matrix from the sources to the microphones even when coherent noise is present. This is achieved by combining time-frequency sparseness with the generalized eigenvalue decomposition of the power spectral density (PSD) matrices of the noisy speech and noise-only microphone signals. Separation is performed by spatial filtering with coefficients constructed by Gram-Schmidt orthogonalization, which places a spatial null in the interferer's direction. Experimental results show that our approach is capable of separating two sources in reverberant environments (RT60 from 0 ms to 500 ms) degraded by significant directional noise.
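The two ingredients named above can be illustrated for a single frequency bin. The following is a minimal sketch, not the paper's implementation. It assumes that only one source is active in the analyzed time-frequency region (sparseness), that the PSD matrices are known exactly, and that the noisy-speech PSD follows the rank-one model `phi_x = sigma_s^2 * h h^H + phi_n`. Under that model the principal generalized eigenvector `w` of the pair `(phi_x, phi_n)` is proportional to `phi_n^{-1} h`, so `phi_n @ w` recovers the mixing vector up to a complex scale. The function names, the normalization to the first microphone, and the toy noise model are assumptions made for illustration, and the nulling filter shown ignores the noise term.

```python
import numpy as np

def principal_gev(phi_x, phi_n):
    """Largest generalized eigenpair of phi_x w = lam * phi_n w."""
    # Whiten with the Cholesky factor of the noise PSD: phi_n = L L^H.
    L = np.linalg.cholesky(phi_n)
    L_inv = np.linalg.inv(L)
    A = L_inv @ phi_x @ L_inv.conj().T
    A = 0.5 * (A + A.conj().T)          # enforce Hermitian symmetry numerically
    vals, vecs = np.linalg.eigh(A)      # eigenvalues in ascending order
    w = L_inv.conj().T @ vecs[:, -1]    # map back to the unwhitened domain
    return vals[-1], w

def estimate_mixing_vector(phi_x, phi_n):
    """Scaled mixing vector estimate, normalized to microphone 0 (assumption)."""
    _, w = principal_gev(phi_x, phi_n)
    h = phi_n @ w
    return h / h[0]

def null_filter(h_target, h_interf):
    """One Gram-Schmidt step: remove the interferer component from the target.

    The result f satisfies f^H h_interf = 0, i.e. a spatial null on the interferer.
    """
    proj = (h_interf.conj() @ h_target) / (h_interf.conj() @ h_interf)
    return h_target - proj * h_interf

# Toy data for one frequency bin with M = 4 microphones (all values hypothetical).
rng = np.random.default_rng(0)
M = 4
h1 = rng.standard_normal(M) + 1j * rng.standard_normal(M)  # source 1 mixing vector
h2 = rng.standard_normal(M) + 1j * rng.standard_normal(M)  # source 2 mixing vector
d = rng.standard_normal(M) + 1j * rng.standard_normal(M)   # directional noise path

# Spatially correlated noise: one coherent component plus weak sensor noise.
phi_n = 1.5 * np.outer(d, d.conj()) + 0.1 * np.eye(M)
# Noisy-speech PSDs for regions where only source 1 or only source 2 is active.
phi_x1 = 2.0 * np.outer(h1, h1.conj()) + phi_n
phi_x2 = 0.7 * np.outer(h2, h2.conj()) + phi_n

h1_hat = estimate_mixing_vector(phi_x1, phi_n)
h2_hat = estimate_mixing_vector(phi_x2, phi_n)

f1 = null_filter(h1_hat, h2_hat)         # passes source 1, nulls source 2
leak = abs(f1.conj() @ h2)               # response toward the interferer
gain = abs(f1.conj() @ h1)               # response toward the target
```

With exact PSD matrices the estimates match `h1 / h1[0]` and `h2 / h2[0]`, and `leak` is zero up to rounding. With PSDs estimated from finite data, both the mixing vector estimates and the null depth would be approximate.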
