A New Initialization Method for Frequency-Domain Blind Source Separation Algorithms

Frequency-domain blind source separation (BSS) techniques have been proposed with the objective of increasing the speed and/or reducing the computational complexity of conventional algorithms, mainly for applications that involve large convolutive mixture filters. In this letter, we present a new initialization method for frequency-domain BSS algorithms employing estimates of the directions of arrival and time-frequency masking, that outperforms the classical pre-whitening initialization technique. Performance and convergence time results of the proposed approach are presented using a frequency-domain BSS algorithm that exploits higher-order frequency dependencies, employing real conference room recordings and simulated data.

[1]  Scott Rickard,et al.  Blind separation of speech mixtures via time-frequency masking , 2004, IEEE Transactions on Signal Processing.

[2]  N. Hamada,et al.  Time-Frequency Masking Method Using Wavelet Transform for BSS Problem , 2006, TENCON 2006 - 2006 IEEE Region 10 Conference.

[3]  Hiroshi Sawada,et al.  Blind extraction of a dominant source from mixtures of many sources using ICA and time-frequency masking , 2005, 2005 IEEE International Symposium on Circuits and Systems.

[4]  Hiroshi Sawada,et al.  Underdetermined sparse source separation of convolutive mixtures with observation vector clustering , 2006, 2006 IEEE International Symposium on Circuits and Systems.

[5]  S. Haykin,et al.  Adaptive Filter Theory , 1986 .

[6]  Hiroshi Sawada,et al.  Underdetermined blind separation for speech in real environments with sparseness and ICA , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  Simon Haykin,et al.  Adaptive Filter Theory 4th Edition , 2002 .

[8]  Te-Won Lee,et al.  Blind Source Separation Exploiting Higher-Order Frequency Dependencies , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[9]  Hiroshi Sawada,et al.  Reducing musical noise by a fine-shift overlap-add method applied to source separation using a time-frequency mask , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[10]  K. Matsuoka,et al.  Minimal distortion principle for blind source separation , 2002, Proceedings of the 41st SICE Annual Conference. SICE 2002..

[11]  Zengfeng Wang Comparison of Four Kinds of Fuzzy C-Means Clustering Methods , 2010, 2010 Third International Symposium on Information Processing.

[12]  V. G. Reju,et al.  A Robust Correlation Method for Solving Permutation Problem in Frequency Domain Blind Source Separation of Speech Signals , 2006, APCCAS 2006 - 2006 IEEE Asia Pacific Conference on Circuits and Systems.