Blind Source Separation with Distributed Microphone Pairs Using Permutation Correction by Intra-Pair TDOA Clustering

In this paper, we present a novel framework of distributed microphone array for blind source separation (BSS), where stereo microphones or proximately-placed microphone pairs are distributed. Unlike distributing all microphones individually, the time difference of arrival (TDOA) in the paired channels can be robustly estimated without suffering spatial aliasing. Based on it, sound sources are separated by the frequency-domain independent component analysis (ICA) with the permutation correction by clustering the intra-pair TDOAs. The experimental results in real reverberant environment are also shown.

[1]  Biing-Hwang Juang,et al.  Blind Source Separation of Acoustic Mixtures with Distributed Microphones , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[2]  Andreas Ziehe,et al.  An approach to blind source separation based on temporal structure of speech signals , 2001, Neurocomputing.

[3]  Hiroshi Sawada,et al.  Underdetermined blind sparse source separation for arbitrarily arranged multiple sensors , 2007, Signal Process..

[4]  Nobutaka Ito,et al.  Blind alignment of asynchronously recorded signals for distributed microphone array , 2009, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[5]  Hiroshi Sawada,et al.  Near-field frequency domain blind source separation for convolutive mixtures , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  Zicheng Liu SOUND SOURCE SEPARATION WITH DISTRIBUTED MICROPHONE ARRAYS IN THE PRESENCE OF CLOCK SYNCHRONIZATION ERRORS , 2008 .

[7]  Francesco Nesta,et al.  Cooperative Wiener-ICA for source localization and Separation by distributed microphone arrays , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[8]  Te-Won Lee,et al.  Blind Source Separation Exploiting Higher-Order Frequency Dependencies , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[9]  Kazuya Takeda,et al.  Evaluation of blind signal separation method using directivity pattern under reverberant conditions , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[10]  Shiro Ikeda,et al.  A METHOD OF ICA IN TIME-FREQUENCY DOMAIN , 2003 .

[11]  Carlos Busso,et al.  Smart room: participant and speaker localization and identification , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[12]  Seungjin Choi,et al.  Independent Component Analysis , 2009, Handbook of Natural Computing.

[13]  Hiroshi Sawada,et al.  Geometrical Interpretation of the PCA Subspace Approach for Overdetermined Blind Source Separation , 2006, EURASIP J. Adv. Signal Process..

[14]  Kazuya Takeda,et al.  Encoding large array signals into a 3D sound field representation for selective listening point audio based on blind source separation , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[15]  Shun-ichi Amari,et al.  Natural Gradient Works Efficiently in Learning , 1998, Neural Computation.

[16]  Hiroshi Sawada,et al.  A robust approach to the permutation problem of frequency-domain blind source separation , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[17]  Jacek Dmochowski,et al.  Blind source separation in a distributed microphone meeting environment for improved teleconferencing , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[18]  Biing-Hwang Juang,et al.  A study on recognizing distorted speech over local distributed transducer networks , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.