Identifying Single Source Data for Mixing Matrix Estimation in Instantaneous Blind Source Separation

This paper presents a simple yet effective way of improving the estimate of the mixing matrix in instantaneous blind source separation by using only reliable data. Single-source data are detected by selecting only those data points whose spatial signature remains the same over two consecutive frames. Such data, which most likely belong to a single source, are then used to accurately identify the spatial directions of the sources and, hence, the mixing matrix. The paper also presents a refined histogram procedure, which improves on the potential function method for estimating the mixing matrix in the two-dimensional case (two sensors). The approach was evaluated experimentally and submitted to the first Stereo Audio Source Separation Evaluation Campaign (SASSEC), with good results in matrix estimation on both development and test data.
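The selection idea can be illustrated with a minimal sketch for the two-sensor case. It is not the paper's implementation: the function names, the angular tolerance `tol`, the energy floor `min_energy`, the bin count, and the simple peak picking are all assumptions made for illustration, and a plain magnitude-weighted histogram stands in for the refined histogram procedure. The input `x` is assumed to be a real-valued 2 x n_frames array of mixture coefficients (for instance, one frequency bin of a sparse transform).

```python
import numpy as np

def single_source_mask(x, tol=0.02, min_energy=1e-3):
    """Flag frames whose direction matches that of the previous frame.

    x: real array of shape (2, n_frames). tol (radians) and min_energy
    are illustrative assumptions, not values from the paper.
    """
    theta = np.mod(np.arctan2(x[1], x[0]), np.pi)  # direction in [0, pi)
    mag = np.hypot(x[0], x[1])
    d = np.abs(np.diff(theta))
    d = np.minimum(d, np.pi - d)                   # directions wrap at pi
    keep = np.zeros(x.shape[1], dtype=bool)
    keep[1:] = (d < tol) & (mag[1:] > min_energy) & (mag[:-1] > min_energy)
    return keep, theta, mag

def estimate_directions(theta, mag, keep, n_sources, n_bins=180):
    """Pick the n_sources highest histogram peaks as mixing directions."""
    hist, edges = np.histogram(theta[keep], bins=n_bins,
                               range=(0.0, np.pi), weights=mag[keep])
    centers = 0.5 * (edges[:-1] + edges[1:])
    # Circular local maxima, since direction 0 and direction pi coincide.
    is_peak = (hist > np.roll(hist, 1)) & (hist >= np.roll(hist, -1))
    peaks = np.flatnonzero(is_peak)
    top = peaks[np.argsort(hist[peaks])[::-1][:n_sources]]
    angles = np.sort(centers[top])
    return np.vstack([np.cos(angles), np.sin(angles)])  # unit-norm columns

def synthetic_mixture(angles, n_frames=3000, block=5):
    """Toy data: blocks where one source is active, then a mixed block."""
    A = np.vstack([np.cos(angles), np.sin(angles)])
    n = len(angles)
    t = np.arange(n_frames)
    which = (t // block) % (n + 1)
    amp = np.where(t % 2 == 0, 1.0, -1.0) * (1.0 + 0.5 * np.sin(t))
    s = np.zeros((n, n_frames))
    for k in range(n):
        solo = which == k
        s[k, solo] = amp[solo]                 # source k alone
        mixed = which == n
        s[k, mixed] = np.cos((k + 1) * t[mixed] + k)  # all sources overlap
    return A, A @ s

if __name__ == "__main__":
    true_angles = np.array([0.3, 1.2, 2.4])
    A, x = synthetic_mixture(true_angles)
    keep, theta, mag = single_source_mask(x)
    A_hat = estimate_directions(theta, mag, keep, n_sources=3)
    print("frames kept:", int(keep.sum()), "of", x.shape[1])
    print("estimated angles:", np.arctan2(A_hat[1], A_hat[0]))
```

In this toy setting, frames where several sources overlap change direction from one frame to the next and are mostly discarded, so the histogram is built almost entirely from single-source frames. The recovered columns are defined only up to sign and ordering, and their accuracy is limited by the bin width.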
