UNDERDETERMINED SPARSE BLIND SOURCE SEPARATION WITH DELAYS

In this paper, we address the problem of under-determined blind source separation (BSS), mainly for speech signals, in an anechoic environment. Our approach is based on exploiting the sparsity of Gabor expansions of speech signals. For parameter estimation, we adopt the clustering approach of DUET [19]. However, unlike in the case of DUET where only two mixtures are used, we use all available mixtures to get more precise estimates. For source extraction, we propose two methods, both of which are based on constrained optimization. Our first method uses a constrained l (0 < q ≤ 1) approach, and our second method uses a constrained “modified” l minimization approach. In both cases, our algorithms use all available mixtures, and are suited to the anechoic mixing scenario. Experiments indicate that the performances of the proposed algorithms are superior compared to DUET in many different settings.

[1]  V. Rich Personal communication , 1989, Nature.

[2]  Terrence J. Sejnowski,et al.  An Information-Maximization Approach to Blind Separation and Blind Deconvolution , 1995, Neural Computation.

[3]  Terrence J. Sejnowski,et al.  Blind source separation of more sources than mixtures using overcomplete representations , 1999, IEEE Signal Processing Letters.

[4]  Terrence J. Sejnowski,et al.  Learning Overcomplete Representations , 2000, Neural Computation.

[5]  D. Donoho Sparse Components of Images and Optimal Atomic Decompositions , 2001 .

[6]  Fabian J Theis,et al.  Formalization of the Two-Step Approach to Overcomplete BSS , 2002 .

[7]  R. Gribonval,et al.  Proposals for Performance Measurement in Source Separation , 2003 .

[8]  Michael Elad,et al.  Optimally sparse representation in general (nonorthogonal) dictionaries via ℓ1 minimization , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Pau Bofill,et al.  Underdetermined blind separation of delayed sound sources in the frequency domain , 2003, Neurocomputing.

[10]  Yuanqing Li,et al.  Sparse Representation and Its Applications in Blind Source Separation , 2003, NIPS.

[11]  S. Amari,et al.  SPARSE COMPONENT ANALYSIS FOR BLIND SOURCE SEPARATION WITH LESS SENSORS THAN SOURCES , 2003 .

[12]  Terrence J. Sejnowski,et al.  Complex Independent Component Analysis of Frequency-Domain Electroencephalographic Data , 2003, Neural Networks.

[13]  Scott Rickard,et al.  Blind separation of speech mixtures via time-frequency masking , 2004, IEEE Transactions on Signal Processing.

[14]  Dmitry M. Malioutov,et al.  A sparse signal reconstruction perspective for source localization with sensor arrays , 2005, IEEE Transactions on Signal Processing.

[15]  Barak A. Pearlmutter,et al.  Survey of sparse and non‐sparse methods in source separation , 2005, Int. J. Imaging Syst. Technol..

[16]  David L Donoho,et al.  Compressed sensing , 2006, IEEE Transactions on Information Theory.