First Stereo Audio Source Separation Evaluation Campaign: Data, Algorithms and Results

This article provides an overview of the first stereo audio source separation evaluation campaign, organized by the authors. Fifteen underdetermined stereo source separation algorithms have been applied to various audio data, including instantaneous, convolutive and real mixtures of speech or music sources. The data and the algorithms are presented and the estimated source signals are compared to reference signals using several objective performance criteria.

[1]  Jean-François Cardoso,et al.  Multidimensional independent component analysis , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[2]  Paris Smaragdis,et al.  Evaluation of blind signal separation methods , 1999 .

[3]  Michael Zibulevsky,et al.  Underdetermined blind source separation using sparse representations , 2001, Signal Process..

[4]  Noboru Ohnishi,et al.  A survey of the performance indexes of ICA algorithms , 2002 .

[5]  Bruce C. Wheeler,et al.  Localization of nonstationary sources using a coherence test , 2003, IEEE Workshop on Statistical Signal Processing, 2003.

[6]  Douglas L. Jones,et al.  Performance of time- and frequency-domain binaural beamformers based on recorded signals from real rooms. , 2004, The Journal of the Acoustical Society of America.

[7]  Barak A. Pearlmutter,et al.  Soft-LOST: EM on a Mixture of Oriented Lines , 2004, ICA.

[8]  Scott Rickard,et al.  Blind separation of speech mixtures via time-frequency masking , 2004, IEEE Transactions on Signal Processing.

[9]  Dan Barry,et al.  Real-time Sound Source Separation: Azimuth Discrimination and Resynthesis , 2004 .

[10]  Ming Xiao,et al.  A Novel Approach for Underdetermined Blind Sources Separation in Frequency Domain , 2005, ISNN.

[11]  Zhang Yi,et al.  Advances in Neural Networks - ISNN 2005, Second International Symposium on Neural Networks, Chongqing, China, May 30 - June 1, 2005, Proceedings, Part II , 2005, ISNN.

[12]  Ming Xiao,et al.  A statistically sparse decomposition principle for underdetermined blind source separation , 2005, 2005 International Symposium on Intelligent Signal Processing and Communication Systems.

[13]  Enric Monte-Moreno,et al.  Underdetermined Convoluted Source Reconstruction Using LP and SOCP, and a Neural Approximator of the Optimizer , 2006, ICA.

[14]  Rémi Gribonval,et al.  Performance measurement in blind audio source separation , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[15]  Rémi Gribonval,et al.  A Robust Method to Count and Locate Audio Sources in a Stereophonic Linear Instantaneous Mixture , 2006, ICA.

[16]  Allan Kardec Barros,et al.  Independent Component Analysis and Blind Source Separation , 2007, Signal Processing.

[17]  Hiroshi Sawada,et al.  Blind Speech Separation in a Meeting Situation with Maximum SNR Beamformers , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[18]  Ahmed H. Tewfik,et al.  Two Improved Sparse Decomposition Methods for Blind Source Separation , 2007, ICA.

[19]  Emmanuel Vincent,et al.  Complex Nonconvex l p Norm Minimization for Underdetermined Source Separation , 2007, ICA.

[20]  Te-Won Lee,et al.  Blind Source Separation Exploiting Higher-Order Frequency Dependencies , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[21]  Nikolaos Mitianoudis,et al.  Underdetermined Source Separation Using Mixtures of Warped Laplacians , 2007, ICA.

[22]  Hiroshi Sawada,et al.  Measuring Dependence of Bin-wise Separated Signals for Permutation Alignment in Frequency-domain BSS , 2007, 2007 IEEE International Symposium on Circuits and Systems.

[23]  Thomas Hofmann,et al.  An EM Algorithm for Localizing Multiple Sound Sources in Reverberant Environments , 2007 .

[24]  Hiroshi Sawada,et al.  Blind Source Separation Based on a Beamformer Array and Time Frequency Binary Masking , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[25]  Shigeki Sagayama,et al.  Sparseness-Based 2CH BSS using the EM Algorithm in Reverberant Environment , 2007, 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.