Batch and Adaptive PARAFAC-Based Blind Separation of Convolutive Speech Mixtures

We present a frequency-domain technique based on PARAllel FACtor (PARAFAC) analysis that performs multichannel blind source separation (BSS) of convolutive speech mixtures. PARAFAC algorithms are combined with a dimensionality reduction step to significantly reduce computational complexity. The identifiability potential of PARAFAC is exploited to derive a BSS algorithm for the under-determined case (more speakers than microphones), combining PARAFAC analysis with time-varying Capon beamforming. Finally, a low-complexity adaptive version of the BSS algorithm is proposed that can track changes in the mixing environment. Extensive experiments with realistic and measured data corroborate our claims, including the under-determined case. Signal-to-interference ratio improvements of up to 6 dB are shown compared to state-of-the-art BSS algorithms, at an order of magnitude lower computational complexity.

[1]  A. Gorokhov,et al.  Subspace-based techniques for blind separation of convolutive mixtures with temporally correlated sources , 1997 .

[2]  Mohamed Sahmoudi,et al.  Blind Separation of Convolutive Mixtures using Nonstationarity and Fractional Lower Order Statistics (FLOS): Application to Audio Signals , 2006, Fourth IEEE Workshop on Sensor Array and Multichannel Processing, 2006..

[3]  Eric Moulines,et al.  A blind source separation technique using second-order statistics , 1997, IEEE Trans. Signal Process..

[4]  Hiroshi Sawada,et al.  A robust and precise method for solving the permutation problem of frequency-domain blind source separation , 2004, IEEE Transactions on Speech and Audio Processing.

[5]  S. Araki,et al.  MLSP 2007 Data Analysis Competition: Frequency-Domain Blind Source Separation for Convolutive Mixtures of Speech/Audio Signals , 2007, 2007 IEEE Workshop on Machine Learning for Signal Processing.

[6]  Pierre Comon,et al.  Enhanced Line Search: A Novel Method to Accelerate PARAFAC , 2008, SIAM J. Matrix Anal. Appl..

[7]  Birger Kollmeier,et al.  Amplitude Modulation Decorrelation For Convolutive Blind Source Separation , 2000 .

[8]  Lucas C. Parra,et al.  On-line Convolutive Blind Source Separation of Non-Stationary Signals , 2000, J. VLSI Signal Process..

[9]  V. Michael Bove,et al.  Blind Separation Of Real World Audio Signals Using Overdetermined Mixtures , 1999 .

[10]  Dinh-Tuan Pham,et al.  Blind separation of instantaneous mixtures of nonstationary sources , 2001, IEEE Trans. Signal Process..

[11]  A. Stegeman,et al.  On Kruskal's uniqueness condition for the Candecomp/Parafac decomposition , 2007 .

[12]  Rasmus Bro,et al.  A comparison of algorithms for fitting the PARAFAC model , 2006, Comput. Stat. Data Anal..

[13]  Christine Serviere,et al.  BLIND SEPARATION OF CONVOLUTIVE AUDIO MIXTURES USING NONSTATIONARITY , 2003 .

[14]  Nikos D. Sidiropoulos,et al.  On the effectiveness of PARAFAC-based estimation for blind speech separation , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[15]  Shoko Araki,et al.  ON-LINE TIME-DOMAIN BLIND SOURCE SEPARATION OF NONSTATIONARY CONVOLVED SIGNALS , 2003 .

[16]  James P. Reilly,et al.  A frequency domain method for blind source separation of convolutive audio mixtures , 2005, IEEE Transactions on Speech and Audio Processing.

[17]  Nikos D. Sidiropoulos,et al.  Parallel factor analysis in sensor array processing , 2000, IEEE Trans. Signal Process..

[18]  Richard A. Harshman,et al.  Foundations of the PARAFAC procedure: Models and conditions for an "explanatory" multi-model factor analysis , 1970 .

[19]  J. Cardoso,et al.  Blind beamforming for non-gaussian signals , 1993 .

[20]  Pierre Comon,et al.  Blind identification and source separation in 2×3 under-determined mixtures , 2004, IEEE Trans. Signal Process..

[21]  Lucas C. Parra,et al.  Convolutive blind separation of non-stationary sources , 2000, IEEE Trans. Speech Audio Process..

[22]  Nikolaos Mitianoudis,et al.  Audio source separation of convolutive mixtures , 2003, IEEE Trans. Speech Audio Process..

[23]  L. Lathauwer,et al.  Dimensionality reduction in higher-order signal processing and rank-(R1,R2,…,RN) reduction in multilinear algebra , 2004 .

[24]  Nikos D. Sidiropoulos,et al.  Blind PARAFAC receivers for DS-CDMA systems , 2000, IEEE Trans. Signal Process..

[25]  L. Lathauwer,et al.  An enhanced plane search scheme for complex-valued tensor decompositions , 2010 .

[26]  Andreas Ziehe,et al.  An approach to blind source separation based on temporal structure of speech signals , 2001, Neurocomputing.

[27]  Hiroshi Sawada,et al.  Frequency-Domain Blind Source Separation of Many Speech Signals Using Near-Field and Far-Field Models , 2006, EURASIP J. Adv. Signal Process..

[28]  David E. Booth,et al.  Multi-Way Analysis: Applications in the Chemical Sciences , 2005, Technometrics.

[29]  Nikos D. Sidiropoulos,et al.  Adaptive Algorithms to Track the PARAFAC Decomposition of a Third-Order Tensor , 2009, IEEE Transactions on Signal Processing.

[30]  Lieven De Lathauwer,et al.  Tensor-based techniques for the blind separation of DS-CDMA signals , 2007, Signal Process..

[31]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[32]  M.G. Bellanger,et al.  Digital processing of speech signals , 1980, Proceedings of the IEEE.

[33]  Dinh-Tuan Pham,et al.  Blind separation of speech mixtures based on nonstationarity , 2003, Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings..

[34]  P. Kroonenberg Applied Multiway Data Analysis , 2008 .

[35]  Dinh-Tuan Pham,et al.  Permutation Correction in the Frequency Domain in Blind Separation of Speech Mixtures , 2006, EURASIP J. Adv. Signal Process..

[36]  J. Kruskal Three-way arrays: rank and uniqueness of trilinear decompositions, with application to arithmetic complexity and statistics , 1977 .

[37]  Lieven De Lathauwer,et al.  Blind Identification of Underdetermined Mixtures by Simultaneous Matrix Diagonalization , 2008, IEEE Transactions on Signal Processing.

[38]  Simon Haykin,et al.  Development of a flexible, realistic hearing in noise test environment (R-HINT-E) , 2004, Signal Process..

[39]  Arie Yeredor,et al.  Non-orthogonal joint diagonalization in the least-squares sense with application in blind source separation , 2002, IEEE Trans. Signal Process..

[40]  Arogyaswami Paulraj,et al.  An analytical constant modulus algorithm , 1996, IEEE Trans. Signal Process..

[41]  K. Matsuoka,et al.  Minimal distortion principle for blind source separation , 2002, Proceedings of the 41st SICE Annual Conference. SICE 2002..

[42]  L. Lathauwer,et al.  Sufficient conditions for uniqueness in Candecomp/Parafac and Indscal with random component matrices , 2006, Psychometrika.

[43]  Miguel A. Carreira-Perpinan,et al.  Dimensionality Reduction , 2011 .

[44]  Philippe Loubaton,et al.  Subspace based techniques for second order blind separation of convolutive mixtures with temporally correlated sources , 1997 .

[45]  R. Bro PARAFAC. Tutorial and applications , 1997 .

[46]  Rasmus Bro,et al.  Multi-way Analysis with Applications in the Chemical Sciences , 2004 .

[47]  Lucas C. Parra,et al.  A SURVEY OF CONVOLUTIVE BLIND SOURCE SEPARATION METHODS , 2007 .

[48]  Lieven De Lathauwer,et al.  A Link between the Canonical Decomposition in Multilinear Algebra and Simultaneous Matrix Diagonalization , 2006, SIAM J. Matrix Anal. Appl..

[49]  Lieven De Lathauwer,et al.  An enhanced line search scheme for complex-valued tensor decompositions. Application in DS-CDMA , 2008, Signal Process..

[50]  Michael Greenacre,et al.  Multiway data analysis , 1992 .

[51]  Nikos D. Sidiropoulos,et al.  Blind Speech Separation Using Parafac Analysis and Integer Least Squares , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.