论文信息 - Batch and Adaptive PARAFAC-Based Blind Separation of Convolutive Speech Mixtures

Batch and Adaptive PARAFAC-Based Blind Separation of Convolutive Speech Mixtures

We present a frequency-domain technique based on PARAllel FACtor (PARAFAC) analysis that performs multichannel blind source separation (BSS) of convolutive speech mixtures. PARAFAC algorithms are combined with a dimensionality reduction step to significantly reduce computational complexity. The identifiability potential of PARAFAC is exploited to derive a BSS algorithm for the under-determined case (more speakers than microphones), combining PARAFAC analysis with time-varying Capon beamforming. Finally, a low-complexity adaptive version of the BSS algorithm is proposed that can track changes in the mixing environment. Extensive experiments with realistic and measured data corroborate our claims, including the under-determined case. Signal-to-interference ratio improvements of up to 6 dB are shown compared to state-of-the-art BSS algorithms, at an order of magnitude lower computational complexity.

[1] A. Gorokhov,et al. Subspace-based techniques for blind separation of convolutive mixtures with temporally correlated sources , 1997 .

[2] Mohamed Sahmoudi,et al. Blind Separation of Convolutive Mixtures using Nonstationarity and Fractional Lower Order Statistics (FLOS): Application to Audio Signals , 2006, Fourth IEEE Workshop on Sensor Array and Multichannel Processing, 2006..

[3] Eric Moulines,et al. A blind source separation technique using second-order statistics , 1997, IEEE Trans. Signal Process..

[4] Hiroshi Sawada,et al. A robust and precise method for solving the permutation problem of frequency-domain blind source separation , 2004, IEEE Transactions on Speech and Audio Processing.

[5] S. Araki,et al. MLSP 2007 Data Analysis Competition: Frequency-Domain Blind Source Separation for Convolutive Mixtures of Speech/Audio Signals , 2007, 2007 IEEE Workshop on Machine Learning for Signal Processing.

[6] Pierre Comon,et al. Enhanced Line Search: A Novel Method to Accelerate PARAFAC , 2008, SIAM J. Matrix Anal. Appl..

[7] Birger Kollmeier,et al. Amplitude Modulation Decorrelation For Convolutive Blind Source Separation , 2000 .

[8] Lucas C. Parra,et al. On-line Convolutive Blind Source Separation of Non-Stationary Signals , 2000, J. VLSI Signal Process..

[9] V. Michael Bove,et al. Blind Separation Of Real World Audio Signals Using Overdetermined Mixtures , 1999 .

[10] Dinh-Tuan Pham,et al. Blind separation of instantaneous mixtures of nonstationary sources , 2001, IEEE Trans. Signal Process..

[11] A. Stegeman,et al. On Kruskal's uniqueness condition for the Candecomp/Parafac decomposition , 2007 .

[12] Rasmus Bro,et al. A comparison of algorithms for fitting the PARAFAC model , 2006, Comput. Stat. Data Anal..

[13] Christine Serviere,et al. BLIND SEPARATION OF CONVOLUTIVE AUDIO MIXTURES USING NONSTATIONARITY , 2003 .

[14] Nikos D. Sidiropoulos,et al. On the effectiveness of PARAFAC-based estimation for blind speech separation , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[15] Shoko Araki,et al. ON-LINE TIME-DOMAIN BLIND SOURCE SEPARATION OF NONSTATIONARY CONVOLVED SIGNALS , 2003 .

[16] James P. Reilly,et al. A frequency domain method for blind source separation of convolutive audio mixtures , 2005, IEEE Transactions on Speech and Audio Processing.

[17] Nikos D. Sidiropoulos,et al. Parallel factor analysis in sensor array processing , 2000, IEEE Trans. Signal Process..

[18] Richard A. Harshman,et al. Foundations of the PARAFAC procedure: Models and conditions for an "explanatory" multi-model factor analysis , 1970 .

[19] J. Cardoso,et al. Blind beamforming for non-gaussian signals , 1993 .

[20] Pierre Comon,et al. Blind identification and source separation in 2×3 under-determined mixtures , 2004, IEEE Trans. Signal Process..

[21] Lucas C. Parra,et al. Convolutive blind separation of non-stationary sources , 2000, IEEE Trans. Speech Audio Process..

[22] Nikolaos Mitianoudis,et al. Audio source separation of convolutive mixtures , 2003, IEEE Trans. Speech Audio Process..

[23] L. Lathauwer,et al. Dimensionality reduction in higher-order signal processing and rank-(R1,R2,…,RN) reduction in multilinear algebra , 2004 .

[24] Nikos D. Sidiropoulos,et al. Blind PARAFAC receivers for DS-CDMA systems , 2000, IEEE Trans. Signal Process..

[25] L. Lathauwer,et al. An enhanced plane search scheme for complex-valued tensor decompositions , 2010 .

[26] Andreas Ziehe,et al. An approach to blind source separation based on temporal structure of speech signals , 2001, Neurocomputing.

[27] Hiroshi Sawada,et al. Frequency-Domain Blind Source Separation of Many Speech Signals Using Near-Field and Far-Field Models , 2006, EURASIP J. Adv. Signal Process..

[28] David E. Booth,et al. Multi-Way Analysis: Applications in the Chemical Sciences , 2005, Technometrics.

[29] Nikos D. Sidiropoulos,et al. Adaptive Algorithms to Track the PARAFAC Decomposition of a Third-Order Tensor , 2009, IEEE Transactions on Signal Processing.

[30] Lieven De Lathauwer,et al. Tensor-based techniques for the blind separation of DS-CDMA signals , 2007, Signal Process..

[31] Jont B. Allen,et al. Image method for efficiently simulating small‐room acoustics , 1976 .

[32] M.G. Bellanger,et al. Digital processing of speech signals , 1980, Proceedings of the IEEE.

[33] Dinh-Tuan Pham,et al. Blind separation of speech mixtures based on nonstationarity , 2003, Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings..

[34] P. Kroonenberg. Applied Multiway Data Analysis , 2008 .

[35] Dinh-Tuan Pham,et al. Permutation Correction in the Frequency Domain in Blind Separation of Speech Mixtures , 2006, EURASIP J. Adv. Signal Process..

[36] J. Kruskal. Three-way arrays: rank and uniqueness of trilinear decompositions, with application to arithmetic complexity and statistics , 1977 .

[37] Lieven De Lathauwer,et al. Blind Identification of Underdetermined Mixtures by Simultaneous Matrix Diagonalization , 2008, IEEE Transactions on Signal Processing.

[38] Simon Haykin,et al. Development of a flexible, realistic hearing in noise test environment (R-HINT-E) , 2004, Signal Process..

[39] Arie Yeredor,et al. Non-orthogonal joint diagonalization in the least-squares sense with application in blind source separation , 2002, IEEE Trans. Signal Process..

[40] Arogyaswami Paulraj,et al. An analytical constant modulus algorithm , 1996, IEEE Trans. Signal Process..

[41] K. Matsuoka,et al. Minimal distortion principle for blind source separation , 2002, Proceedings of the 41st SICE Annual Conference. SICE 2002..

[42] L. Lathauwer,et al. Sufficient conditions for uniqueness in Candecomp/Parafac and Indscal with random component matrices , 2006, Psychometrika.

[43] Miguel A. Carreira-Perpinan,et al. Dimensionality Reduction , 2011 .

[44] Philippe Loubaton,et al. Subspace based techniques for second order blind separation of convolutive mixtures with temporally correlated sources , 1997 .

[45] R. Bro. PARAFAC. Tutorial and applications , 1997 .

[46] Rasmus Bro,et al. Multi-way Analysis with Applications in the Chemical Sciences , 2004 .

[47] Lucas C. Parra,et al. A SURVEY OF CONVOLUTIVE BLIND SOURCE SEPARATION METHODS , 2007 .

[48] Lieven De Lathauwer,et al. A Link between the Canonical Decomposition in Multilinear Algebra and Simultaneous Matrix Diagonalization , 2006, SIAM J. Matrix Anal. Appl..

[49] Lieven De Lathauwer,et al. An enhanced line search scheme for complex-valued tensor decompositions. Application in DS-CDMA , 2008, Signal Process..

[50] Michael Greenacre,et al. Multiway data analysis , 1992 .

[51] Nikos D. Sidiropoulos,et al. Blind Speech Separation Using Parafac Analysis and Integer Least Squares , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.