New approaches for solving permutation indeterminacy and scaling ambiguity in frequency domain separation of convolved mixtures

Permutation indeterminacy and scaling ambiguity occur in ICA and they are particularly problematic in time-frequency domain separation of convolutive mixtures. The quality of separation is severely degraded if these two problems are not well addressed. In this paper, we propose new approaches to solve the permutation indeterminacy and scaling ambiguity in the separation of convolutive mixture in frequency domain. We first apply Short Time Fourier Transform to the observed signals in order to transform the convolutive mixing in time domain to instantaneous mixing in time-frequency domain. A fixed-point algorithm with test of saddle point is adopted to derive the separated components in each frequency bin. To solve the permutation problem,we propose a new matching algorithm for this purpose. First we use discrete Haar Wavelet Transform to extract the feature vectors from the magnitude waveforms of the separated components and use Singular Value Decomposition to achieve dimension reduction. The permutation problem is solved by clustering the feature vectors using the new matching algorithm which is a combination of basic K-means and Hungarian algorithm. To solve the scaling ambiguity problem, we treat it as an overcomplete problem and realize it by maximizing the posterior of the scaling factor. Finally, experiments are conducted using benchmark data to present the effectiveness and performance of our proposed algorithms.

[1]  Zbynek Koldovský,et al.  Blind Instantaneous Noisy Mixture Separation with Best Interference-Plus-Noise Rejection , 2007, ICA.

[2]  Hiroshi Sawada,et al.  A robust and precise method for solving the permutation problem of frequency-domain blind source separation , 2004, IEEE Transactions on Speech and Audio Processing.

[3]  Terrence J. Sejnowski,et al.  An Information-Maximization Approach to Blind Separation and Blind Deconvolution , 1995, Neural Computation.

[4]  Shiro Ikeda,et al.  A METHOD OF ICA IN TIME-FREQUENCY DOMAIN , 2003 .

[5]  Angelo Ciaramella,et al.  Amplitude and permutation indeterminacies in frequency domain convolved ICA , 2003, Proceedings of the International Joint Conference on Neural Networks, 2003..

[6]  Paris Smaragdis,et al.  Blind separation of convolved mixtures in the frequency domain , 1998, Neurocomputing.

[7]  Te-Won Lee,et al.  On the Assumption of Spherical Symmetry and Sparseness for the Frequency-Domain Speech Model , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  Terrence J. Sejnowski,et al.  Learning Overcomplete Representations , 2000, Neural Computation.

[9]  Andrzej Cichocki,et al.  Neural networks for blind decorrelation of signals , 1997, IEEE Trans. Signal Process..

[10]  Ada Wai-Chee Fu,et al.  Efficient time series matching by wavelets , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[11]  Te-Won Lee,et al.  Blind Source Separation Exploiting Higher-Order Frequency Dependencies , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[12]  Christopher John Long,et al.  Wavelet based feature extraction for phoneme recognition , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[13]  Adriana Dapena,et al.  A novel frequency domain approach for separating convolutive mixtures of temporally-white signals , 2003, Digit. Signal Process..

[14]  Sushant Sachdeva,et al.  Dimension Reduction , 2008, Encyclopedia of GIS.

[15]  Minje Kim,et al.  ICA-Based Clustering for Resolving Permutation Ambiguity in Frequency-Domain Convolutive Source Separation , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[16]  Hiroshi Sawada,et al.  Blind Extraction of Dominant Target Sources Using ICA and Time-Frequency Masking , 2006, IEEE Transactions on Audio, Speech, and Language Processing.