Using information theoretic distance measures for solving the permutation problem of blind source separation of speech signals

Blind source separation (BSS) of convolved acoustic signals is of interest for many classes of applications. Because the mixing process is convolutive, the separation is commonly performed in the frequency domain, where independent component analysis (ICA) is applied separately in each frequency bin. Frequency-domain BSS, however, raises several major problems, one of which is the permutation problem: ICA returns the separated components of each bin in an arbitrary order, so this ambiguity must be resolved before each separated signal contains the frequency components of only one source signal. This article presents a class of methods for solving the permutation problem based on information theoretic distance measures. The proposed algorithms have been tested on real-room speech mixtures with a range of reverberation times, in conjunction with several ICA algorithms.
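To make the permutation problem concrete, the following is a minimal sketch of one way a distance measure between distributions could drive the alignment. It is an illustration under stated assumptions, not the article's algorithm: the abstract does not specify which measures or signal models are used, so the sketch assumes that each separated component is summarized by its non-negative amplitude envelope over time, that the envelope is normalized and treated as a discrete distribution, that the Jensen-Shannon divergence serves as the distance, and that bins are aligned sequentially against the previously aligned neighbouring bin. The names `js_divergence` and `align_permutations` are hypothetical.

```python
import itertools

import numpy as np


def js_divergence(p, q, eps=1e-12):
    """Jensen-Shannon divergence between two non-negative vectors.

    Both vectors are normalized to sum to one first (assumption: the
    amplitude envelope over time is treated as a discrete distribution).
    """
    p = p / p.sum()
    q = q / q.sum()
    m = 0.5 * (p + q)
    kl_pm = np.sum(p * np.log((p + eps) / (m + eps)))
    kl_qm = np.sum(q * np.log((q + eps) / (m + eps)))
    return 0.5 * (kl_pm + kl_qm)


def align_permutations(envelopes):
    """Align source order across frequency bins.

    envelopes: array of shape (n_bins, n_sources, n_frames) holding the
    amplitude envelopes of the bin-wise separated components.
    Returns (aligned, perms), where aligned[k, i] = envelopes[k, perms[k, i]].
    Bin 0 is taken as the reference ordering (an arbitrary choice).
    """
    n_bins, n_src, _ = envelopes.shape
    aligned = envelopes.copy()
    perms = np.zeros((n_bins, n_src), dtype=int)
    perms[0] = np.arange(n_src)
    for k in range(1, n_bins):
        ref = aligned[k - 1]  # previously aligned neighbouring bin
        best_perm, best_cost = None, np.inf
        # Exhaustive search over orderings; only practical for few sources.
        for perm in itertools.permutations(range(n_src)):
            cost = sum(
                js_divergence(ref[i], envelopes[k, perm[i]])
                for i in range(n_src)
            )
            if cost < best_cost:
                best_perm, best_cost = perm, cost
        perms[k] = best_perm
        aligned[k] = envelopes[k, list(best_perm)]
    return aligned, perms
```

Calling `align_permutations(envelopes)` on an array of bin-wise envelopes returns the reordered envelopes and the permutation chosen for each bin. Sequential alignment of this kind can propagate a single wrong decision to all later bins, which is one reason the choice of distance measure matters.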
