Subband Based Blind Source Separation

In this chapter, we address subband-based blind source separation (BSS) for convolutive mixtures of speech by reporting a large number of experimental results. The subband-based BSS approach offers a compromise between time-domain and frequency-domain techniques. The former is usually difficult and slow with many separation filter coefficients to estimate. With the latter it is difficult to estimate statistics when the adaptation data length is insufficient. With subband-based BSS, a sufficient number of samples for estimating statistics can be held in each subband by using a moderate number of subbands. Moreover, by using FIR filters in each subband, which are shorter than the filters used for time-domain BSS, we can handle long reverberation. In addition, subband-based BSS allows us to select the separation method suited to each subband. Using this advantage, we introduce efficient separation procedures that take both the frequency characteristics of the room reverberation and speech signals into consideration. In concrete terms, longer separation filters and an overlap-blockshift in BSS’s batch adaptation in low frequency bands improve the separation performance. Consequently, frequency-dependent subband processing is successfully realized with subband-based BSS.

[1]  Shoji Makino,et al.  SSB subband echo canceller using low-order projection algorithm , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[2]  Hiroshi Sawada,et al.  Evaluation of separation and dereverberation performance in frequency domain blind source separation , 2004 .

[3]  Kazuya Takeda,et al.  Evaluation of blind signal separation method using directivity pattern under reverberant conditions , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[4]  James P. Reilly,et al.  A new fast-converging method for blind source separation of speech signals in acoustic environments , 2003, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684).

[5]  Te-Won Lee,et al.  Blind Separation of Delayed and Convolved Sources , 1996, NIPS.

[6]  Walter Kellermann,et al.  Blind Source Separation for Convolutive Mixtures: A Unified Treatment , 2004 .

[7]  Shoko Araki,et al.  Equivalence between frequency domain blind source separation and frequency domain adaptive null beamformers , 2001, INTERSPEECH.

[8]  Yunxin Zhao,et al.  Subband-based adaptive decorrelation filtering for co-channel speech separation , 2000, IEEE Trans. Speech Audio Process..

[9]  Shoko Araki,et al.  Equivalence between Frequency-Domain Blind Source Separation and Frequency-Domain Adaptive Beamforming for Convolutive Mixtures , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  Hiroshi Sawada,et al.  Direction of arrival estimation for multiple source signals using independent component analysis , 2003, Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings..

[11]  Hiroshi Sawada,et al.  Polar coordinate based nonlinear function for frequency-domain blind source separation , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[12]  Kiyohiro Shikano,et al.  MULTISTAGE ICA FOR BLIND SOURCE SEPARATION OF REAL ACOUSTIC CONVOLUTIVE MIXTURE , 2003 .

[13]  Shiro Ikeda,et al.  A METHOD OF ICA IN TIME-FREQUENCY DOMAIN , 2003 .

[14]  M. Portnoff,et al.  Implementation of the digital phase vocoder using the fast Fourier transform , 1976 .

[15]  Xiaoan Sun,et al.  A NATURAL GRADIENT CONVOLUTIVE BLIND SOURCE SEPARATION ALGORITHM FOR SPEECH MIXTURES , 2001 .

[16]  S.C. Douglas,et al.  Multichannel blind deconvolution and equalization using the natural gradient , 1997, First IEEE Signal Processing Workshop on Signal Processing Advances in Wireless Communications.

[17]  Hiroshi Sawada,et al.  A robust approach to the permutation problem of frequency-domain blind source separation , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[18]  Te-Won Lee,et al.  Independent Component Analysis , 1998, Springer US.

[19]  Terrence J. Sejnowski,et al.  An Information-Maximization Approach to Blind Separation and Blind Deconvolution , 1995, Neural Computation.

[20]  K. Matsuoka,et al.  Minimal distortion principle for blind source separation , 2002, Proceedings of the 41st SICE Annual Conference. SICE 2002..

[21]  Shoko Araki,et al.  Subband Based Blind Source Separation with Appropriate Processing for Each Frequency Band , 2003 .

[22]  Shoko Araki,et al.  Blind Source Separation for Convolutive Mixtures of Speech using Subband processing , 2002 .

[23]  Benoît Champagne,et al.  Fast convolutive blind speech separation via subband adaptation , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[24]  Shoko Araki,et al.  Time domain blind source separation of non-stationary convolved signals by utilizing geometric beamforming , 2002, Proceedings of the 12th IEEE Workshop on Neural Networks for Signal Processing.

[25]  Scott C. Douglas,et al.  Blind Separation of Acoustic Signals , 2001, Microphone Arrays.

[26]  Shoko Araki,et al.  The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech , 2003, IEEE Trans. Speech Audio Process..

[27]  K. Shikano,et al.  Blind Source Separation of Acoustic Signals Based on Multistage ICA Combining Frequency-Domain ICA and Time-Domain ICA , 2003, IEICE Trans. Fundam. Electron. Commun. Comput. Sci..

[28]  R.J. Mammone,et al.  Fast converging subband acoustic echo cancellation using RAP on the WE DSP16A , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[29]  Sven Nordholm,et al.  Blind signal separation using overcomplete subband representation , 2001, IEEE Trans. Speech Audio Process..

[30]  Hiroshi Sawada,et al.  Blind source separation with different sensor spacing and filter length for each frequency range , 2002, Proceedings of the 12th IEEE Workshop on Neural Networks for Signal Processing.

[31]  Noboru Ohnishi,et al.  A method of blind separation for convolved non-stationary signals , 1998, Neurocomputing.

[32]  E. Oja,et al.  Independent Component Analysis , 2013 .

[33]  Shoko Araki,et al.  Subband based blind source separation for convolutive mixtures of speech , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[34]  Paris Smaragdis,et al.  Blind separation of convolved mixtures in the frequency domain , 1998, Neurocomputing.

[35]  Andreas Ziehe,et al.  An approach to blind source separation based on temporal structure of speech signals , 2001, Neurocomputing.

[36]  Kazuya Takeda,et al.  Blind source separation combining frequency-domain ICA and beamforming , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[37]  Dennis R. Morgan,et al.  Exploring permutation inconsistency in blind separation of speech signals in a reverberant environment , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).