Paper Information - Speech Source Separation in Convolutive Environments Using Space-Time-Frequency Analysis

We propose a new method for speech source separation based on directionally-disjoint estimation of the transfer functions between microphones and sources at different frequencies and at multiple times. The spatial transfer functions are estimated from eigenvectors of the microphones' correlation matrix. Transfer function parameters are smoothed and associated across frequencies by simultaneous extended Kalman filtering of the amplitude and phase estimates. This approach allows transfer function estimation even when the number of sources exceeds the number of microphones, and it works for both wideband and narrowband sources. The proposed method was evaluated in simulations, where it performed well.
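The eigenvector step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a single frequency bin in which only one source is active, so that the dominant eigenvector of the microphones' sample correlation matrix is proportional to that source's transfer function vector. The function names (`correlation_matrix`, `principal_eigenvector`, `relative_transfer`, `simulate`), the simulated data, and the use of power iteration in place of a full eigendecomposition are all assumptions made for illustration. The extended Kalman filtering across frequencies is not shown.

```python
import cmath
import random


def correlation_matrix(snapshots):
    """Sample spatial correlation matrix R = (1/T) * sum_t x_t x_t^H."""
    m = len(snapshots[0])
    R = [[0j] * m for _ in range(m)]
    for x in snapshots:
        for i in range(m):
            for j in range(m):
                R[i][j] += x[i] * x[j].conjugate()
    n = len(snapshots)
    return [[value / n for value in row] for row in R]


def principal_eigenvector(R, iters=200):
    """Power iteration for the dominant eigenvector of a Hermitian PSD matrix."""
    m = len(R)
    v = [1 + 0j] * m
    for _ in range(iters):
        w = [sum(R[i][j] * v[j] for j in range(m)) for i in range(m)]
        norm = sum(abs(c) ** 2 for c in w) ** 0.5
        v = [c / norm for c in w]
    return v


def relative_transfer(v):
    """Normalise so the first microphone is the reference (gain 1, phase 0)."""
    return [c / v[0] for c in v]


def simulate(true_h, n_frames=500, noise_std=0.05, seed=0):
    """Hypothetical data: one complex source coefficient per frame plus sensor noise."""
    rng = random.Random(seed)
    frames = []
    for _ in range(n_frames):
        s = complex(rng.gauss(0, 1), rng.gauss(0, 1))
        frames.append([
            h * s + complex(rng.gauss(0, noise_std), rng.gauss(0, noise_std))
            for h in true_h
        ])
    return frames


# Assumed ground-truth transfer function for three microphones in one bin.
true_h = [1 + 0j, 0.8 * cmath.exp(-1j * 0.6), 0.5 * cmath.exp(1j * 1.1)]
est_h = relative_transfer(
    principal_eigenvector(correlation_matrix(simulate(true_h)))
)

for k, (h, e) in enumerate(zip(true_h, est_h)):
    print(
        f"mic {k}: true gain {abs(h):.2f}, phase {cmath.phase(h):+.2f} rad; "
        f"estimated gain {abs(e):.2f}, phase {cmath.phase(e):+.2f} rad"
    )
```

Normalising by the first microphone removes the arbitrary complex scale of the eigenvector, which leaves a relative amplitude and phase per microphone. These are the kinds of quantities the abstract describes as being smoothed across frequencies.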