Paper information - Relaxation of rank-1 spatial constraint in overdetermined blind source separation

In this paper, we propose a new algorithm for overdetermined blind source separation (BSS), which enables us to achieve good separation performance even for signals recorded in a reverberant environment. The proposed algorithm utilizes extra observations (channels) in overdetermined BSS to estimate both direct and reverberant components of each source. This approach can relax the rank-1 spatial constraint, which corresponds to the assumption of a linear time-invariant mixing system. To confirm the efficacy of the proposed algorithm, we apply the relaxation of the rank-1 spatial constraint to conventional BSS techniques. The experimental results show that the proposed algorithm can avoid the degradation of separation performance for reverberant signals in some cases.
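
For readers unfamiliar with the term, the rank-1 spatial constraint mentioned in the abstract can be sketched as follows. The notation is generic frequency-domain BSS notation and is an assumption of this note, not taken from the paper: $i$ is the frequency bin, $j$ the time frame, $M$ the number of channels, $N$ the number of sources, $\boldsymbol{a}_{i,n}$ the steering vector of source $n$, and $s_{ij,n}$ its short-time Fourier transform coefficient.

```latex
% Assumed generic notation (not from the paper): linear time-invariant
% mixing, with the impulse response shorter than the STFT window.
\begin{align}
  \boldsymbol{x}_{ij} &= \sum_{n=1}^{N} \boldsymbol{a}_{i,n}\, s_{ij,n},
    \qquad \boldsymbol{x}_{ij},\ \boldsymbol{a}_{i,n} \in \mathbb{C}^{M}, \\
  \mathbf{R}_{ij,n}
    &= \mathbb{E}\!\left[ \left(\boldsymbol{a}_{i,n} s_{ij,n}\right)
                          \left(\boldsymbol{a}_{i,n} s_{ij,n}\right)^{\mathsf{H}} \right]
     = \mathbb{E}\!\left[ |s_{ij,n}|^{2} \right]
       \boldsymbol{a}_{i,n} \boldsymbol{a}_{i,n}^{\mathsf{H}},
    \qquad \operatorname{rank} \mathbf{R}_{ij,n} = 1 .
\end{align}
```

Under this model each source contributes a spatial covariance of rank 1. When reverberation is longer than the analysis window, the contribution of a source within one bin is no longer a single scaled vector, so its spatial covariance generally has rank greater than 1. In the overdetermined case ($M > N$) there are more channels than sources, and the abstract describes using these extra observations to estimate the reverberant components in addition to the direct ones.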
