Semi-blind disjoint non-negative matrix factorization for extracting target source from single channel noisy mixture

We present a semi-blind non-negative matrix factorization(NMF) approach to suppress interference noise signals from a single channel mixture signal. By enforcing a disjointness constraint into the NMF error criterion under the semi-blind denoising framework, it is possible to decompose the mixture spectrogram into target and noise components by minimizing overlaps in the time-frequency domain. Experimental results show that the proposed semi-blind disjoint NMF algorithm can significantly suppress non-stationary noise components in the noisy mixture.

[1]  Bhiksha Raj,et al.  Speech denoising using nonnegative matrix factorization with priors , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[2]  Israel Cohen,et al.  Speech enhancement for non-stationary noise environments , 2001, Signal Process..

[3]  Levent M. Arslan Modified Wiener filtering , 2006, Signal Process..

[4]  Paris Smaragdis,et al.  Convolutive Speech Bases and Their Application to Supervised Speech Separation , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[5]  J. Larsen,et al.  Reduction of non-stationary noise using a non-negative latent variable decomposition , 2008, 2008 IEEE Workshop on Machine Learning for Signal Processing.

[6]  Jae-Hoon Jeong,et al.  Dominant speech enhancement based on SNR-adaptive soft mask filtering , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[7]  Özgür Yilmaz,et al.  On the approximate W-disjoint orthogonality of speech , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[8]  Rémi Gribonval,et al.  Oracle estimators for the benchmarking of source separation algorithms , 2007, Signal Process..

[9]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[10]  Ye Li,et al.  Speech Enhancement for Non-Stationary Noise Environments , 2009, 2009 International Conference on Information Engineering and Computer Science.

[11]  Akihiko Sugiyama,et al.  Single-Channel Impact-Noise Suppression with no Auxiliary Information for its Detection , 2007, 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[12]  Andrzej Cichocki,et al.  New Algorithms for Non-Negative Matrix Factorization in Applications to Blind Source Separation , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[13]  Jörn Anemüller,et al.  Maximization of Component Disjointness: A Criterion for Blind Source Separation , 2007, ICA.

[14]  Israel Cohen,et al.  Enhancement of Speech Signals Under Multiple Hypotheses using an Indicator for Transient Noise Presence , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[15]  Jonathan G. Fiscus,et al.  Darpa Timit Acoustic-Phonetic Continuous Speech Corpus CD-ROM {TIMIT} | NIST , 1993 .