论文信息 - Perceptual single-channel audio source separation by non-negative matrix factorization

Perceptual single-channel audio source separation by non-negative matrix factorization

This paper proposes a single-channel audio source decomposition method that integrates perceptual quality criteria into source separation. Unlike the existing methods, the proposed method applies a perceptually weighted non-negative matrix factorization on log-frequency spectrogram of the mixed signal. The weights are adaptively calculated for each critical band based on a perceptual model described by ITU-R BS. 1387 perceptual quality standard. It is shown that the proposed adaptive weighting scheme significantly improves the quality of audio sources estimated by minimizing the weighted divergence between the observed log-frequency spectrogram and the model.

Bilge Gunsel | Serap Kirbiz | S. Kırbız | B. Gunsel

[1] Tuomas Virtanen,et al. Monaural Sound Source Separation by Nonnegative Matrix Factorization With Temporal Continuity and Sparseness Criteria , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[2] H. Sebastian Seung,et al. Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[3] P. Smaragdis,et al. Independent component analysis for automatic note extraction from musical trills. , 2004, The Journal of the Acoustical Society of America.

[4] Paris Smaragdis,et al. Non-negative Matrix Factor Deconvolution; Extraction of Multiple Sound Sources from Monophonic Inputs , 2004, ICA.

[5] Morten Mørup,et al. Nonnegative Matrix Factor 2-D Deconvolution for Blind Single Channel Source Separation , 2006, ICA.

[6] Hugo Fastl,et al. Psychoacoustics: Facts and Models , 1990 .

[7] T. Virtanen. Monaural Sound Source Separation by Perceptually Weighted Non-Negative Matrix Factorization , 2003 .