论文信息 - On the use of the beta divergence for musical source separation

On the use of the beta divergence for musical source separation

Non-negative Tensor Factorisation based methods have found use in the context of musical sound source separation. These techniques require the use of a suitable cost function to determine the optimal factorisation, and most work has focused on the use of the generalised Kullback-Liebler divergence, and more recently the Itakura-Saito divergence. These divergences can be regarded as limiting cases of the parameterised Beta divergence. This paper looks at the use of the Beta Divergence in the context of musical source separation with a view to determining an optimal value of Beta for this problem. This is considered for both magnitude and power spectrograms. In an effort to avoid potential local minima in the Beta divergence, the use of a "tempered" Beta Divergence is also explored. Also presented are the update equations for the generalised non-negative tensor factorisation model described in this paper which were previously unpublished. (6 pages)

[1] Nancy Bertin,et al. Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis , 2009, Neural Computation.

[2] Andrzej Cichocki,et al. Csiszár's Divergences for Non-negative Matrix Factorization: Family of New Algorithms , 2006, ICA.

[3] Roland Badeau,et al. A tempering approach for Itakura-Saito non-negative matrix factorization. With application to music transcription , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[4] P. Smaragdis,et al. Non-negative matrix factorization for polyphonic music transcription , 2003, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684).

[5] H. Sebastian Seung,et al. Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[6] Roger B. Dannenberg,et al. Remixing Stereo Music with Score-Informed Source Separation , 2006, ISMIR.

[7] Derry Fitzgerald,et al. Extended Nonnegative Tensor Factorisation Models for Musical Sound Source Separation , 2008, Comput. Intell. Neurosci..

[8] Rémi Gribonval,et al. Performance measurement in blind audio source separation , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[9] Raul Kompass,et al. A Generalized Divergence Measure for Nonnegative Matrix Factorization , 2007, Neural Computation.

[10] Derry Fitzgerald,et al. Musical Source Separation using Generalised Non-negative Tensor Factorisation Models , 2008 .