论文信息 - Soft Nonnegative Matrix Co-Factorization

Soft Nonnegative Matrix Co-Factorization

This work introduces a new framework for nonnegative matrix factorization (NMF) in multisensor or multimodal data configurations, where taking into account the mutual dependence that exists between the related parallel streams of data is expected to improve performance. In contrast with previous works that focused on co-factorization methods -where some factors are shared by the different modalities-we propose a soft co-factorization scheme which accounts for possible local discrepancies across modalities or channels. This objective is formalized as an optimization problem where concurrent factorizations are jointly performed while being tied by a coupling term that penalizes differences between the related factor matrices associated with different modalities. We provide majorization-minimization (MM) algorithms for three common measures of fit-the squared Euclidean norm, the Kullback-Leibler divergence and the Itakura-Saito divergence-and two possible coupling variants, using either the l1 or the squared Euclidean norm of differences. The approach is shown to achieve promising performance in two audio-related tasks: multimodal speaker diarization using audiovisual data and audio source separation using stereo data.

[1] D. Hunter,et al. A Tutorial on MM Algorithms , 2004 .

[2] Sylvain Meignier,et al. LIUM SPKDIARIZATION: AN OPEN SOURCE TOOLKIT FOR DIARIZATION , 2010 .

[3] Minje Kim,et al. Nonnegative matrix partial co-factorization for drum source separation , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[4] Geoffrey J. Gordon,et al. Relational learning via collective matrix factorization , 2008, KDD.

[5] Andreas Ziehe,et al. The 2011 Signal Separation Evaluation Campaign (SiSEC2011): - Audio Source Separation - , 2012, LVA/ICA.

[6] Hagai Attias,et al. Topic regression multi-modal Latent Dirichlet Allocation for image annotation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[7] S. Amari,et al. Nonnegative Matrix and Tensor Factorization [Lecture Notes] , 2008, IEEE Signal Processing Magazine.

[8] Andrzej Cichocki,et al. Nonnegative Matrix and Tensor Factorization T , 2007 .

[9] Naoto Yokoya,et al. Coupled Nonnegative Matrix Factorization Unmixing for Hyperspectral and Multispectral Data Fusion , 2012, IEEE Transactions on Geoscience and Remote Sensing.

[10] Seungjin Choi,et al. Matrix Co-Factorization on Compressed Sensing , 2011, IJCAI.

[11] Olivier Cappé,et al. Soft nonnegative matrix co-factorizationwith application to multimodal speaker diarization , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[12] Andrew Zisserman,et al. Image Classification using Random Forests and Ferns , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[13] Rémi Gribonval,et al. Performance measurement in blind audio source separation , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[14] Ali Taylan Cemgil,et al. Nonnegative matrix factorizations as probabilistic inference in composite models , 2009, 2009 17th European Signal Processing Conference.

[15] Luo Si,et al. Matrix co-factorization for recommendation with rich side information and implicit feedback , 2011, HetRec '11.

[16] Olivier Cappé,et al. Piecewise constant nonnegative matrix factorization , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[17] Nancy Bertin,et al. Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis , 2009, Neural Computation.

[18] Jonathon Shlens. I T ] 8 A pr 2 01 4 Notes on Kullback-Leibler Divergence and Likelihood Theory , 2007 .

[19] Slim Essid,et al. Smooth Nonnegative Matrix Factorization for Unsupervised Audiovisual Document Structuring , 2013, IEEE Transactions on Multimedia.

[20] Yanhua Chen,et al. Non-Negative Matrix Factorization for Semisupervised Heterogeneous Data Coclustering , 2010, IEEE Transactions on Knowledge and Data Engineering.

[21] Seungjin Choi,et al. Group Nonnegative Matrix Factorization for EEG Classification , 2009, AISTATS.

[22] D. Fitzgerald,et al. Using Tensor Factorisation Models to Separate Drums from Polyphonic Music , 2009 .

[23] Tamara G. Kolda,et al. All-at-once Optimization for Coupled Matrix and Tensor Factorizations , 2011, ArXiv.

[24] Tuomas Virtanen,et al. Monaural Sound Source Separation by Nonnegative Matrix Factorization With Temporal Continuity and Sparseness Criteria , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[25] Ali Taylan Cemgil,et al. Generalised Coupled Tensor Factorisation , 2011, NIPS.

[26] Tom F. Wilderjans,et al. Computational Statistics and Data Analysis Simultaneous Analysis of Coupled Data Blocks Differing in Size: a Comparison of Two Weighting Schemes , 2022 .

[27] Nicholas W. D. Evans,et al. Speaker Diarization: A Review of Recent Research , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[28] Fillia Makedon,et al. Learning from Incomplete Ratings Using Non-negative Matrix Factorization , 2006, SDM.

[29] Gwenn Englebienne,et al. Multimodal Speaker Diarization , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30] Alexey Ozerov,et al. Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[31] Alessandro Vinciarelli,et al. Canal9: A database of political debates for analysis of social interactions , 2009, 2009 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops.

[32] Slim Essid,et al. A Multimodal Approach to Speaker Diarization on TV Talk-Shows , 2013, IEEE Transactions on Multimedia.

[33] Alexey Ozerov,et al. Text-informed audio source separation using nonnegative matrix partial co-factorization , 2013, 2013 IEEE International Workshop on Machine Learning for Signal Processing (MLSP).

[34] H. Sebastian Seung,et al. Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[35] Jérôme Idier,et al. Algorithms for Nonnegative Matrix Factorization with the β-Divergence , 2010, Neural Computation.

[36] Deepak Agarwal,et al. Regression-based latent factor models , 2009, KDD.