Optimal weight learning for Coupled Tensor Factorization with mixed divergences

Incorporating domain-specific side information via coupled factorization models is useful in source separation applications. Coupled models can easily incorporate information from source modalities with different statistical properties by estimating shared factors via divergence minimization. Here, it is useful to use mixed divergences, a specific divergence for each modality. However, this extra freedom requires choosing the correct divergence as well as an optimal weighting mechanism to select the relative `importance'. In this paper, we present an approach for determining the relative weights, framed as dispersion parameter estimation, based on an inference framework introduced previously as Generalized Coupled Tensor Factorization (GCTF). The dispersion parameters play a key role on inference as they form a balance between the information obtained from multimodal observations. We tackle the problem of optimal weighting by maximum likelihood exploiting the relation between β-divergences and the family of Tweedie distributions. We demonstrate the usefulness of our approach on a drum source separation application.

[1]  Arindam Banerjee,et al.  Multi-way Clustering on Relation Graphs , 2007, SDM.

[2]  Nicolas Sturmel,et al.  Informed Source Separation Using Iterative Reconstruction , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  Ali Taylan Cemgil,et al.  Score guided musical source separation using Generalized Coupled Tensor Factorization , 2012, 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO).

[4]  Tom F. Wilderjans,et al.  Computational Statistics and Data Analysis Simultaneous Analysis of Coupled Data Blocks Differing in Size: a Comparison of Two Weighting Schemes , 2022 .

[5]  Antoine Liutkus,et al.  Coding-Based Informed Source Separation: Nonnegative Tensor Factorization Approach , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[6]  Minje Kim,et al.  Nonnegative Matrix Partial Co-Factorization for Spectral and Temporal Drum Source Separation , 2011, IEEE Journal of Selected Topics in Signal Processing.

[7]  Gordon K. Smyth,et al.  Series evaluation of Tweedie exponential dispersion model densities , 2005, Stat. Comput..

[8]  Ananda Sen,et al.  The Theory of Dispersion Models , 1997, Technometrics.

[9]  Tamara G. Kolda,et al.  All-at-once Optimization for Coupled Matrix and Tensor Factorizations , 2011, ArXiv.

[10]  Ali Taylan Cemgil,et al.  Link Prediction via Generalized Coupled Tensor Factorisation , 2012, ArXiv.

[11]  Ali Taylan Cemgil,et al.  Alpha/Beta Divergences and Tweedie Models , 2012, ArXiv.

[12]  Nancy Bertin,et al.  Nonnegative Matrix Factorization with the Itakura-Saito Divergence: With Application to Music Analysis , 2009, Neural Computation.

[13]  R. A. van den Berg,et al.  Simultaneous analysis of coupled data matrices subject to different amounts of noise. , 2011, The British journal of mathematical and statistical psychology.

[14]  Pierre Comon,et al.  Handbook of Blind Source Separation: Independent Component Analysis and Applications , 2010 .

[15]  Andrzej Cichocki,et al.  Nonnegative Matrix and Tensor Factorization T , 2007 .