Interpretable Matrix Factorization with Stochastic Nonnegative DEDICOM

Decomposition into Directed Components (DEDICOM) is a special matrix factorization technique to factorize a given asymmetric similarity matrix into a combination of a loading matrix describing the latent structures in the data and an asymmetric affinity matrix encoding the relationships between the found latent structures. Finding DEDICOM factors can be cast as a matrix norm minimization problem that requires alternating least square updates to find appropriate factors. Yet, due to the way DEDICOM reconstructs the data, unconstrained factors might yield results that are difficult to interpret. In this paper we derive a projection-free gradient descent based alternating least squares algorithm to calculate constrained DEDICOM factors. Our algorithm constrains the loading matrix to be column-stochastic and the affinity matrix to be nonnegative for more interpretable low rank representations. Additionally, unlike most of the available approximate solutions for finding the loading matrix, our approach takes the entire occurrences of the loading matrix into account to assure convergence. We evaluate our algorithm on a behavioral dataset containing pairwise asymmetric associations between variety of game titles from an online platform.

[1]  C. Ji An Archetypal Analysis on , 2005 .

[2]  Bracha Shapira,et al.  Recommender Systems Handbook , 2015, Springer US.

[3]  Lior Rokach,et al.  Recommender Systems Handbook , 2010 .

[4]  Roberto Turrin,et al.  Performance of recommender algorithms on top-n recommendation tasks , 2010, RecSys '10.

[5]  A. Tversky Features of Similarity , 1977 .

[6]  Christian Bauckhage,et al.  User Churn Migration Analysis with DEDICOM , 2015, RecSys.

[7]  Hans-Peter Kriegel,et al.  A Three-Way Model for Collective Learning on Multi-Relational Data , 2011, ICML.

[8]  Philip Wolfe,et al.  An algorithm for quadratic programming , 1956 .

[9]  Christian Bauckhage,et al.  Large-Scale Cross-Game Player Behavior Analysis on Steam , 2021, AIIDE.

[10]  Tamara G. Kolda,et al.  Temporal Analysis of Semantic Graphs Using ASALSAN , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[11]  Charles L. Lawson,et al.  Solving least squares problems , 1976, Classics in applied mathematics.

[12]  Christian Bauckhage,et al.  k-Means Clustering via the Frank-Wolfe Algorithm , 2016, LWDA.

[13]  Martin Jaggi,et al.  Revisiting Frank-Wolfe: Projection-Free Sparse Convex Optimization , 2013, ICML.

[14]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[15]  Christian Bauckhage,et al.  Archetypal Game Recommender Systems , 2014, LWA.

[16]  Christian Bauckhage,et al.  Beyond heatmaps: Spatio-temporal clustering using behavior-based partitioning of game levels , 2014, 2014 IEEE Conference on Computational Intelligence and Games.

[17]  Christian Bauckhage,et al.  Predicting Retention in Sandbox Games with Tensor Factorization-based Representation Learning , 2016, 2016 IEEE Conference on Computational Intelligence and Games (CIG).

[18]  Alla Rozovskaya,et al.  Using DEDICOM for Completely Unsupervised Part-of-Speech Tagging , 2009 .