Interpretable Matrix Factorization with Stochasticity Constrained Nonnegative DEDICOM

Decomposition into Directed Components (DEDICOM) is a special matrix factorization technique to factorize a given asymmetric similarity matrix into a combination of a loading matrix describing the latent structures in the data and an asymmetric affinity matrix encoding the relationships between the found latent structures. Finding DEDICOM factors can be casted as a matrix norm minimization problem that requires alternating least square updates to find appropriate factors. Yet, due to the way DEDICOM reconstructs the data, unconstrained factors might yield results that are difficult to interpret. In this paper we derive a projection-free gradient descent based alternating least squares algorithm to calculate constrained DEDICOM factors. Our algorithm constrains the loading matrix to be column-stochastic and the affinity matrix to be nonnegative for more interpretable low rank representations. Additionally, unlike most of the available approximate solutions for finding the factors, our approach take into account the entire occurrences of the loading matrix during the optimization process to assure convergence. We evaluate our algorithm on a real world behavioral dataset containing pairwise asymmetric associations between variety of game titles.

[1]  C. Ji An Archetypal Analysis on , 2005 .

[2]  Christian Bauckhage,et al.  Beyond heatmaps: Spatio-temporal clustering using behavior-based partitioning of game levels , 2014, 2014 IEEE Conference on Computational Intelligence and Games.

[3]  A. Tversky Features of Similarity , 1977 .

[4]  Christian Bauckhage,et al.  User Churn Migration Analysis with DEDICOM , 2015, RecSys.

[5]  Christian Bauckhage,et al.  Large-Scale Cross-Game Player Behavior Analysis on Steam , 2021, AIIDE.

[6]  Roberto Turrin,et al.  Performance of recommender algorithms on top-n recommendation tasks , 2010, RecSys '10.

[7]  Christian Bauckhage,et al.  Archetypal Game Recommender Systems , 2014, LWA.

[8]  Hans-Peter Kriegel,et al.  A Three-Way Model for Collective Learning on Multi-Relational Data , 2011, ICML.

[9]  Christian Bauckhage,et al.  Predicting Retention in Sandbox Games with Tensor Factorization-based Representation Learning , 2016, 2016 IEEE Conference on Computational Intelligence and Games (CIG).

[10]  Christian Bauckhage,et al.  k-Means Clustering via the Frank-Wolfe Algorithm , 2016, LWDA.

[11]  Domonkos Tikk,et al.  Scalable Collaborative Filtering Approaches for Large Recommender Systems , 2009, J. Mach. Learn. Res..

[12]  Martin Jaggi,et al.  Revisiting Frank-Wolfe: Projection-Free Sparse Convex Optimization , 2013, ICML.

[13]  Bracha Shapira,et al.  Recommender Systems Handbook , 2015, Springer US.

[14]  Tamara G. Kolda,et al.  Temporal Analysis of Semantic Graphs Using ASALSAN , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[15]  Alla Rozovskaya,et al.  Using DEDICOM for Completely Unsupervised Part-of-Speech Tagging , 2009 .

[16]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[17]  Philip Wolfe,et al.  An algorithm for quadratic programming , 1956 .

[18]  Charles L. Lawson,et al.  Solving least squares problems , 1976, Classics in applied mathematics.