Relational learning via collective matrix factorization

Relational learning is concerned with predicting unknown values of a relation, given a database of entities and observed relations among those entities. An example of relational learning is movie rating prediction, where entities could include users, movies, genres, and actors. Relations encode users' ratings of movies, movies' genres, and actors' roles in movies. A common prediction technique given one pairwise relation, for example a #users × #movies ratings matrix, is low-rank matrix factorization. In domains with multiple relations, represented as multiple matrices, we may improve predictive accuracy by exploiting information from one relation while predicting another. To this end, we propose a collective matrix factorization model: we simultaneously factor several matrices, sharing parameters among factors whenever an entity participates in multiple relations. Since each relation can have a different value type and error distribution, we allow nonlinear relationships between the parameters and outputs, using Bregman divergences to measure error. We extend standard alternating projection algorithms to our model and derive an efficient Newton update for the projection. Furthermore, we propose stochastic optimization methods to handle large, sparse matrices. Our model generalizes several existing matrix factorization methods, and therefore yields new large-scale optimization algorithms for these problems. It can handle any pairwise relational schema and a wide variety of error models. We demonstrate its efficiency, as well as the benefit of sharing parameters among relations.
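To make the idea of sharing factors across relations concrete, the sketch below jointly factors two synthetic matrices that share the "movies" entity: a users-by-movies matrix X ≈ U Vᵀ and a movies-by-genres matrix Y ≈ V Zᵀ, with V shared between the two factorizations. It is a minimal illustration only: it uses the identity link and squared loss (one particular Bregman divergence) and plain alternating ridge-regression updates rather than the paper's per-row Newton step or stochastic optimization, and all names (X, Y, U, V, Z, rank, lam) are illustrative assumptions, not the paper's notation.

```python
# Minimal collective matrix factorization sketch: two relations sharing the
# "movies" entity, squared loss, alternating ridge-regression updates.
import numpy as np

rng = np.random.default_rng(0)
n_users, n_movies, n_genres, rank, lam = 50, 40, 8, 5, 0.1

# Synthetic observed relations (stand-ins for ratings and genre indicators).
X = rng.normal(size=(n_users, n_movies))   # users x movies
Y = rng.normal(size=(n_movies, n_genres))  # movies x genres

# Latent factors; V is shared because "movies" participates in both relations.
U = rng.normal(scale=0.1, size=(n_users, rank))
V = rng.normal(scale=0.1, size=(n_movies, rank))
Z = rng.normal(scale=0.1, size=(n_genres, rank))

def ridge_solve(A, B, lam):
    """Return W minimizing ||B - A W^T||_F^2 + lam ||W||_F^2."""
    G = A.T @ A + lam * np.eye(A.shape[1])
    return np.linalg.solve(G, A.T @ B).T

for it in range(50):
    # With V fixed, U and Z are each an ordinary ridge regression.
    U = ridge_solve(V, X.T, lam)   # fit X   ~ U V^T
    Z = ridge_solve(V, Y, lam)     # fit Y   ~ V Z^T
    # The shared factor V is fit against both relations at once by stacking
    # their rows: [X; Y^T] ~ [U; Z] V^T.
    A = np.vstack([U, Z])          # (n_users + n_genres) x rank
    B = np.vstack([X, Y.T])        # (n_users + n_genres) x n_movies
    V = ridge_solve(A, B, lam)
    loss = np.linalg.norm(X - U @ V.T) ** 2 + np.linalg.norm(Y - V @ Z.T) ** 2

print(f"final squared reconstruction error: {loss:.2f}")
```

Because V appears in both reconstruction terms, information in Y (e.g. genres) shapes the movie factors used to predict X (e.g. ratings); swapping in other Bregman divergences and link functions would follow the same alternating structure, with the closed-form ridge step replaced by a per-row Newton update.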
