Relational Topic Factorization for Link Prediction in Document Networks

Link prediction is one of the fundamental problems in complex networks. In this paper, we focus on link prediction in document networks, in which nodes are text documents. We propose the relational topic factorization model (RTF), a model that combines topic models and matrix factorization. We also develop an efficient Monte Carlo EM algorithm for learning the parameters. Empirical results show that our model outperforms other state-of-the-art ones, and can give better understanding of the documents.

[1]  Hong Cheng,et al.  A model-based approach to attributed graph clustering , 2012, SIGMOD Conference.

[2]  David M. Blei,et al.  Relational Topic Models for Document Networks , 2009, AISTATS.

[3]  Jon Kleinberg,et al.  The link prediction problem for social networks , 2003, CIKM '03.

[4]  Ning Chen,et al.  Generalized Relational Topic Models with Data Augmentation , 2013, IJCAI.

[5]  Yifan Hu,et al.  Collaborative Filtering for Implicit Feedback Datasets , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[6]  Peter A. Flach,et al.  Evaluation Measures for Multi-class Subgroup Discovery , 2009, ECML/PKDD.

[7]  S. Wasserman,et al.  Logit models and logistic regressions for social networks: I. An introduction to Markov graphs andp , 1996 .

[8]  Chris H Wiggins,et al.  Bayesian approach to network modularity. , 2007, Physical review letters.

[9]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[10]  Deng Cai,et al.  Topic modeling with network regularization , 2008, WWW.

[11]  Cristopher Moore,et al.  Scalable text and link analysis with mixed-topic link models , 2013, KDD.

[12]  Thomas L. Griffiths,et al.  Discovering Latent Classes in Relational Data , 2004 .

[13]  William W. Cohen,et al.  Block-LDA: Jointly Modeling Entity-Annotated Text and Entity-Entity Links , 2014, Handbook of Mixed Membership Models and Their Applications.

[14]  Peter D. Hoff,et al.  Latent Space Approaches to Social Network Analysis , 2002 .

[15]  Ramesh Nallapati,et al.  Link-PLSA-LDA: A New Unsupervised Model for Topics and Influence of Blogs , 2021, ICWSM.

[16]  Jure Leskovec,et al.  Community Detection in Networks with Node Attributes , 2013, 2013 IEEE 13th International Conference on Data Mining.

[17]  Chong Wang,et al.  Collaborative topic modeling for recommending scientific articles , 2011, KDD.

[18]  Edoardo M. Airoldi,et al.  Mixed Membership Stochastic Blockmodels , 2007, NIPS.

[19]  Yan Liu,et al.  Topic-link LDA: joint models of topic and author community , 2009, ICML '09.

[20]  Charles Elkan,et al.  Link Prediction via Matrix Factorization , 2011, ECML/PKDD.