Context Sensitive Topic Models for Author Influence in Document Networks

In a document network, such as a citation network of scientific documents or weblogs, the content that authors produce reflects their interest in certain topics. In addition, some authors influence the interests of other authors. In this work, we propose to model the influence of cited authors together with the interests of citing authors. Moreover, we hypothesize that, beyond the citations present in documents, the context surrounding each citation mention provides extra topical information about the cited authors. However, associating terms in the context with the cited authors remains an open problem. We propose novel document generation schemes that incorporate this context while simultaneously modeling the interests of citing authors and the influence of cited authors. Our experiments show significant improvements over baseline models on several evaluation criteria, such as link prediction between a document and a cited author, and quantitatively explaining unseen text.
