The Author-Topic-Community Model: A Generative Model Relating Authors' Interests and Their Community Structure

In this paper, we introduce a generative model named Author-Topic-Community (ATC) model which can infer authors’ interests and their community structure at the same time based on the contents and citation information of a document corpus. Via the mutual promotion between the author topics and the author community structure introduced in the ATC model, the robustness of the model towards cases with spare citation information can be enhanced. Variational inference is adopted to estimate the model parameters of ATC. We performed evaluation using both synthetic data as well as a real dataset which contains SIGKDD and SIGMOD papers published in 10 years. By constrasting the performance of ATC with some state-of-the-art methods which model authors’ interests and their community structure separately, our experimental results show that 1) the ATC model with the inference of the authors’ interests and the community structure integrated can improve the accuracy of author topic modeling and that of author community discovery; and 2) more in-depth analysis of the authors’ influence can be readily supported.

[1]  Ramesh Nallapati,et al.  Joint latent topic models for text and citations , 2008, KDD.

[2]  Takenao Ohkawa,et al.  Entity Network Prediction Using Multitype Topic Models , 2008, IEICE Trans. Inf. Syst..

[3]  Lise Getoor,et al.  A Latent Dirichlet Model for Unsupervised Entity Resolution , 2005, SDM.

[4]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[5]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[6]  David M. Blei,et al.  Relational Topic Models for Document Networks , 2009, AISTATS.

[7]  Dan Roth,et al.  Citation Author Topic Model in Expert Search , 2010, COLING.

[8]  Dave Cliff,et al.  Human-Agent Auction Interactions: Adaptive-Aggressive Agents Dominate , 2011, IJCAI.

[9]  Tom Minka,et al.  Expectation Propagation for approximate Bayesian inference , 2001, UAI.

[10]  Cornelia Caragea,et al.  Context Sensitive Topic Models for Author Influence in Document Networks , 2011, IJCAI.

[11]  Thomas L. Griffiths,et al.  The Author-Topic Model for Authors and Documents , 2004, UAI.

[12]  Hongyuan Zha,et al.  Probabilistic models for discovering e-communities , 2006, WWW '06.

[13]  Yun Chi,et al.  Analyzing communities and their evolutions in dynamic social networks , 2009, TKDD.

[14]  Yan Liu,et al.  Topic-link LDA: joint models of topic and author community , 2009, ICML '09.

[15]  Deng Cai,et al.  Topic modeling with network regularization , 2008, WWW.