Multi-Topic Tracking Model for Dynamic Social Network

The topic tracking problem has attracted much attention in recent decades. However, existing approaches rarely consider network structure and textual topics together. In this paper, we propose a novel statistical model based on a dynamic Bayesian network, namely the Multi-Topic Tracking Model for Dynamic Social Network (MTTD). It takes into account the influence phenomenon, the selection phenomenon, the document generative process, and the evolution of textual topics. Specifically, to capture the influence phenomenon, the MTTD model defines a Gibbs random field over the historical status of users in the network and the interdependency between them. To address the selection phenomenon, a stochastic block model describes the link generation process based on the users’ interests in topics. Probabilistic Latent Semantic Analysis (PLSA) describes the document generative process according to the users’ interests. Finally, the topic evolution model includes a dependence on the historical topic status to ensure the continuity of each topic over time. The Expectation Maximization (EM) algorithm is used to estimate the parameters of the proposed MTTD model. Empirical experiments on real datasets show that the MTTD model outperforms Popular Event Tracking (PET) and the Dynamic Topic Model (DTM) in generalization, in topic interpretability, and in tracking the evolution of topic content and topic popularity.
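As a rough illustration of one building block named in the abstract, the sketch below fits plain PLSA with EM on a toy document-word count matrix. It is not the MTTD model: the Gibbs random field, the stochastic block model, and the temporal dependence are all omitted, and the function names (`plsa_em`, `log_likelihood`), the toy counts, and the iteration budget are assumptions made here for illustration only.

```python
import numpy as np


def plsa_em(counts, n_topics, n_iter=50, seed=0):
    """Fit plain PLSA by EM (illustrative only, not the MTTD estimator).

    counts: (n_docs, n_words) array of non-negative word counts.
    Returns P(z|d) with shape (n_docs, n_topics) and
    P(w|z) with shape (n_topics, n_words).
    """
    rng = np.random.default_rng(seed)
    n_docs, n_words = counts.shape
    eps = 1e-12  # guards against division by zero

    # Random initialisation, normalised so each row is a distribution.
    p_z_d = rng.random((n_docs, n_topics))
    p_z_d /= p_z_d.sum(axis=1, keepdims=True)
    p_w_z = rng.random((n_topics, n_words))
    p_w_z /= p_w_z.sum(axis=1, keepdims=True)

    for _ in range(n_iter):
        # E-step: posterior P(z | d, w), shape (n_docs, n_topics, n_words).
        joint = p_z_d[:, :, None] * p_w_z[None, :, :]
        post = joint / (joint.sum(axis=1, keepdims=True) + eps)

        # M-step: re-estimate both distributions from expected counts.
        weighted = counts[:, None, :] * post
        p_w_z = weighted.sum(axis=0)
        p_w_z /= p_w_z.sum(axis=1, keepdims=True) + eps
        p_z_d = weighted.sum(axis=2)
        p_z_d /= p_z_d.sum(axis=1, keepdims=True) + eps

    return p_z_d, p_w_z


def log_likelihood(counts, p_z_d, p_w_z):
    """Log-likelihood of the counts under P(w|d) = sum_z P(z|d) P(w|z)."""
    p_w_d = p_z_d @ p_w_z
    return float((counts * np.log(p_w_d + 1e-12)).sum())


# Toy corpus: two documents favour words 0-1, two favour words 2-3.
toy_counts = np.array(
    [[5, 4, 0, 0],
     [4, 6, 0, 1],
     [0, 0, 5, 5],
     [0, 1, 6, 4]],
    dtype=float,
)
doc_topics, topic_words = plsa_em(toy_counts, n_topics=2)
print(np.round(doc_topics, 2))
```

In this simplified setting each row of `doc_topics` plays the role of a user's or document's topic interest; a model along the lines described above would additionally couple those interests across linked users and across time steps.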