Topic-aware social influence propagation models

The study of influence-driven propagations in social networks and its exploitation for viral marketing purposes has recently received a large deal of attention. However, regardless of the fact that users authoritativeness, expertise, trust and influence are evidently topic-dependent, the research on social influence has surprisingly largely overlooked this aspect. In this article, we study social influence from a topic modeling perspective. We introduce novel topic-aware influence-driven propagation models that, as we show in our experiments, are more accurate in describing real-world cascades than the standard (i.e., topic-blind) propagation models studied in the literature. In particular, we first propose simple topic-aware extensions of the well-known Independent Cascade and Linear Threshold models. However, these propagation models have a very large number of parameters which could lead to overfitting. Therefore, we propose a different approach explicitly modeling authoritativeness, influence and relevance under a topic-aware perspective. Instead of considering user-to-user influence, the proposed model focuses on user authoritativeness and interests in a topic, leading to a drastic reduction in the number of parameters of the model. We devise methods to learn the parameters of the models from a data set of past propagations. Our experimentation confirms the high accuracy of the proposed models and learning schemes.

[1]  David M. Blei,et al.  Introduction to Probabilistic Topic Models , 2010 .

[2]  Masahiro Kimura,et al.  Tractable Models for Information Diffusion in Social Networks , 2006, PKDD.

[3]  Laks V. S. Lakshmanan,et al.  Learning influence probabilities in social networks , 2010, WSDM '10.

[4]  Matthew Richardson,et al.  Mining the network value of customers , 2001, KDD '01.

[5]  Francesco Bonchi,et al.  Influence Propagation in Social Networks: A Data Mining Perspective , 2011, 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology.

[6]  Yun Chi,et al.  Information flow modeling based on diffusion rate for prediction and ranking , 2007, WWW '07.

[7]  Masahiro Kimura,et al.  Efficient discovery of influential nodes for SIS models in social networks , 2011, Knowledge and Information Systems.

[8]  Aristides Gionis,et al.  Sparsification of influence networks , 2011, KDD.

[9]  Thomas Hofmann,et al.  Probabilistic Latent Semantic Analysis , 1999, UAI.

[10]  Matthew Richardson,et al.  Mining knowledge-sharing sites for viral marketing , 2002, KDD.

[11]  Duncan J. Watts,et al.  Everyone's an influencer: quantifying influence on twitter , 2011, WSDM '11.

[12]  Suh-Yin Lee,et al.  Efficient algorithms for influence maximization in social networks , 2012, Knowledge and Information Systems.

[13]  Jiawei Han,et al.  Mining topic-level influence in heterogeneous networks , 2010, CIKM.

[14]  Andrew McCallum,et al.  Using Maximum Entropy for Text Classification , 1999 .

[15]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[16]  M. L. Fisher,et al.  An analysis of approximations for maximizing submodular set functions—I , 1978, Math. Program..

[17]  Wei Chen,et al.  Scalable influence maximization for prevalent viral marketing in large-scale social networks , 2010, KDD.

[18]  Jon Kleinberg,et al.  Maximizing the spread of influence through a social network , 2003, KDD '03.

[19]  Geoffrey E. Hinton,et al.  A View of the Em Algorithm that Justifies Incremental, Sparse, and other Variants , 1998, Learning in Graphical Models.

[20]  Qi He,et al.  TwitterRank: finding topic-sensitive influential twitterers , 2010, WSDM '10.

[21]  Jiawei Han,et al.  The Joint Inference of Topic Diffusion and Evolution in Social Communities , 2011, 2011 IEEE 11th International Conference on Data Mining.

[22]  Laks V. S. Lakshmanan,et al.  A Data-Based Approach to Social Influence Maximization , 2011, Proc. VLDB Endow..

[23]  BonchiFrancesco,et al.  A data-based approach to social influence maximization , 2011, VLDB 2011.

[24]  David M. Blei,et al.  Probabilistic topic models , 2012, Commun. ACM.

[25]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[26]  Andreas Krause,et al.  Cost-effective outbreak detection in networks , 2007, KDD '07.

[27]  Jimeng Sun,et al.  Social influence analysis in large-scale networks , 2009, KDD.

[28]  Nicola Barbieri,et al.  Topic-Aware Social Influence Propagation Models , 2012, ICDM.

[29]  Jennifer Neville,et al.  Modeling relationship strength in online social networks , 2010, WWW '10.

[30]  Masahiro Kimura,et al.  Prediction of Information Diffusion Probabilities for Independent Cascade Model , 2008, KES.

[31]  Francesco Bonchi,et al.  The Meme Ranking Problem: Maximizing Microblogging Virality , 2010, 2010 IEEE International Conference on Data Mining Workshops.

[32]  David Blei,et al.  Probabilistic topic models , 2011, KDD '11 Tutorials.