Mining topic-level influence in heterogeneous networks

Influence is a complex and subtle force that governs the dynamics of social networks and the behavior of their users. Understanding influence can benefit applications such as viral marketing, recommendation, and information retrieval. However, most existing work on social influence analysis focuses on verifying that social influence exists. Few studies systematically investigate how to mine the strength of direct and indirect influence between nodes in heterogeneous networks. To address this problem, we propose a generative graphical model that uses the heterogeneous link information and the textual content associated with each node in the network to mine topic-level direct influence. Based on the learned direct influence, we propose a topic-level influence propagation and aggregation algorithm to derive the indirect influence between nodes. We further study how the discovered topic-level influence can help predict user behavior. We validate the approach on three different types of data sets: Twitter, Digg, and citation networks. Qualitatively, our approach discovers interesting influence patterns in heterogeneous networks. Quantitatively, the learned topic-level influence can greatly improve the accuracy of user behavior prediction.
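To make the idea of deriving indirect influence from direct influence concrete, the following is a minimal illustrative sketch, not the algorithm from the paper. It assumes that direct influence is already available as per-topic edge weights, that influence along a path is the product of its edge weights, and that longer paths are discounted by a decay factor before being summed. The function name `propagate_influence` and the parameters `max_hops` and `decay` are hypothetical choices made for this example.

```python
def propagate_influence(direct, max_hops=3, decay=0.5):
    """Aggregate per-topic influence over paths of up to max_hops edges.

    direct: dict mapping topic -> {(source, target): strength}.
    Returns a dict of the same shape holding direct plus discounted
    indirect influence. Illustrative assumption only: path influence is
    the product of edge strengths, discounted by decay per extra hop.
    """
    result = {}
    for topic, edges in direct.items():
        # Outgoing adjacency list for this topic.
        outgoing = {}
        for (src, dst), weight in edges.items():
            outgoing.setdefault(src, []).append((dst, weight))

        total = dict(edges)      # hop 1: the direct influence itself
        frontier = dict(edges)   # influence carried by paths of the current length
        for hop in range(2, max_hops + 1):
            extended = {}
            for (src, mid), weight in frontier.items():
                for dst, next_weight in outgoing.get(mid, []):
                    if dst == src:
                        continue  # ignore influence of a node on itself
                    key = (src, dst)
                    extended[key] = extended.get(key, 0.0) + weight * next_weight
            # Discount longer paths and add them to the running total.
            for key, weight in extended.items():
                total[key] = total.get(key, 0.0) + decay ** (hop - 1) * weight
            frontier = extended
        result[topic] = total
    return result


# Toy input: made-up strengths for a single hypothetical topic.
direct = {"sports": {("a", "b"): 0.5, ("b", "c"): 0.4}}
scores = propagate_influence(direct, max_hops=2, decay=0.5)
```

In this toy input, node `a` has no direct edge to `c`, but the sketch assigns it an indirect score on the topic through `b`. Other aggregation rules (for example, taking the maximum over paths instead of the sum) would fit the same interface.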
