Who Influenced You? Predicting Retweet via Social Influence Locality

Social influence occurs when one’s opinions, emotions, or behaviors are affected by others in a social network. However, social influence takes many forms, and its underlying mechanism is still unclear. For example, how is one’s behavior influenced by a group of friends who know each other and by the friends from different ego friend circles? In this article, we study the social influence problem in a large microblogging network. Particularly, we consider users’ (re)tweet behaviors and focus on investigating how friends in one’s ego network influence retweet behaviors. We propose a novel notion of social influence locality and develop two instantiation functions based on pairwise influence and structural diversity. The defined influence locality functions have strong predictive power. Without any additional features, we can obtain an F1-score of 71.65% for predicting users’ retweet behaviors by training a logistic regression classifier based on the defined influence locality functions. We incorporate social influence locality into a factor graph model, which can further leverage the network-based correlation. Our experiments on the large microblogging network show that the model significantly improves the precision of retweet prediction. Our analysis also reveals several intriguing discoveries. For example, if you have six friends retweeting a microblog, the average likelihood that you will also retweet it strongly depends on the structure among the six friends: The likelihood will significantly drop (only ⅙) when the six friends do not know each other, compared with the case when the six friends know each other.

[1]  Ed H. Chi,et al.  Want to be Retweeted? Large Scale Analytics on Factors Impacting Retweet in Twitter Network , 2010, 2010 IEEE Second International Conference on Social Computing.

[2]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[3]  Juan-Zi Li,et al.  Understanding retweeting behaviors in social networks , 2010, CIKM.

[4]  M. Jackson,et al.  Peer Pressure , 2008 .

[5]  Laks V. S. Lakshmanan,et al.  SIMPATH: An Efficient Algorithm for Influence Maximization under the Linear Threshold Model , 2011, 2011 IEEE 11th International Conference on Data Mining.

[6]  Ramanathan V. Guha,et al.  Information diffusion through blogspace , 2004, WWW '04.

[7]  Conor Hayes,et al.  Cross-Community Influence in Discussion Fora , 2012, ICWSM.

[8]  Danah Boyd,et al.  Tweet, Tweet, Retweet: Conversational Aspects of Retweeting on Twitter , 2010, 2010 43rd Hawaii International Conference on System Sciences.

[9]  Jie Tang,et al.  Learning to Infer Social Ties in Large Networks , 2011, ECML/PKDD.

[10]  Jiawei Han,et al.  Learning influence from heterogeneous social networks , 2012, Data Mining and Knowledge Discovery.

[11]  Gregor Heinrich Parameter estimation for text analysis , 2009 .

[12]  Jie Tang,et al.  Modeling Indirect Influence on Twitter , 2012, Int. J. Semantic Web Inf. Syst..

[13]  Jie Tang,et al.  Mining structural hole spanners through information diffusion in social networks , 2013, WWW.

[14]  Jiawei Han,et al.  ACM Transactions on Knowledge Discovery from Data: Introduction , 2007 .

[15]  Qi He,et al.  TwitterRank: finding topic-sensitive influential twitterers , 2010, WSDM '10.

[16]  Bram P. Buunk The Blackwell encyclopedia of social psychology. , 1995 .

[17]  Andreas Krause,et al.  Cost-effective outbreak detection in networks , 2007, KDD '07.

[18]  Jure Leskovec,et al.  Patterns of Influence in a Recommendation Network , 2006, PAKDD.

[19]  Philip S. Yu,et al.  Extracting social events for learning better information diffusion models , 2013, KDD.

[20]  Steffen Bickel,et al.  Unsupervised prediction of citation influences , 2007, ICML '07.

[21]  Jimeng Sun,et al.  Neighborhood formation and anomaly detection in bipartite graphs , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[22]  Jure Leskovec,et al.  Predicting positive and negative links in online social networks , 2010, WWW '10.

[23]  Yong Yu,et al.  Collaborative personalized tweet recommendation , 2012, SIGIR '12.

[24]  Matthew Michelson,et al.  Tweet Disambiguate Entities Retrieve Folksonomy SubTree Step 1 : Discover Categories Generate Topic Profile from SubTrees Step 2 : Discover Profile Topic Profile : “ English Football ” “ World Cup ” , 2011 .

[25]  Cameron Marlow,et al.  A 61-million-person experiment in social influence and political mobilization , 2012, Nature.

[26]  Masahiro Kimura,et al.  Prediction of Information Diffusion Probabilities for Independent Cascade Model , 2008, KES.

[27]  Sharon L. Milgram,et al.  The Small World Problem , 1967 .

[28]  Juan-Zi Li,et al.  Role-Aware Conformity Modeling and Analysis in Social Networks , 2014, AAAI.

[29]  Laks V. S. Lakshmanan,et al.  Learning influence probabilities in social networks , 2010, WSDM '10.

[30]  Ravi Kumar,et al.  Influence and correlation in social networks , 2008, KDD.

[31]  Yifei Yuan,et al.  Influence Maximization in Social Networks When Negative Opinions May Emerge and Propagate , 2011, SDM.

[32]  ZhangJing,et al.  Who Influenced You? Predicting Retweet via Social Influence Locality , 2015 .

[33]  Wei Chen,et al.  Influence diffusion dynamics and influence maximization in social networks with friend and foe relationships , 2011, WSDM.

[34]  Piotr Faliszewski,et al.  Parameterized Algorithmics for Computational Social Choice: Nine Research Challenges , 2014, ArXiv.

[35]  J. M. Hammersley,et al.  Markov fields on finite graphs and lattices , 1971 .

[36]  Jie Tang,et al.  Role-aware conformity influence modeling and analysis in social networks , 2014, AAAI 2014.

[37]  Jimeng Sun,et al.  Confluence: conformity influence in large social networks , 2013, KDD.

[38]  Jianyong Wang,et al.  Retweet or not?: personalized tweet re-ranking , 2013, WSDM.

[39]  Masahiro Kimura,et al.  Learning information diffusion model in a social network for predicting influence of nodes , 2011, Intell. Data Anal..

[40]  Jimeng Sun,et al.  Patent partner recommendation in enterprise social networks , 2013, WSDM.

[41]  Yizhou Sun,et al.  RAIN: Social Role-Aware Information Diffusion , 2015, AAAI.

[42]  Rong Yan,et al.  Social influence in social advertising: evidence from field experiments , 2012, EC '12.

[43]  R. Robertson Globalization: Social Theory and Global Culture , 1992 .

[44]  Jennifer Neville,et al.  Randomization tests for distinguishing social influence and homophily effects , 2010, WWW '10.

[45]  Thomas Gottron,et al.  Bad news travel fast: a content-based analysis of interestingness on Twitter , 2011, WebSci '11.

[46]  Long Jiang,et al.  User-level sentiment analysis incorporating social networks , 2011, KDD.

[47]  Ramanathan V. Guha,et al.  Propagation of trust and distrust , 2004, WWW '04.

[48]  Duncan J. Watts,et al.  Who says what to whom on twitter , 2011, WWW.

[49]  Harry Shum,et al.  An Empirical Study on Learning to Rank of Tweets , 2010, COLING.

[50]  Jimeng Sun,et al.  Social influence analysis in large-scale networks , 2009, KDD.

[51]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[52]  Alexander J. Smola,et al.  Like like alike: joint friendship and interest propagation in social networks , 2011, WWW.

[53]  Lars Backstrom,et al.  Structural diversity in social contagion , 2012, Proceedings of the National Academy of Sciences.

[54]  Fang Wu,et al.  Social Networks that Matter: Twitter Under the Microscope , 2008, First Monday.

[55]  Wei Chen,et al.  Scalable influence maximization for prevalent viral marketing in large-scale social networks , 2010, KDD.

[56]  Miles Osborne,et al.  RT to Win! Predicting Message Propagation in Twitter , 2011, ICWSM.

[57]  Ning Zhang,et al.  Time-Critical Influence Maximization in Social Networks with Time-Delayed Diffusion Process , 2012, AAAI.

[58]  Jure Leskovec,et al.  Information diffusion and external influence in networks , 2012, KDD.

[59]  Nicola Barbieri,et al.  Topic-aware social influence propagation models , 2012, Knowledge and Information Systems.

[60]  Jürgen Kurths,et al.  Evidence for a bimodal distribution in human communication , 2010, Proceedings of the National Academy of Sciences.

[61]  L. Asz Random Walks on Graphs: a Survey , 2022 .

[62]  Yang Liu,et al.  BPGM: A big graph mining tool , 2014 .

[63]  Arun Sundararajan,et al.  Distinguishing influence-based contagion from homophily-driven diffusion in dynamic networks , 2009, Proceedings of the National Academy of Sciences.