Unlocking Author Power: On the Exploitation of Auxiliary Author-Retweeter Relations for Predicting Key Retweeters

Retweeting is a powerful driving force in information propagation on microblogging sites. However, identifying the most effective retweeters of a message (called the ”key retweeter prediction” problem) has become a significant research topic. Conventional approaches have addressed this topic from two main aspects: by analyzing either the personal attributes of microblogging users or the structures of user graph networks. However, according to sociological findings, author-retweeter dependencies also play a crucial role in influencing message propagation. In this paper, we propose a novel model to solve the key retweeter prediction problem by incorporating the auxiliary relations between a tweet author and potential retweeters. Without loss of generality, we formulate the relations from four relational factors: status relation, temporal relation, locational relation, and interactive relation. In addition, we propose a novel method, called “Relation-based Learning to Rank (RL2R),” to determine the key retweeters for a given tweet by ranking the potential retweeters in terms of their spreadability. The experimental results show that our method outperforms the state-of-the-art algorithms at top-k retweeter prediction, achieving a significant relative average improvement of 19.7–29.4 percent. These findings provide new insights for understanding user behaviors on social media for key retweeter prediction purposes.

[1]  Krishna P. Gummadi,et al.  Measuring User Influence in Twitter: The Million Follower Fallacy , 2010, ICWSM.

[2]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[3]  Yi-Cheng Zhang,et al.  Leaders in Social Networks, the Delicious Case , 2011, PloS one.

[4]  Sinan Aral,et al.  Identifying Influential and Susceptible Members of Social Networks , 2012, Science.

[5]  Yongdong Zhang,et al.  Unfolding Temporal Dynamics: Predicting Social Media Popularity Using Multi-scale Temporal Decomposition , 2016, AAAI.

[6]  Duanbing Chen,et al.  Identifying Influential Spreaders by Weighted LeaderRank , 2013, ArXiv.

[7]  Tie-Yan Liu,et al.  Learning to rank: from pairwise approach to listwise approach , 2007, ICML '07.

[8]  Jiaul H. Paik A novel TF-IDF weighting scheme for effective ranking , 2013, SIGIR.

[9]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[10]  Krishna P. Gummadi,et al.  Geographic Dissection of the Twitter Network , 2012, ICWSM.

[11]  Yi Chang,et al.  Yahoo! Learning to Rank Challenge Overview , 2010, Yahoo! Learning to Rank Challenge.

[12]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[13]  Ed H. Chi,et al.  Want to be Retweeted? Large Scale Analytics on Factors Impacting Retweet in Twitter Network , 2010, 2010 IEEE Second International Conference on Social Computing.

[14]  Sangwook Kim,et al.  Identifying and ranking influential spreaders in complex networks by neighborhood coreness , 2014 .

[15]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[16]  Damon Centola,et al.  The Spread of Behavior in an Online Social Network Experiment , 2010, Science.

[17]  Duncan J. Watts,et al.  Who says what to whom on twitter , 2011, WWW.

[18]  Lev Muchnik,et al.  Identifying influential spreaders in complex networks , 2010, 1001.5285.

[19]  Io Taxidou,et al.  Online analysis of information diffusion in twitter , 2014, WWW.

[20]  Brian D. Davison,et al.  Predicting popular messages in Twitter , 2011, WWW.

[21]  Yamir Moreno,et al.  Absence of influential spreaders in rumor dynamics , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[22]  Wei Chen,et al.  Efficient influence maximization in social networks , 2009, KDD.

[23]  D. Murthy Towards a Sociological Understanding of Social Media: Theorizing Twitter , 2012 .

[24]  Yang Liu,et al.  Who Influenced You? Predicting Retweet via Social Influence Locality , 2015, ACM Trans. Knowl. Discov. Data.

[25]  George W. Furnas,et al.  Pictures of relevance: A geometric analysis of similarity measures , 1987, J. Am. Soc. Inf. Sci..

[26]  Jong-Ryul Lee,et al.  A Query Approach for Influence Maximization on Specific Users in Social Networks , 2015, IEEE Transactions on Knowledge and Data Engineering.

[27]  Xuanjing Huang,et al.  Retweet Prediction with Attention-based Deep Neural Network , 2016, CIKM.

[28]  Enhong Chen,et al.  Maximizing the Coverage of Information Propagation in Social Networks , 2015, IJCAI.

[29]  Wei Chen,et al.  Scalable influence maximization for prevalent viral marketing in large-scale social networks , 2010, KDD.

[30]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[31]  Miles Osborne,et al.  RT to Win! Predicting Message Propagation in Twitter , 2011, ICWSM.

[32]  Sheng Tang,et al.  Sparse Ensemble Learning for Concept Detection , 2012, IEEE Transactions on Multimedia.

[33]  Cécile Favre,et al.  Information diffusion in online social networks: a survey , 2013, SGMD.

[34]  Duncan J. Watts,et al.  Everyone's an influencer: quantifying influence on twitter , 2011, WSDM '11.

[35]  Hang Li,et al.  AdaRank: a boosting algorithm for information retrieval , 2007, SIGIR.

[36]  Yongdong Zhang,et al.  Time Matters: Multi-scale Temporalization of Social Media Popularity , 2016, ACM Multimedia.

[37]  Mohammed J. Zaki,et al.  ProfileRank: finding relevant content and influential users based on information diffusion , 2013, SNAKDD '13.