Predicting Who Will Retweet or Not in Microblogs Network

Retweeting often leads to fast and wide information propagation in microblogs. There has been research concerning predicting users’ retweeting behavior. However, it lacks of in-depth investigations on key features that contribute to the prediction accuracy. In this paper, we systematically examined four types of features: followee (the user who posted tweets) features, follower features, tweet features and interaction features in terms of their contribution to retweeting prediction accuracy. We collected Sina Weibo data and ranked the features extracted from original data on importance by using an information gain method. There were twelve features showing significant association with the retweeting predictive accuracy through lots of experiments. Experimental results showed that using these twelve features, we could train a novel predictive model and achieved the predict accuracy up to 98 %.

[1]  Balachander Krishnamurthy,et al.  A few chirps about twitter , 2008, WOSN '08.

[2]  Yusuke Ota,et al.  Discovery of Interesting Users in Twitter by Overlapping Propagation Paths of Retweets , 2012, 2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology.

[3]  Ralf Herbrich,et al.  Predicting Information Spreading in Twitter , 2010 .

[4]  Will Webberley,et al.  Retweeting: A study of message-forwarding in twitter , 2011, 2011 Workshop on Mobile and Online Social Networks.

[5]  Krishna P. Gummadi,et al.  Measuring User Influence in Twitter: The Million Follower Fallacy , 2010, ICWSM.

[6]  Scott Counts,et al.  Predicting the Speed, Scale, and Range of Information Diffusion in Twitter , 2010, ICWSM.

[7]  Xueqi Cheng,et al.  Popularity prediction in microblogging network: a case study on sina weibo , 2013, WWW.

[8]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[9]  Shou-De Lin,et al.  Modeling and Visualizing Information Propagation in a Micro-blogging Platform , 2011, 2011 International Conference on Advances in Social Networks Analysis and Mining.

[10]  Jeffrey Nichols,et al.  Who will retweet this?: Automatically Identifying and Engaging Strangers on Twitter to Spread Information , 2014, IUI.

[11]  Miles Osborne,et al.  RT to Win! Predicting Message Propagation in Twitter , 2011, ICWSM.

[12]  Rohini K. Srihari,et al.  Feature selection for text categorization on imbalanced data , 2004, SKDD.

[13]  Yang Li,et al.  Predicting the Popularity of Messages on Micro-blog Services , 2014, SMP.

[14]  Ed H. Chi,et al.  Want to be Retweeted? Large Scale Analytics on Factors Impacting Retweet in Twitter Network , 2010, 2010 IEEE Second International Conference on Social Computing.