Predicting Twitter Hashtags Popularity Level

This paper investigates the problem of predicting Twitter hashtags popularity level. A data set of more than 18 million tweets containing 748 thousand hashtags has been prepared by using Twitter's Streaming API. Early adoption properties including profile of tweet authors and adoption time series are used to predict a tag's later popularity level. The followers count and tweets count are two such characteristics related to adopters' profile. On the other hand, two types of frequency domain analyses are used to augment the simple mean and standard deviation characteristics of the adoption time series. Fourier transform (FT) spectrum and wavelet transform (WT) spectrum are considered in this study. Experimental results show that WT spectrum improves the prediction result of viral hashtags while FT spectrum does not.

[1]  Ari Rappoport,et al.  What's in a hashtag?: content based prediction of the spread of ideas in microblogging communities , 2012, WSDM '12.

[2]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[3]  Ed H. Chi,et al.  Want to be Retweeted? Large Scale Analytics on Factors Impacting Retweet in Twitter Network , 2010, 2010 IEEE Second International Conference on Social Computing.

[4]  Huan Liu,et al.  Is the Sample Good Enough? Comparing Data from Twitter's Streaming API with Twitter's Firehose , 2013, ICWSM.

[5]  Don H. Johnson,et al.  Gauss and the history of the fast Fourier transform , 1985 .

[6]  Scott Counts,et al.  Identifying topical authorities in microblogs , 2011, WSDM '11.

[7]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[8]  Jure Leskovec,et al.  Patterns of temporal variation in online media , 2011, WSDM '11.

[9]  Rizal Setya Perdana What is Twitter , 2013 .

[10]  Bernardo A. Huberman,et al.  Predicting the popularity of online content , 2008, Commun. ACM.

[11]  Gao Cong,et al.  On predicting the popularity of newly emerging hashtags in Twitter , 2013, J. Assoc. Inf. Sci. Technol..

[12]  Karen Rose,et al.  What is Twitter , 2009 .

[13]  Krishna P. Gummadi,et al.  Measuring User Influence in Twitter: The Million Follower Fallacy , 2010, ICWSM.

[14]  C. Torrence,et al.  A Practical Guide to Wavelet Analysis. , 1998 .

[15]  Filippo Menczer,et al.  Predicting Successful Memes Using Network and Community Structure , 2014, ICWSM.

[16]  Filippo Menczer,et al.  Virality Prediction and Community Structure in Social Networks , 2013, Scientific Reports.