论文信息 - User Influence and Follower Metrics in a Large Twitter Dataset

User Influence and Follower Metrics in a Large Twitter Dataset

Social media has become an important means to convey information. The microblogging service Twitter with about 284 million users and currently over 500 million tweets per day is an example. The site stores all the tweets once sent so that they can be retrieved later. The site has rather simple site ontology, i.e. the concepts it implements; the users are represented by a profile. They can follow other users, and a received tweet can be retweeted to all the followers of a user. In this paper we investigate diffusion of messages and influence of users on other users, mainly based on the retweet cascade size and attenuation patterns inside the cascade. We rely on a big data set collected after Boston marathon bombing on April 15, 2013. It contains about 8 million tweets and retweets sent by over 4 million different users. It was collected through the Twitter API that selects all the messages containing given keywords, including hashtags. We also collected all 7-8 billion followers of the above users during 2014. The follower relation is also used in influence estimations in some respects. The largest cascades originate from users with most followers and the cascade dies out after two or three frequency peaks.

[1] Stefan Stieglitz,et al. Towards more systematic Twitter analysis: metrics for tweeting activities , 2013 .

[2] Ed H. Chi,et al. Want to be Retweeted? Large Scale Analytics on Factors Impacting Retweet in Twitter Network , 2010, 2010 IEEE Second International Conference on Social Computing.

[3] Jure Leskovec,et al. Modeling Information Diffusion in Implicit Networks , 2010, 2010 IEEE International Conference on Data Mining.

[4] Jure Leskovec,et al. Can cascades be predicted? , 2014, WWW.

[5] Krishna P. Gummadi,et al. Measuring User Influence in Twitter: The Million Follower Fallacy , 2010, ICWSM.

[6] Qi He,et al. TwitterRank: finding topic-sensitive influential twitterers , 2010, WSDM '10.

[7] Duncan J. Watts,et al. Everyone's an influencer: quantifying influence on twitter , 2011, WSDM '11.

[8] Jimeng Sun,et al. A Survey of Models and Algorithms for Social Influence Analysis , 2011, Social Network Data Analytics.

[9] Wolfgang Kellerer,et al. Outtweeting the Twitterers - Predicting Information Cascades in Microblogs , 2010, WOSN.

[10] Daniel M. Romero,et al. Influence and passivity in social media , 2010, ECML/PKDD.

[11] Malik Magdon-Ismail,et al. Information Cascades in Social Media in Response to a Crisis : a Preliminary Model and a Case Study , 2012 .