Who is Retweeting the Tweeters? Modeling, Originating, and Promoting Behaviors in the Twitter Network

Real-time microblogging systems such as Twitter offer users an easy and lightweight means to exchange information. Instead of writing formal and lengthy messages, microbloggers prefer to frequently broadcast several short messages to be read by other users. Only when messages are interesting, are they propagated further by the readers. In this article, we examine user behavior relevant to information propagation through microblogging. We specifically use retweeting activities among Twitter users to define and model originating and promoting behavior. We propose a basic model for measuring the two behaviors, a mutual dependency model, which considers the mutual relationships between the two behaviors, and a range-based model, which considers the depth and reach of users’ original tweets. Next, we compare the three behavior models and contrast them with the existing work on modeling influential Twitter users. Last, to demonstrate their applicability, we further employ the behavior models to detect interesting events from sudden changes in aggregated information propagation behavior of Twitter users. The results will show that the proposed behavior models can be effectively applied to detect interesting events in the Twitter stream, compared to the baseline tweet-based approaches.

[1]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[2]  M. Kendall A NEW MEASURE OF RANK CORRELATION , 1938 .

[3]  W. Grove Statistical Methods for Rates and Proportions, 2nd ed , 1981 .

[4]  Dimitrios Gunopulos,et al.  Searching for events in the blogosphere , 2009, WWW '09.

[5]  Bu-Sung Lee,et al.  Event Detection in Twitter , 2011, ICWSM.

[6]  Ana-Maria Popescu,et al.  Detecting controversial events from twitter , 2010, CIKM.

[7]  Ramanathan V. Guha,et al.  Information diffusion through blogspace , 2004, WWW '04.

[8]  Jure Leskovec,et al.  Modeling Information Diffusion in Implicit Networks , 2010, 2010 IEEE International Conference on Data Mining.

[9]  Munmun De Choudhury Discovery of information disseminators and receptors on online social media , 2010, HT '10.

[10]  Kristina Lerman,et al.  Predicting Influential Users in Online Social Networks , 2010, ArXiv.

[11]  Krishna P. Gummadi,et al.  Measuring User Influence in Twitter: The Million Follower Fallacy , 2010, ICWSM.

[12]  Philip S. Yu,et al.  Identifying the influential bloggers in a community , 2008, WSDM '08.

[13]  Scott Counts,et al.  Predicting the Speed, Scale, and Range of Information Diffusion in Twitter , 2010, ICWSM.

[14]  Prem Melville,et al.  Supervised Rank Aggregation for Predicting Influence in Networks , 2011, ArXiv.

[15]  Daniel M. Romero,et al.  Influence and passivity in social media , 2010, ECML/PKDD.

[16]  Prasenjit Mitra,et al.  Temporal and Information Flow Based Event Detection from Social Text Streams , 2007, AAAI.

[17]  Jure Leskovec,et al.  Meme-tracking and the dynamics of the news cycle , 2009, KDD.

[18]  Jon Kleinberg,et al.  Maximizing the spread of influence through a social network , 2003, KDD '03.

[19]  Richard Sproat,et al.  Mining correlated bursty topic patterns from coordinated text streams , 2007, KDD '07.

[20]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[21]  Ed H. Chi,et al.  Want to be Retweeted? Large Scale Analytics on Factors Impacting Retweet in Twitter Network , 2010, 2010 IEEE Second International Conference on Social Computing.

[22]  Qi He,et al.  TwitterRank: finding topic-sensitive influential twitterers , 2010, WSDM '10.

[23]  Ee-Peng Lim,et al.  Mining Interaction Behaviors for Email Reply Order Prediction , 2010, 2010 International Conference on Advances in Social Networks Analysis and Mining.

[24]  Katherine L. Milkman,et al.  Social Transmission, Emotion, and the Virality of Online Content , 2010 .

[25]  Anastasios Kementsietsidis,et al.  Proceedings of the ACM SIGMOD International Conference on Management of Data, SIGMOD 2013, New York, NY, USA, June 22-27, 2013 , 2013, SIGMOD Conference.

[26]  Laks V. S. Lakshmanan,et al.  Learning influence probabilities in social networks , 2010, WSDM '10.

[27]  Nick Koudas,et al.  TwitterMonitor: trend detection over the twitter stream , 2010, SIGMOD Conference.

[28]  Jon Kleinberg,et al.  Authoritative sources in a hyperlinked environment , 1999, SODA '98.

[29]  Ramanathan V. Guha,et al.  Information diffusion through blogspace , 2004, SKDD.

[30]  Eric Brill,et al.  Improving web search ranking by incorporating user behavior information , 2006, SIGIR.

[31]  Jon M. Kleinberg,et al.  Bursty and Hierarchical Structure in Streams , 2002, Data Mining and Knowledge Discovery.

[32]  Ramanathan V. Guha,et al.  Propagation of trust and distrust , 2004, WWW '04.

[33]  Duncan J. Watts,et al.  Who says what to whom on twitter , 2011, WWW.

[34]  Yoichi Shinoda,et al.  Information filtering based on user behavior analysis and best match text retrieval , 1994, SIGIR '94.

[35]  Rada Mihalcea,et al.  TextRank: Bringing Order into Text , 2004, EMNLP.

[36]  Barbara Poblete,et al.  Twitter under crisis: can we trust what we RT? , 2010, SOMA '10.

[37]  Lin Qiu,et al.  Understanding the psychological motives behind microblogging. , 2010, Studies in health technology and informatics.

[38]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[39]  Bo Zhao,et al.  PET: a statistical model for popular events tracking in social communities , 2010, KDD.

[40]  Fabrizio Silvestri,et al.  Know your neighbors: web spam detection using the web topology , 2007, SIGIR.

[41]  Bernardo A. Huberman,et al.  Trends in Social Media: Persistence and Decay , 2011, ICWSM.

[42]  LimEe-Peng,et al.  Who is Retweeting the Tweeters? Modeling, Originating, and Promoting Behaviors in the Twitter Network , 2012 .

[43]  Hila Becker,et al.  Learning similarity metrics for event identification in social media , 2010, WSDM '10.

[44]  Hector Garcia-Molina,et al.  Combating Web Spam with TrustRank , 2004, VLDB.

[45]  Marc Najork,et al.  Detecting spam web pages through content analysis , 2006, WWW '06.

[46]  Mor Naaman,et al.  Is it really about me?: message content in social awareness streams , 2010, CSCW '10.

[47]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.