Scalable Detection of Viral Memes from Diffusion Patterns

Social media and social networking platforms have flourished with the rapid development of mobile technology and the ubiquitous use of the Internet. As a result, memes, or pieces of information spreading from person to person, can be reshared among users quickly and gain huge popularity. As viral memes have tremendous social and economic impact, detecting these viral memes at their early stages of spread is a worthy, yet challenging problem. Here we review the literature on predicting viral memes, and present empirical results from Twitter and Tumblr datasets. We demonstrate how diffusion patterns of memes, in the context of network communities, play an important role in predicting virality. We show that it is feasible to obtain predictive features based on community structure even at the massive scales that common social media services need to process. Our results may not only enable practitioners to make predictions about meme diffusion, but also help researchers understand how and why different factors, in particular diffusion patterns in communities, affect online virality.

[1]  Gözde Özbal,et al.  Exploring Text Virality in Social Networks , 2011, ICWSM.

[2]  Krishna P. Gummadi,et al.  Measuring User Influence in Twitter: The Million Follower Fallacy , 2010, ICWSM.

[3]  Daniel Halperin,et al.  Scalable Flow-Based Community Detection for Large-Scale Network Analysis , 2013, 2013 IEEE 13th International Conference on Data Mining Workshops.

[4]  D. Kendall,et al.  Epidemics and Rumours , 1964, Nature.

[5]  Lev Muchnik,et al.  Identifying influential spreaders in complex networks , 2010, 1001.5285.

[6]  WILLIAM GOFFMAN,et al.  Generalization of Epidemic Theory: An Application to the Transmission of Ideas , 1964, Nature.

[7]  Robert L. Goldstone,et al.  Propagation of innovations in networked groups. , 2008, Journal of experimental psychology. General.

[8]  Damon Centola An Experimental Study of Homophily in the Adoption of Health Behavior , 2011, Science.

[9]  Matthew J. Salganik,et al.  Experimental Study of Inequality and Unpredictability in an Artificial Cultural Market , 2006, Science.

[10]  Peter H. Reingen,et al.  Social Ties and Word-of-Mouth Referral Behavior , 1987 .

[11]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[12]  Sune Lehmann,et al.  Link communities reveal multiscale complexity in networks , 2009, Nature.

[13]  Lei Yang,et al.  We know what @you #tag: does the dual role affect hashtag adoption? , 2012, WWW.

[14]  Jon Kleinberg,et al.  Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on twitter , 2011, WWW.

[15]  Kristina Lerman,et al.  Information Contagion: An Empirical Study of the Spread of News on Digg and Twitter Social Networks , 2010, ICWSM.

[16]  Filippo Menczer,et al.  The Digital Evolution of Occupy Wall Street , 2013, PloS one.

[17]  Filippo Menczer,et al.  Predicting Successful Memes Using Network and Community Structure , 2014, ICWSM.

[18]  Dylan Walker,et al.  Creating Social Contagion Through Viral Product Design: A Randomized Trial of Peer Influence in Networks , 2010, ICIS.

[19]  Stanislav Nikolov Trend or no trend : a novel nonparametric method for classifying time series , 2012 .

[20]  Damon Centola,et al.  The Spread of Behavior in an Online Social Network Experiment , 2010, Science.

[21]  Katherine L. Milkman,et al.  What Makes Online Content Viral? , 2012 .

[22]  Duncan J Watts,et al.  A simple model of global cascades on random networks , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Jure Leskovec,et al.  Can cascades be predicted? , 2014, WWW.

[24]  Huzefa Rangwala,et al.  Digging Digg: Comment Mining, Popularity Prediction, and Social Network Analysis , 2009, 2009 International Conference on Web Information Systems and Mining.

[25]  Filippo Menczer,et al.  Topicality and Impact in Social Media: Diverse Messages, Focused Messengers , 2014, PloS one.

[26]  R. Lewontin ‘The Selfish Gene’ , 1977, Nature.

[27]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[28]  Alessandro Vespignani,et al.  Epidemic spreading in scale-free networks. , 2000, Physical review letters.

[29]  Martin Rosvall,et al.  Maps of random walks on complex networks reveal community structure , 2007, Proceedings of the National Academy of Sciences.

[30]  Filippo Menczer,et al.  Virality Prediction and Community Structure in Social Networks , 2013, Scientific Reports.

[31]  Jimeng Sun,et al.  Social influence analysis in large-scale networks , 2009, KDD.

[32]  Sean J. Taylor,et al.  Social Influence Bias: A Randomized Experiment , 2013, Science.

[33]  Damon Centola Damon Centola Behavior An Experimental Study of Homophily in the Adoption of Health , 2011 .

[34]  Duncan J. Watts,et al.  Everyone's an influencer: quantifying influence on twitter , 2011, WSDM '11.

[35]  A. Vespignani,et al.  Competition among memes in a world with limited attention , 2012, Scientific Reports.

[36]  Mark S. Granovetter Threshold Models of Collective Behavior , 1978, American Journal of Sociology.

[37]  Mark S. Granovetter The Strength of Weak Ties , 1973, American Journal of Sociology.

[38]  Ed H. Chi,et al.  Want to be Retweeted? Large Scale Analytics on Factors Impacting Retweet in Twitter Network , 2010, 2010 IEEE Second International Conference on Social Computing.

[39]  Thomas C. Schelling,et al.  Dynamic models of segregation , 1971 .

[40]  S. Fortunato,et al.  Resolution limit in community detection , 2006, Proceedings of the National Academy of Sciences.

[41]  Lada A. Adamic,et al.  Social influence and the diffusion of user-created content , 2009, EC '09.

[42]  Qi He,et al.  TwitterRank: finding topic-sensitive influential twitterers , 2010, WSDM '10.

[43]  M. McPherson,et al.  Birds of a Feather: Homophily in Social Networks , 2001 .

[44]  Alessandro Flammini,et al.  Optimal network clustering for information diffusion , 2014, Physical review letters.

[45]  Santo Fortunato,et al.  Consensus clustering in complex networks , 2012, Scientific Reports.

[46]  Daniel M. Romero,et al.  Influence and passivity in social media , 2010, ECML/PKDD.

[47]  Jure Leskovec,et al.  Patterns of temporal variation in online media , 2011, WSDM '11.

[48]  Jure Leskovec,et al.  The dynamics of viral marketing , 2005, EC '06.

[49]  M. Macy,et al.  Complex Contagions and the Weakness of Long Ties1 , 2007, American Journal of Sociology.

[50]  Bernardo A. Huberman,et al.  Trends in Social Media: Persistence and Decay , 2011, ICWSM.

[51]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.