Can cascades be predicted?

On many social networking websites such as Facebook and Twitter, resharing or reposting functionality allows users to share others' content with their own friends or followers. As content is reshared from user to user, large cascades of reshares can form. While a growing body of research has focused on analyzing and characterizing such cascades, a recent, parallel line of work has argued that the future trajectory of a cascade may be inherently unpredictable. In this work, we develop a framework for addressing cascade prediction problems. On a large sample of photo reshare cascades on Facebook, we find strong performance in predicting whether a cascade will continue to grow in the future. We find that the relative growth of a cascade becomes more predictable as we observe more of its reshares, that temporal and structural features are key predictors of cascade size, and that, initially, breadth rather than depth in a cascade is the better indicator of larger cascades. This prediction performance is robust in the sense that multiple distinct classes of features all achieve similar performance. We also discover that temporal features are predictive of a cascade's eventual shape. Observing independent cascades of the same content, we find that while these cascades differ greatly in size, we are still able to predict which one ends up the largest.
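To make the breadth and depth notions in the abstract concrete, the following is a minimal sketch of how such structural features might be computed from an observed cascade. It is not the authors' implementation: the function name `cascade_features`, the `(parent, child)` edge representation, and the definition of breadth as the largest number of nodes at any single depth below the root are all assumptions made for illustration.

```python
from collections import defaultdict, deque


def cascade_features(root, reshares):
    """Compute simple structural features of a reshare cascade.

    root     -- identifier of the original post (hypothetical representation)
    reshares -- list of (parent, child) pairs: `child` reshared from `parent`
    """
    # Build the cascade tree as an adjacency list.
    children = defaultdict(list)
    for parent, child in reshares:
        children[parent].append(child)

    # Breadth-first traversal from the root to assign a depth to each node.
    depth_of = {root: 0}
    queue = deque([root])
    while queue:
        node = queue.popleft()
        for child in children[node]:
            if child not in depth_of:
                depth_of[child] = depth_of[node] + 1
                queue.append(child)

    # Count how many nodes sit at each depth below the root.
    level_sizes = defaultdict(int)
    for depth in depth_of.values():
        if depth > 0:
            level_sizes[depth] += 1

    return {
        "size": len(depth_of) - 1,  # reshares reachable from the root
        "depth": max(depth_of.values()),  # longest reshare chain
        # Assumed definition: widest single level of the tree.
        "breadth": max(level_sizes.values()) if level_sizes else 0,
    }


# Toy cascade: three direct reshares of "a", one of which is reshared again.
features = cascade_features("a", [("a", "b"), ("a", "c"), ("a", "d"), ("b", "e")])
```

On this toy cascade the tree is wider than it is deep (three nodes at depth 1, one at depth 2). Features of this kind, computed on the first few observed reshares, could then be fed to any standard classifier alongside temporal features.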
