A framework for quantitative analysis of cascades on networks

How does information flow in online social networks? How does the structure and size of the information cascade evolve in time? How can we efficiently mine the information contained in cascade dynamics? We approach these questions empirically and present an efficient and scalable mathematical framework for quantitative analysis of cascades on networks. We define a cascade generating function that captures the details of the microscopic dynamics of the cascades. We show that this function can also be used to compute the macroscopic properties of cascades, such as their size, spread, diameter, number of paths, and average path length. We present an algorithm to efficiently compute cascade generating function and demonstrate that while significantly compressing information within a cascade, it nevertheless allows us to accurately reconstruct its structure. We use this framework to study information dynamics on the social network of Digg. Digg allows users to post and vote on stories, and easily see the stories that friends have voted on. As a story spreads on Digg through voting, it generates cascades. We extract cascades of more than 3,500 Digg stories and calculate their macroscopic and microscopic properties. We identify several trends in cascade dynamics: spreading via chaining, branching and community. We discuss how these affect the spread of the story through the Digg social network. Our computational framework is general and offers a practical solution to quantitative analysis of the microscopic structure of even very large cascades.

[1]  Lada A. Adamic,et al.  Information flow in social groups , 2003, cond-mat/0305305.

[2]  Ravi Kumar,et al.  Influence and correlation in social networks , 2008, KDD.

[3]  Jure Leskovec,et al.  Patterns of Influence in a Recommendation Network , 2006, PAKDD.

[4]  Kristina Lerman,et al.  Centrality metric for dynamic networks , 2010, MLG '10.

[5]  Mark E. J. Newman,et al.  Power-Law Distributions in Empirical Data , 2007, SIAM Rev..

[6]  N. Ling The Mathematical Theory of Infectious Diseases and its applications , 1978 .

[7]  William J. Reed,et al.  The Double Pareto-Lognormal Distribution—A New Parametric Model for Size Distributions , 2004, WWW 2001.

[8]  Maksim Kitsak,et al.  Identifying influential spreaders in complex networks , 2010 .

[9]  Duncan J Watts,et al.  A simple model of global cascades on random networks , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Songqing Chen,et al.  Analyzing patterns of user content generation in online social networks , 2009, KDD.

[11]  Christos Faloutsos,et al.  Cascading Behavior in Large Blog Graphs , 2007 .

[12]  Jure Leskovec,et al.  Inferring networks of diffusion and influence , 2010, KDD.

[13]  Ramanathan V. Guha,et al.  Information diffusion through blogspace , 2004, WWW '04.

[14]  Didier Sornette,et al.  Viral, Quality, and Junk Videos on YouTube: Separating Content from Noise in an Information-Rich Environment , 2008, AAAI Spring Symposium: Social Information Processing.

[15]  Peter Sheridan Dodds,et al.  Universal behavior in a generalized model of contagion. , 2004, Physical review letters.

[16]  Lev Muchnik,et al.  Identifying influential spreaders in complex networks , 2010, 1001.5285.

[17]  Eyton,et al.  The Diffusion of Innovations in Social Networks , 2002 .

[18]  Kristina Lerman,et al.  Analysis of social voting patterns on digg , 2008, WOSN '08.

[19]  Jure Leskovec,et al.  Meme-tracking and the dynamics of the news cycle , 2009, KDD.

[20]  E. Rogers,et al.  Diffusion of innovations , 1964, Encyclopedia of Sport Management.

[21]  Munmun De Choudhury,et al.  "Birds of a Feather": Does User Homophily Impact Information Diffusion in Social Media? , 2010, ArXiv.

[22]  Lada A. Adamic,et al.  Social influence and the diffusion of user-created content , 2009, EC '09.

[23]  Matthew Richardson,et al.  Mining the network value of customers , 2001, KDD '01.

[24]  Kristina Lerman,et al.  Information Contagion: An Empirical Study of the Spread of News on Digg and Twitter Social Networks , 2010, ICWSM.

[25]  Alexander Grey,et al.  The Mathematical Theory of Infectious Diseases and Its Applications , 1977 .