Uncovering and predicting the dynamic process of information cascades with survival model

Cascades are ubiquitous in various network environments. Predicting these cascades is decidedly nontrivial in various important applications, such as viral marketing, epidemic prevention, and traffic management. Most previous works have focused on predicting the final cascade sizes. As cascades are dynamic processes, it is always interesting and important to predict the cascade size at any given time, or to predict the time when a cascade will reach a certain size (e.g., the threshold for an outbreak). In this paper, we unify all these tasks into a fundamental problem: cascading process prediction. That is, given the early stage of a cascade, can we predict its cumulative cascade size at any later time? For such a challenging problem, an understanding of the micromechanism that drives and generates the macrophenomena (i.e., the cascading process) is essential. Here, we introduce behavioral dynamics as the micromechanism to describe the dynamic process of an infected node’s neighbors getting infected by a cascade (i.e., one-hop sub-cascades). Through data-driven analysis, we find out the common principles and patterns lying in the behavioral dynamics and propose the novel NEtworked WEibull Regression model for modeling it. We also propose a novel method for predicting cascading processes by effectively aggregating behavioral dynamics and present a scalable solution to approximate the cascading process with a theoretical guarantee. We evaluate the proposed method extensively on a large-scale social network dataset. The results demonstrate that the proposed method can significantly outperform other state-of-the-art methods in multiple tasks including cascade size prediction, outbreak time prediction, and cascading process prediction.

[1]  Matthew Richardson,et al.  Mining the network value of customers , 2001, KDD '01.

[2]  Aristides Gionis,et al.  Opinion Maximization in Social Networks , 2013, SDM.

[3]  Bernhard Schölkopf,et al.  Structure and dynamics of information pathways in online media , 2012, WSDM.

[4]  David G. Kleinbaum,et al.  Parametric Survival Models , 2012 .

[5]  Bernhard Schölkopf,et al.  Modeling Information Propagation with Survival Theory , 2013, ICML.

[6]  Fei Wang,et al.  From Micro to Macro: Uncovering and Predicting Information Cascading Process with Behavioral Dynamics , 2015, 2015 IEEE International Conference on Data Mining.

[7]  Fei Wang,et al.  Cascading outbreak prediction in networks: a data-driven approach , 2013, KDD.

[8]  Le Song,et al.  Uncover Topic-Sensitive Information Diffusion Networks , 2013, AISTATS.

[9]  Bernhard Schölkopf,et al.  Influence Maximization in Continuous Time Diffusion Networks , 2012, ICML.

[10]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[11]  Wei Chen,et al.  Efficient influence maximization in social networks , 2009, KDD.

[12]  James G. Wiener,et al.  The Weibull Distribution: A New Method of Summarizing Survivorship Data , 1978 .

[13]  Lifeng Sun,et al.  Who should share what?: item-level social influence prediction for users and posts ranking , 2011, SIGIR.

[14]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[15]  G. Rodŕıguez,et al.  Parametric Survival Models , 2010 .

[16]  Yifei Yuan,et al.  Influence Maximization in Social Networks When Negative Opinions May Emerge and Propagate , 2011, SDM.

[17]  Jure Leskovec,et al.  On the Convexity of Latent Social Network Inference , 2010, NIPS.

[18]  Reuven Cohen,et al.  Efficient immunization strategies for computer networks and populations. , 2002, Physical review letters.

[19]  Jure Leskovec,et al.  Can cascades be predicted? , 2014, WWW.

[20]  Bernhard Schölkopf,et al.  Uncovering the Temporal Dynamics of Diffusion Networks , 2011, ICML.

[21]  D.,et al.  Regression Models and Life-Tables , 2022 .