DeepDiffuse: Predicting the 'Who' and 'When' in Cascades

Cascades are an accepted model to capturing how information diffuses across social network platforms. A large body of research has been focused on dissecting the anatomy of such cascades and forecasting their progression. One recurring theme involves predicting the next stage(s) of cascades utilizing pertinent information such as the underlying social network, structural properties of nodes (e.g., degree) and (partial) histories of cascade propagation. However, such type of granular information is rarely available in practice. We study in this paper the problem of cascade prediction utilizing only two types of (coarse) information, viz. which node is infected and its corresponding infection time. We first construct several simple baselines to solve this cascade prediction problem. Then we describe the shortcomings of these methods and propose a new solution leveraging recent progress in embeddings and attention models from representation learning. We also perform an exhaustive analysis of our methods on several real world datasets. Our proposed model outperforms the baselines and several other state-of-the-art methods.

[1]  Christian Borgs,et al.  Maximizing Social Influence in Nearly Optimal Time , 2012, SODA.

[2]  Ari Rappoport,et al.  What's in a hashtag?: content based prediction of the spread of ideas in microblogging communities , 2012, WSDM '12.

[3]  Fei Wang,et al.  Cascading outbreak prediction in networks: a data-driven approach , 2013, KDD.

[4]  Yoshua Bengio,et al.  Plan, Attend, Generate: Planning for Sequence-to-Sequence Models , 2017, NIPS.

[5]  Christian Bauckhage,et al.  How Viral Are Viral Videos? , 2015, ICWSM.

[6]  Madhav V. Marathe,et al.  EpiSimdemics: An efficient algorithm for simulating the spread of infectious disease over large realistic social networks , 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.

[7]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[8]  Misha Denil,et al.  Learning Where to Attend with Deep Architectures for Image Tracking , 2011, Neural Computation.

[9]  Jia Wang,et al.  Topological Recurrent Neural Network for Diffusion Prediction , 2017, 2017 IEEE International Conference on Data Mining (ICDM).

[10]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[11]  Xiaokui Xiao,et al.  Influence Maximization in Near-Linear Time: A Martingale Approach , 2015, SIGMOD Conference.

[12]  Cheng Li,et al.  DeepCas: An End-to-end Predictor of Information Cascades , 2016, WWW.

[13]  Geoffrey E. Hinton,et al.  Learning to combine foveal glimpses with a third-order Boltzmann machine , 2010, NIPS.

[14]  E. Bacry,et al.  Market Impacts and the Life Cycle of Investors Orders , 2014, SSRN Electronic Journal.

[15]  Jure Leskovec,et al.  Inferring networks of diffusion and influence , 2010, KDD.

[16]  Jilles Vreeken,et al.  Hidden Hazards: Finding Missing Nodes in Large Graph Epidemics , 2015, SDM.

[17]  Jure Leskovec,et al.  SEISMIC: A Self-Exciting Point Process Model for Predicting Tweet Popularity , 2015, KDD.

[18]  Jure Leskovec,et al.  Can cascades be predicted? , 2014, WWW.

[19]  Le Song,et al.  Learning Networks of Heterogeneous Influence , 2012, NIPS.

[20]  Lada A. Adamic,et al.  Detecting Large Reshare Cascades in Social Networks , 2017, WWW.

[21]  Kristina Lerman,et al.  The Simple Rules of Social Contagion , 2013, Scientific Reports.

[22]  Christos Faloutsos,et al.  RSC: Mining and Modeling Temporal Activity in Social Media , 2015, KDD.

[23]  Fei Wang,et al.  From Micro to Macro: Uncovering and Predicting Information Cascading Process with Behavioral Dynamics , 2015, 2015 IEEE International Conference on Data Mining.

[24]  Sylvain Lamprier,et al.  Representation Learning for Information Diffusion through Social Networks: an Embedded Cascade Model , 2016, WSDM.

[25]  Felix Naumann,et al.  Analyzing and predicting viral tweets , 2013, WWW.

[26]  Filippo Menczer,et al.  Predicting Successful Memes Using Network and Community Structure , 2014, ICWSM.

[27]  Jure Leskovec,et al.  Meme-tracking and the dynamics of the news cycle , 2009, KDD.

[28]  Jure Leskovec,et al.  On the Convexity of Latent Social Network Inference , 2010, NIPS.

[29]  Utkarsh Upadhyay,et al.  Recurrent Marked Temporal Point Processes: Embedding Event History to Vector , 2016, KDD.

[30]  Albert-László Barabási,et al.  Modeling and Predicting Popularity Dynamics via Reinforced Poisson Processes , 2014, AAAI.

[31]  Alex Graves,et al.  Recurrent Models of Visual Attention , 2014, NIPS.

[32]  Kristina Lerman,et al.  Information Contagion: An Empirical Study of the Spread of News on Digg and Twitter Social Networks , 2010, ICWSM.

[33]  Christos Faloutsos,et al.  Rise and fall patterns of information diffusion: model and implications , 2012, KDD.