Modeling Sparse Information Diffusion at Scale via Lazy Multivariate Hawkes Processes

Multivariate Hawkes Processes (MHPs) are an important class of temporal point processes that have enabled key advances in understanding and predicting social information systems. However, due to their complex modeling of temporal dependencies, MHPs have proven to be notoriously difficult to scale, what has limited their applications to relatively small domains. In this work, we propose a novel model and computational approach to overcome this important limitation. By exploiting a characteristic sparsity pattern in real-world diffusion processes, we show that our approach allows to compute the exact likelihood and gradients of an MHP – independently of the ambient dimensions of the underlying network. We show on synthetic and real-world datasets that our method does not only achieve state-of-the-art modeling results, but also improves runtime performance by multiple orders of magnitude on sparse event sequences. In combination with easily interpretable latent variables and influence structures, this allows us to analyze diffusion processes in networks at previously unattainable scale.

[1]  James R. Foulds,et al.  HawkesTopic: A Joint Model for Network Inference and Topic Modeling from Text-Based Cascades , 2015, ICML.

[2]  Le Song,et al.  Scalable Influence Estimation in Continuous-Time Diffusion Networks , 2013, NIPS.

[3]  Isabel Valera,et al.  Modeling the Dynamics of Learning Activity on the Web , 2017, WWW.

[4]  Le Song,et al.  Learning Social Infectivity in Sparse Low-rank Networks Using Multi-dimensional Hawkes Processes , 2013, AISTATS.

[5]  Jure Leskovec,et al.  Predicting Dynamic Embedding Trajectory in Temporal Interaction Networks , 2019, KDD.

[6]  Shigeru Shinomoto,et al.  Differences in Spiking Patterns Among Cortical Neurons , 2003, Neural Computation.

[7]  Hongyuan Zha,et al.  Learning Granger Causality for Hawkes Processes , 2016, ICML.

[8]  Tomoharu Iwata,et al.  Discovering latent influence in online social activities via shared cascade poisson processes , 2013, KDD.

[9]  David Duvenaud,et al.  Latent Ordinary Differential Equations for Irregularly-Sampled Time Series , 2019, NeurIPS.

[10]  Jorge Nocedal,et al.  A Limited Memory Algorithm for Bound Constrained Optimization , 1995, SIAM J. Sci. Comput..

[11]  Jure Leskovec,et al.  Community Interaction and Conflict on the Web , 2018, WWW.

[12]  Jure Leskovec,et al.  SEISMIC: A Self-Exciting Point Process Model for Predicting Tweet Popularity , 2015, KDD.

[13]  Le Song,et al.  Wasserstein Learning of Deep Generative Point Process Models , 2017, NIPS.

[14]  Jason Eisner,et al.  The Neural Hawkes Process: A Neurally Self-Modulating Multivariate Point Process , 2016, NIPS.

[15]  A. Hawkes Spectra of some self-exciting and mutually exciting point processes , 1971 .

[16]  Argyris Kalogeratos,et al.  Multivariate Hawkes Processes for Large-Scale Inference , 2016, AAAI.

[17]  Austin R. Benson,et al.  Neural Jump Stochastic Differential Equations , 2019, NeurIPS.

[18]  Manuel Gomez-Rodriguez,et al.  Uncovering the Spatiotemporal Patterns of Collective Social Activity , 2017, SDM.

[19]  Béla Bollobás,et al.  Directed scale-free graphs , 2003, SODA '03.

[20]  Swapnil Mishra,et al.  SIR-Hawkes: Linking Epidemic Models and Hawkes Processes to Model Diffusions in Finite Populations , 2017, WWW.

[21]  Le Song,et al.  Fake News Mitigation via Point Process Based Intervention , 2017, ICML.

[22]  Y. Ogata Space-Time Point-Process Models for Earthquake Occurrences , 1998 .

[23]  Bruce A. Desmarais,et al.  A New Database for Inferring Public Policy Innovativeness and Diffusion Networks , 2019 .

[24]  Bernhard Schölkopf,et al.  Structure and dynamics of information pathways in online media , 2012, WSDM.

[25]  Stephen J. Wright,et al.  Hogwild: A Lock-Free Approach to Parallelizing Stochastic Gradient Descent , 2011, NIPS.

[26]  T. Ozaki Maximum likelihood estimation of Hawkes' self-exciting point processes , 1979 .

[27]  Jure Leskovec,et al.  Meme-tracking and the dynamics of the news cycle , 2009, KDD.

[28]  Bernhard Schölkopf,et al.  Uncovering the Temporal Dynamics of Diffusion Networks , 2011, ICML.

[29]  David Duvenaud,et al.  Neural Ordinary Differential Equations , 2018, NeurIPS.

[30]  E. Bacry,et al.  Hawkes Processes in Finance , 2015, 1502.04592.

[31]  Emmanuel Bacry,et al.  Uncovering Causality from Multivariate Hawkes Integrated Cumulants , 2016, ICML.

[32]  Ali Taylan Cemgil,et al.  Clustering Event Streams With Low Rank Hawkes Processes , 2020, IEEE Signal Processing Letters.

[33]  Utkarsh Upadhyay,et al.  Recurrent Marked Temporal Point Processes: Embedding Event History to Vector , 2016, KDD.

[34]  William R. Softky,et al.  Comparison of discharge variability in vitro and in vivo in cat visual cortex neurons. , 1996, Journal of neurophysiology.

[35]  Alexander J. Smola,et al.  Neural Survival Recommender , 2017, WSDM.

[36]  Mihaela van der Schaar,et al.  Learning from Clinical Judgments: Semi-Markov-Modulated Marked Hawkes Processes for Risk Prognosis , 2017, ICML.

[37]  Yosihiko Ogata,et al.  On Lewis' simulation method for point processes , 1981, IEEE Trans. Inf. Theory.

[38]  Stephan Günnemann,et al.  Intensity-Free Learning of Temporal Point Processes , 2020, ICLR.

[39]  Emmanuel Bacry,et al.  Tick: a Python library for statistical learning, with a particular emphasis on time-dependent modelling , 2017, 1707.03003.

[40]  Shuang Li,et al.  COEVOLVE: A Joint Point Process Model for Information Diffusion and Network Co-evolution , 2015, NIPS.

[41]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.