Online Community Detection for Event Streams on Networks

A common goal in network modeling is to uncover the latent community structure present among nodes. For many real-world networks, observed connections consist of events arriving as streams, which are then aggregated to form edges, ignoring the temporal dynamic component. A natural way to take account of this temporal dynamic component of interactions is to use point processes as the foundation of the network models for community detection. Computational complexity hampers the scalability of such approaches to large sparse networks. To circumvent this challenge, we propose a fast online variational inference algorithm for learning the community structure underlying dynamic event arrivals on a network using continuous-time point process latent network models. We provide regret bounds on the loss function of this procedure, giving theoretical guarantees on performance. The proposed algorithm is illustrated, using both simulation studies and real data, to have comparable performance in terms of community structure in terms of community recovery to non-online variants. Our proposed framework can also be readily modified to incorporate other popular network structures.

[1]  Bruce A. Desmarais,et al.  Temporal Exponential Random Graph Models with btergm: Estimation and Bootstrap Confidence Intervals , 2018 .

[2]  Yuguo Chen,et al.  Latent Space Models for Dynamic Networks , 2015, 2005.08808.

[3]  Purnamrita Sarkar,et al.  Mean Field for the Stochastic Blockmodel: Optimization Landscape and Convergence Issues , 2018, NeurIPS.

[4]  Swapnil Mishra,et al.  A Tutorial on Hawkes Processes for Events in Social Media , 2017, ArXiv.

[5]  Yee Whye Teh,et al.  Modelling sparsity, heterogeneity, reciprocity and community structure in temporal interaction data , 2018, NeurIPS.

[6]  Haochen Xu,et al.  Network Group Hawkes Process Model , 2020 .

[7]  Santo Fortunato,et al.  Community detection in networks: A user guide , 2016, ArXiv.

[8]  T. Snijders,et al.  Estimation and Prediction for Stochastic Blockstructures , 2001 .

[9]  Ji Zhu,et al.  Consistency of community detection in networks under degree-corrected stochastic block models , 2011, 1110.3854.

[10]  Niao He,et al.  Online Learning for Multivariate Hawkes Processes , 2017, NIPS.

[11]  Jure Leskovec,et al.  {SNAP Datasets}: {Stanford} Large Network Dataset Collection , 2014 .

[12]  E. L. Lehmann,et al.  Theory of point estimation , 1950 .

[13]  Marianna Pensky,et al.  Spectral clustering in the dynamic stochastic block model , 2017, Electronic Journal of Statistics.

[14]  Yosihiko Ogata,et al.  Statistical Models for Earthquake Occurrences and Residual Analysis for Point Processes , 1988 .

[15]  João Gama,et al.  Predicting Taxi–Passenger Demand Using Streaming Data , 2013, IEEE Transactions on Intelligent Transportation Systems.

[16]  A. Hawkes Hawkes processes and their applications to finance: a review , 2018 .

[17]  Ulrike Goldschmidt,et al.  An Introduction To The Theory Of Point Processes , 2016 .

[18]  Albert Bifet,et al.  Sentiment Knowledge Discovery in Twitter Streaming Data , 2010, Discovery Science.

[19]  Leon Danon,et al.  Comparing community structure identification , 2005, cond-mat/0505245.

[20]  Francis R. Bach,et al.  Online Learning for Latent Dirichlet Allocation , 2010, NIPS.

[21]  Peter J. Bickel,et al.  Pseudo-likelihood methods for community detection in large sparse networks , 2012, 1207.2340.

[22]  Gen Li,et al.  Varying-coefficient models for dynamic networks , 2017, Comput. Stat. Data Anal..

[23]  A. Hawkes,et al.  A cluster process representation of a self-exciting process , 1974, Journal of Applied Probability.

[24]  A. Dassios,et al.  Exact Simulation of Hawkes Process with Exponentially Decaying Intensity , 2013 .

[25]  Peter Green,et al.  Markov chain Monte Carlo in Practice , 1996 .

[26]  Chong Wang,et al.  Stochastic variational inference , 2012, J. Mach. Learn. Res..

[27]  C. Matias,et al.  A semiparametric extension of the stochastic block model for longitudinal networks , 2015, Biometrika.