Online Sampling of Temporal Networks

Temporal networks representing a stream of timestamped edges are seemingly ubiquitous in the real world. However, the massive size and continuous nature of these networks make them fundamentally challenging to analyze and leverage for descriptive and predictive modeling tasks. In this work, we propose a general framework for temporal network sampling with unbiased estimation. We develop online, single-pass sampling algorithms, and unbiased estimators for temporal network sampling. The proposed algorithms enable fast, accurate, and memory-efficient statistical estimation of temporal network patterns and properties. In addition, we propose a temporally decaying sampling algorithm with unbiased estimators for studying networks that evolve in continuous time, where the strength of links is a function of time, and the motif patterns are temporally weighted. In contrast to the prior notion of a △ t-temporal motif, the proposed formulation and algorithms for counting temporally weighted motifs are useful for forecasting tasks in networks such as predicting future links, or a future time-series variable of nodes and links. Finally, extensive experiments on a variety of temporal networks from different domains demonstrate the effectiveness of the proposed algorithms. A detailed ablation study is provided to understand the impact of the various components of the proposed framework.

[1]  P. Holme,et al.  Predicting and controlling infectious disease epidemics using temporal networks , 2013, F1000prime reports.

[2]  George Varghese,et al.  New directions in traffic measurement and accounting: Focusing on the elephants, ignoring the mice , 2003, TOCS.

[3]  Yongsub Lim,et al.  MASCOT: Memory-efficient and Accurate Sampling for Counting Local Triangles in Graph Streams , 2015, KDD.

[4]  Jeffrey Scott Vitter,et al.  Random sampling with a reservoir , 1985, TOMS.

[5]  Ryan A. Rossi,et al.  Time-Evolving Relational Classification and Ensemble Methods , 2012, PAKDD.

[6]  Ryan A. Rossi,et al.  Modeling dynamic behavior in large evolving graphs , 2013, WSDM.

[7]  Lorenzo De Stefani,et al.  TRIÈST: Counting Local and Global Triangles in Fully-Dynamic Streams with Fixed Memory Size , 2016, KDD.

[8]  Érick S. Florentino,et al.  An edge creation history retrieval based method to predict links in social networks , 2020, Knowl. Based Syst..

[9]  Laks V. S. Lakshmanan,et al.  Information and Influence Propagation in Social Networks , 2013, Synthesis Lectures on Data Management.

[10]  Ryan A. Rossi,et al.  A Structural Graph Representation Learning Framework , 2020, WSDM.

[11]  Christos Faloutsos,et al.  Sampling from large graphs , 2006, KDD '06.

[12]  Luis E C Rocha,et al.  Dynamics of Air Transport Networks: A Review from a Complex Systems Perspective , 2016, 1605.04872.

[13]  Albert-László Barabási,et al.  The origin of bursts and heavy tails in human dynamics , 2005, Nature.

[14]  Ryan A. Rossi,et al.  Continuous-Time Dynamic Network Embeddings , 2018, WWW.

[15]  Ryan A. Rossi,et al.  The Network Data Repository with Interactive Graph Analytics and Visualization , 2015, AAAI.

[16]  Philip S. Yu,et al.  Outlier detection in graph streams , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[17]  Ramana Rao Kompella,et al.  Time-based sampling of social network activity graphs , 2010, MLG '10.

[18]  Miguel Romance,et al.  On eigenvector-like centralities for temporal networks: Discrete vs. continuous time scales , 2018, J. Comput. Appl. Math..

[19]  Ali Pinar,et al.  A space efficient streaming algorithm for triangle counting using the birthday paradox , 2012, KDD.

[20]  Yossi Matias,et al.  DIMACS Series in Discrete Mathematicsand Theoretical Computer Science Synopsis Data Structures for Massive Data , 2007 .

[21]  Yongsub Lim,et al.  Memory-Efficient and Accurate Sampling for Counting Local Triangles in Graph Streams , 2018, ACM Trans. Knowl. Discov. Data.

[22]  Lawrence B. Holder,et al.  StreamWorks: a system for dynamic graph search , 2013, SIGMOD '13.

[23]  Enrique Herrera-Viedma,et al.  An incremental method to detect communities in dynamic evolving social networks , 2019, Knowl. Based Syst..

[24]  Ryan A. Rossi,et al.  Efficient Graphlet Counting for Large Networks , 2015, 2015 IEEE International Conference on Data Mining.

[25]  Jing Li,et al.  Exploiting Structural and Temporal Evolution in Dynamic Link Prediction , 2018, CIKM.

[26]  Tina Eliassi-Rad,et al.  Understanding the limitations of network online learning , 2020, Applied Network Science.

[27]  Ryutaro Ichise,et al.  Time Score: A New Feature for Link Prediction in Social Networks , 2012, IEICE Trans. Inf. Syst..

[28]  Jure Leskovec,et al.  Higher-order organization of complex networks , 2016, Science.

[29]  Ryan A. Rossi,et al.  On Sampling from Massive Graph Streams , 2017, Proc. VLDB Endow..

[30]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[31]  Philip S. Yu,et al.  On Supervised Change Detection in Graph Streams , 2020, SDM.

[32]  Petter Holme,et al.  Modern temporal network theory: a colloquium , 2015, The European Physical Journal B.

[33]  Ramana Rao Kompella,et al.  Graph sample and hold: a framework for big-graph analytics , 2014, KDD.

[34]  Tina Eliassi-Rad,et al.  Generating Graph Snapshots from Streaming Edge Data , 2016, WWW.

[35]  Edo Liberty,et al.  Near-Optimal Entrywise Sampling for Data Matrices , 2013, NIPS.

[36]  Stacy Williams,et al.  Dynamical clustering of exchange rates , 2009 .

[37]  Carsten Lund,et al.  Priority sampling for estimation of arbitrary subset sums , 2007, JACM.

[38]  Charu C. Aggarwal,et al.  Link prediction in graph streams , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[39]  Jennifer Neville,et al.  Temporal-Relational Classifiers for Prediction in Evolving Domains , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[40]  Ryan A. Rossi,et al.  Estimation of local subgraph counts , 2016, 2016 IEEE International Conference on Big Data (Big Data).

[41]  Jean-Pierre Eckmann,et al.  Entropy of dialogues creates coherent structures in e-mail traffic. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[42]  Esteban Moro Egido,et al.  The dynamical strength of social ties in information spreading , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[43]  Mark C. Parsons,et al.  Communicability across evolving networks. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[44]  Ryan A. Rossi,et al.  A Dynamical System for PageRank with Time-Dependent Teleportation , 2012, Internet Math..

[45]  Maurizio Porfiri,et al.  An analytical framework for the study of epidemic models on activity driven networks , 2017, J. Complex Networks.

[46]  Jennifer Neville,et al.  Modeling relationship strength in online social networks , 2010, WWW '10.

[47]  Lawrence B. Holder,et al.  Efficient frequent subgraph mining on large streaming graphs , 2019, Intell. Data Anal..

[48]  Michele Re Fiorentin,et al.  Epidemic Threshold in Continuous-Time Evolving Networks , 2017, Physical review letters.

[49]  Y.-Y. Liu,et al.  The fundamental advantages of temporal networks , 2016, Science.

[50]  Homanga Bharadhwaj,et al.  Explanations for Temporal Recommendations , 2018, KI - Künstliche Intelligenz.

[51]  Rajmonda Sulo Caceres,et al.  Temporal Scale of Processes in Dynamic Networks , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[52]  Ricardo Choren,et al.  Combining contextual, temporal and topological information for unsupervised link prediction in social networks , 2018, Knowl. Based Syst..

[53]  Van Emden Henson,et al.  An ensemble framework for detecting community changes in dynamic networks , 2017, 2017 IEEE High Performance Extreme Computing Conference (HPEC).

[54]  Jennifer Neville,et al.  Exploiting Interaction Links for Node Classification with Deep Graph Neural Networks , 2019, IJCAI.

[55]  Graham Cormode,et al.  A second look at counting triangles in graph streams , 2014, Theor. Comput. Sci..

[56]  Klaus Nordhausen,et al.  Statistical Analysis of Network Data with R , 2015 .

[57]  Mason A. Porter,et al.  Eigenvector-Based Centrality Measures for Temporal Networks , 2015, Multiscale Model. Simul..

[58]  David Moore,et al.  A robust system for accurate real-time summaries of internet traffic , 2005, SIGMETRICS '05.

[59]  Robert D. Tortora,et al.  Sampling: Design and Analysis , 2000 .

[60]  Rui Chen,et al.  Real-Time Streaming Graph Embedding Through Local Actions , 2019, WWW.

[61]  Kun-Lung Wu,et al.  Counting and Sampling Triangles from a Graph Stream , 2013, Proc. VLDB Endow..

[62]  Ryan A. Rossi,et al.  Graphlet decomposition: framework, algorithms, and applications , 2015, Knowledge and Information Systems.

[63]  Jari Saramäki,et al.  Temporal motifs in time-dependent networks , 2011, ArXiv.

[64]  Ali Pinar,et al.  Counting triangles in real-world graph streams: Dealing with repeated edges and time windows , 2013, 2015 49th Asilomar Conference on Signals, Systems and Computers.

[65]  Robert Grossman,et al.  Meaningful selection of temporal resolution for dynamic networks , 2010, MLG '10.

[66]  Ryan A. Rossi,et al.  Learning Role-based Graph Embeddings , 2018, ArXiv.

[67]  Laks V. S. Lakshmanan,et al.  Learning influence probabilities in social networks , 2010, WSDM '10.

[68]  Qian Zhang,et al.  Link prediction of time-evolving network based on node ranking , 2020, Knowl. Based Syst..

[69]  Sedigheh Mahdavi,et al.  Dynamic Joint Variational Graph Autoencoders , 2019, PKDD/ECML Workshops.

[70]  Guisheng Yin,et al.  Link prediction in dynamic networks based on the attraction force between nodes , 2019, Knowl. Based Syst..

[71]  Xin Yang,et al.  Influential User Subscription on Time-Decaying Social Streams , 2018, ArXiv.

[72]  Ramana Rao Kompella,et al.  Network Sampling: From Static to Streaming Graphs , 2012, TKDD.

[73]  Divesh Srivastava,et al.  Forward Decay: A Practical Time Decay Model for Streaming Systems , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[74]  Danai Koutra,et al.  On Proximity and Structural Role-based Embeddings in Networks , 2020, ACM Trans. Knowl. Discov. Data.

[75]  Carsten Wiuf,et al.  Subnets of scale-free networks are not scale-free: sampling properties of networks. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[76]  Nesreen K. Ahmed,et al.  Adaptive Shrinkage Estimation for Streaming Graphs , 2020, NeurIPS.

[77]  Andrew McGregor,et al.  Catching the Head, Tail, and Everything in Between: A Streaming Algorithm for the Degree Distribution , 2015, 2015 IEEE International Conference on Data Mining.

[78]  Jennifer Neville,et al.  Designing Size Consistent Statistics for Accurate Anomaly Detection in Dynamic Networks , 2018, ACM Trans. Knowl. Discov. Data.

[79]  Edith Cohen,et al.  Stream Sampling for Frequency Cap Statistics , 2015, KDD.

[80]  Ryan A. Rossi,et al.  Temporal Network Representation Learning , 2019, ArXiv.

[81]  Petter Holme,et al.  Impact of misinformation in temporal network epidemiology , 2017, Network Science.

[82]  Tamara G. Kolda,et al.  Temporal Link Prediction Using Matrix and Tensor Factorizations , 2010, TKDD.

[83]  Aynaz Taheri,et al.  Predictive Temporal Embedding of Dynamic Graphs , 2019, 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[84]  Ryan A. Rossi,et al.  Role Discovery in Networks , 2014, IEEE Transactions on Knowledge and Data Engineering.

[85]  Charu C. Aggarwal Extracting Real-Time Insights from Graphs and Social Streams , 2018, SIGIR.

[86]  Mark E. J. Newman,et al.  Structure and Dynamics of Networks , 2009 .

[87]  Aynaz Taheri,et al.  Learning to Represent the Evolution of Dynamic Graphs with Recurrent Models , 2019, WWW.

[88]  Charu C. Aggarwal,et al.  On biased reservoir sampling in the presence of stream evolution , 2006, VLDB.

[89]  Nesreen K. Ahmed,et al.  Sampling for Approximate Bipartite Network Projection , 2017, IJCAI.

[90]  Tiago P. Peixoto,et al.  Change points, memory and epidemic spreading in temporal networks , 2017, Scientific Reports.

[91]  Maurizio Porfiri,et al.  Continuous-Time Discrete-Distribution Theory for Activity-Driven Networks. , 2016, Physical review letters.

[92]  Jari Saramäki,et al.  Temporal Networks , 2011, Encyclopedia of Social Network Analysis and Mining.

[93]  Austin R. Benson,et al.  Sampling Methods for Counting Temporal Motifs , 2019, WSDM.

[94]  Andrew McGregor,et al.  Graph stream algorithms: a survey , 2014, SGMD.

[95]  Nathan Eagle,et al.  Persistence and periodicity in a dynamic proximity network , 2012, ArXiv.