Graphlet-orbit Transitions (GoT): A fingerprint for temporal network comparison

Given a set of temporal networks, from different domains and with different sizes, how can we compare them? Can we identify evolutionary patterns that are both (i) characteristic and (ii) meaningful? We address these challenges by introducing a novel temporal and topological network fingerprint named Graphlet-orbit Transitions (GoT). We demonstrate that GoT provides very rich and interpretable network characterizations. Our work puts forward an extension of graphlets and uses the notion of orbits to encapsulate the roles of nodes in each subgraph. We build a transition matrix that keeps track of the temporal trajectory of nodes in terms of their orbits, therefore describing their evolution. We also introduce a metric (OTA) to compare two networks when considering these matrices. Our experiments show that networks representing similar systems have characteristic orbit transitions. GoT correctly groups synthetic networks pertaining to well-known graph models more accurately than competing static and dynamic state-of-the-art approaches by over 30%. Furthermore, our tests on real-world networks show that GoT produces highly interpretable results, which we use to provide insight into characteristic orbit transitions.

[1]  Tijana Milenkovic,et al.  GraphCrunch: A tool for large network analyses , 2008, BMC Bioinformatics.

[2]  Pedro Manuel Pinto Ribeiro,et al.  Towards a faster network-centric subgraph census , 2013, 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013).

[3]  Manju Bansal,et al.  A novel method for prokaryotic promoter prediction based on DNA stability , 2005, BMC Bioinformatics.

[4]  Irene Finocchi,et al.  Counting small cliques in MapReduce , 2014, ArXiv.

[5]  S. Shen-Orr,et al.  Superfamilies of Evolved and Designed Networks , 2004, Science.

[6]  J. Köbler,et al.  The Graph Isomorphism Problem: Its Structural Complexity , 1993 .

[7]  P. Erdos,et al.  On the evolution of random graphs , 1984 .

[8]  Janez Demsar,et al.  A combinatorial approach to graphlet counting , 2014, Bioinform..

[9]  Natasa Przulj,et al.  Biological network comparison using graphlet degree distribution , 2007, Bioinform..

[10]  Fernando M. A. Silva,et al.  Parallel Subgraph Counting for Multicore Architectures , 2014, 2014 IEEE International Symposium on Parallel and Distributed Processing with Applications.

[11]  Charu C. Aggarwal,et al.  Temporally Factorized Network Modeling for Evolutionary Network Analysis , 2017, WSDM.

[12]  Brendan D. McKay,et al.  Practical graph isomorphism, II , 2013, J. Symb. Comput..

[13]  Ciro Cattuto,et al.  What's in a crowd? Analysis of face-to-face behavioral networks , 2010, Journal of theoretical biology.

[14]  S. Mangan,et al.  Structure and function of the feed-forward loop network motif , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[15]  B. Bollobás The evolution of random graphs , 1984 .

[16]  Tijana Milenkovic,et al.  Exploring the structure and function of temporal networks with dynamic graphlets , 2014, Bioinform..

[17]  Fernando M. A. Silva,et al.  Efficient Subgraph Frequency Estimation with G-Tries , 2010, WABI.

[18]  Danai Koutra,et al.  TimeCrunch: Interpretable Dynamic Graph Summarization , 2015, KDD.

[19]  Ronitt Rubinfeld,et al.  Sublinear-Time Algorithms for Counting Star Subgraphs with Applications to Join Selectivity Estimation , 2016, ArXiv.

[20]  Dongxiao Zhu,et al.  BMC Bioinformatics BioMed Central , 2005 .

[21]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[22]  Luis E C Rocha,et al.  Information dynamics shape the sexual networks of Internet-mediated prostitution , 2010, Proceedings of the National Academy of Sciences.

[23]  Krishna P. Gummadi,et al.  On the evolution of user interaction in Facebook , 2009, WOSN '09.

[24]  Ali Pinar,et al.  ESCAPE: Efficiently Counting All 5-Vertex Subgraphs , 2016, WWW.

[25]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[26]  Stephen A. Cook,et al.  The complexity of theorem-proving procedures , 1971, STOC.

[27]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[28]  Shyhtsun Felix Wu,et al.  The Evolution of Ego-Centric Triads: A Microscopic Approach toward Predicting Macroscopic Network Properties , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[29]  R. Karp,et al.  Conserved pathways within bacteria and yeast as revealed by global protein network alignment , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Christian Sohler,et al.  Counting triangles in data streams , 2006, PODS.

[31]  Jure Leskovec,et al.  Microscopic evolution of social networks , 2008, KDD.

[32]  David Oliveira Aparício,et al.  Extending the Applicability of Graphlets to Directed Networks , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[33]  Padraig Cunningham,et al.  Classifying Wikipedia articles using network motif counts and ratios , 2012, WikiSym '12.

[34]  Tijana Milenkovic,et al.  Dynamic networks reveal key players in aging , 2013, BCB.

[35]  Tin Wee Tan,et al.  In silico grouping of peptide/HLA class I complexes using structural interaction characteristics , 2007, Bioinform..

[36]  Jari Saramäki,et al.  Temporal Networks , 2011, Encyclopedia of Social Network Analysis and Mining.

[37]  Julie Fournet,et al.  Data on face-to-face contacts in an office building suggest a low-cost vaccination strategy based on community linkers , 2014, Network Science.

[38]  Fernando M. A. Silva,et al.  Comparison of Co-authorship Networks across Scientific Fields Using Motifs , 2012, 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining.

[39]  Ciro Cattuto,et al.  High-Resolution Measurements of Face-to-Face Contact Patterns in a Primary School , 2011, PloS one.

[40]  Ronitt Rubinfeld,et al.  Sublinear-Time Algorithms for Counting Star Subgraphs via Edge Sampling , 2017, Algorithmica.

[41]  Kwang-Hyun Cho,et al.  Spatiotemporal network motif reveals the biological traits of developmental gene regulatory networks in Drosophila melanogaster , 2012, BMC Systems Biology.

[42]  Jure Leskovec,et al.  Community Structure in Large Networks: Natural Cluster Sizes and the Absence of Large Well-Defined Clusters , 2008, Internet Math..

[43]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[44]  Yao Zhang,et al.  Condensing Temporal Networks using Propagation , 2017, SDM.

[45]  Kun-Lung Wu,et al.  Counting and Sampling Triangles from a Graph Stream , 2013, Proc. VLDB Endow..

[46]  Christos Faloutsos,et al.  Graph evolution: Densification and shrinking diameters , 2006, TKDD.

[47]  Cecilia Mascolo,et al.  Graph Metrics for Temporal Networks , 2013, ArXiv.

[48]  Xiaofan Wang,et al.  Evolution of a large online social network , 2009 .

[49]  Ruoming Jin,et al.  Trend Motif: A Graph Mining Approach for Analysis of Dynamic Complex Networks , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[50]  Alberto J. M. Martin,et al.  Graphlet Based Metrics for the Comparison of Gene Regulatory Networks , 2016, PloS one.

[51]  Lucas Antiqueira,et al.  Analyzing and modeling real-world phenomena with complex networks: a survey of applications , 2007, 0711.3199.

[52]  Przemyslaw Kazienko,et al.  Matching Organizational Structure and Social Network Extracted from Email Communication , 2011, BIS.

[53]  Jari Saramäki,et al.  Temporal motifs in time-dependent networks , 2011, ArXiv.

[54]  K. Selçuk Candan,et al.  How Does the Data Sampling Strategy Impact the Discovery of Information Diffusion in Social Media? , 2010, ICWSM.

[55]  Jian Su,et al.  Recognition of protein/gene names from text using an ensemble of classifiers , 2005, BMC Bioinformatics.

[56]  Fernando M. A. Silva,et al.  G-Tries: a data structure for storing and finding subgraphs , 2014, Data Mining and Knowledge Discovery.