Higher-order organization of complex networks

Resolving a network of hubs Graphs are a pervasive tool for modeling and analyzing network data throughout the sciences. Benson et al. developed an algorithmic framework for studying how complex networks are organized by higher-order connectivity patterns (see the Perspective by Pržulj and Malod-Dognin). Motifs in transportation networks reveal hubs and geographical elements not readily achievable by other methods. A motif previously suggested as important for neuronal networks is part of a “rich club” of subnetworks. Science, this issue p. 163; see also p. 123 A mathematical framework for clustering reveals organizational features of a variety of networks. Networks are a fundamental tool for understanding and modeling complex systems in physics, biology, neuroscience, engineering, and social science. Many networks are known to exhibit rich, lower-order connectivity patterns that can be captured at the level of individual nodes and edges. However, higher-order organization of complex networks—at the level of small network subgraphs—remains largely unknown. Here, we develop a generalized framework for clustering networks on the basis of higher-order connectivity patterns. This framework provides mathematical guarantees on the optimality of obtained clusters and scales to networks with billions of edges. The framework reveals higher-order organization in a number of networks, including information propagation units in neuronal networks and hub structure in transportation networks. Results show that networks exhibit rich higher-order organizational structures that are exposed by clustering based on higher-order connectivity patterns.

[1]  Jon M. Kleinberg,et al.  Overview of the 2003 KDD Cup , 2003, SKDD.

[2]  O. Sporns,et al.  Motifs in Brain Networks , 2004, PLoS biology.

[3]  Jure Leskovec,et al.  Community Structure in Large Networks: Natural Cluster Sizes and the Absence of Large Well-Defined Clusters , 2008, Internet Math..

[4]  Jonathan Cohen,et al.  Graph Twiddling in a MapReduce World , 2009, Computing in Science & Engineering.

[5]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[6]  Craig E. Wills,et al.  Proceedings of the 13th international conference on World Wide Web , 2004 .

[7]  S. Mangan,et al.  The coherent feedforward loop serves as a sign-sensitive delay element in transcription networks. , 2003, Journal of molecular biology.

[8]  Robert Cole,et al.  Computer Communications , 1982, Springer New York.

[9]  Sebastiano Vigna,et al.  UbiCrawler: a scalable fully distributed Web crawler , 2004, Softw. Pract. Exp..

[10]  Olaf Sporns,et al.  Network structure of cerebral cortex shapes functional connectivity on multiple time scales , 2007, Proceedings of the National Academy of Sciences.

[11]  Zhi-Li Zhang,et al.  Commute Times for a Directed Graph using an Asymmetric Laplacian , 2011 .

[12]  Armin Haller,et al.  Metadata-driven hypertext content publishing and styling , 2014, WWW '14 Companion.

[13]  Satu Elisa Schaeffer,et al.  Graph Clustering , 2017, Encyclopedia of Machine Learning and Data Mining.

[14]  Daniel Boley,et al.  Principal Direction Divisive Partitioning , 1998, Data Mining and Knowledge Discovery.

[15]  L. Christophorou Science , 2018, Emerging Dynamics: Science, Energy, Society and Values.

[16]  Yanlei Wu,et al.  2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2014, Beijing, China, August 17-20, 2014 , 2014, ASONAM.

[17]  Zoran Levnajic,et al.  Revealing the Hidden Language of Complex Networks , 2014, Scientific Reports.

[18]  William W. Cohen,et al.  Proceedings of the 23rd international conference on Machine learning , 2006, ICML 2008.

[19]  Audra E. Kosh,et al.  Linear Algebra and its Applications , 1992 .

[20]  Jure Leskovec,et al.  Exploiting Social Network Structure for Person-to-Person Sentiment Analysis , 2014, TACL.

[21]  Jack Dongarra,et al.  Applied Parallel Computing Industrial Computation and Optimization , 1996, Lecture Notes in Computer Science.

[22]  Marcus Kaiser,et al.  Nonoptimal Component Placement, but Short Processing Paths, due to Long-Distance Projections in Neural Systems , 2006, PLoS Comput. Biol..

[23]  J. van Leeuwen,et al.  Theoretical Computer Science , 2003, Lecture Notes in Computer Science.

[24]  U. Alon Network motifs: theory and experimental approaches , 2007, Nature Reviews Genetics.

[25]  Elisa Bertino,et al.  Proceedings of the 20th international conference on World wide web , 2011, WWW 2011.

[26]  Jure Leskovec,et al.  The dynamics of viral marketing , 2005, EC '06.

[27]  Sergio Gómez,et al.  Detecting communities of triangles in complex networks using spectral optimization , 2010, Comput. Commun..

[28]  Norishige Chiba,et al.  Arboricity and Subgraph Listing Algorithms , 1985, SIAM J. Comput..

[29]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[30]  Tomoyuki Ishida,et al.  The 28 th International Conference on Distributed Computing Systems Workshops , 2008 .

[31]  Andrzej M. Borzyszkowski,et al.  Proceedings of the 18th International Symposium on Mathematical Foundations of Computer Science , 1993 .

[32]  Jure Leskovec,et al.  Overlapping Communities Explain Core–Periphery Organization of Networks , 2014, Proceedings of the IEEE.

[33]  J. Rodri´guez On the Laplacian Eigenvalues and Metric Parameters of Hypergraphs , 2002 .

[34]  Albert-László Barabási,et al.  Internet: Diameter of the World-Wide Web , 1999, Nature.

[35]  J. A. Rodríguez,et al.  Linear and Multilinear Algebra , 2007 .

[36]  David S. Munro,et al.  In: Software-Practice and Experience , 2000 .

[37]  Luís A. Nunes Amaral,et al.  Evidence for the existence of a robust pattern of prey selection in food webs , 2007, Proceedings of the Royal Society B: Biological Sciences.

[38]  Journal of Molecular Biology , 1959, Nature.

[39]  Charu C. Aggarwal,et al.  Graph Clustering , 2010, Encyclopedia of Machine Learning and Data Mining.

[40]  Béat Hirsbrunner,et al.  2010 IEEE 30th International Conference on Distributed Computing Systems Workshops (ICDCSW 2010) : Genoa, Italy, 21 - 25 June 2010 ; [proceedings] , 2011 .

[41]  Elchanan Mossel,et al.  Spectral redemption in clustering sparse networks , 2013, Proceedings of the National Academy of Sciences.

[42]  T. Michoel,et al.  The Index-Based Subgraph Matching Algorithm (ISMA): Fast Subgraph Enumeration in Large Networks Using Optimized Search Trees , 2013, PloS one.

[43]  Michalis Vazirgiannis,et al.  Clustering and Community Detection in Directed Networks: A Survey , 2013, ArXiv.

[44]  B. Parlett The Symmetric Eigenvalue Problem , 1981 .

[45]  Martin Rosvall,et al.  Maps of random walks on complex networks reveal community structure , 2007, Proceedings of the National Academy of Sciences.

[46]  Ian Davidson,et al.  Proceedings of the 2012 SIAM International Conference on Data Mining , 2012 .

[47]  Kunle Olukotun,et al.  EmptyHeaded: A Relational Engine for Graph Processing , 2015, ACM Trans. Database Syst..

[48]  Juliana Freire,et al.  Proceedings of the 19th international conference on World wide web , 2010, WWW 2010.

[49]  Dan Suciu,et al.  Journal of the ACM , 2006 .

[50]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[51]  O. Bagasra,et al.  Proceedings of the National Academy of Sciences , 1914, Science.

[52]  Robert L. Grossman,et al.  Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining , 2005, KDD 2005.

[53]  Sebastian Wernicke,et al.  FANMOD: a tool for fast network motif detection , 2006, Bioinform..

[54]  S. Mangan,et al.  Structure and function of the feed-forward loop network motif , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[55]  Martin Rosvall,et al.  Memory in network flows and its effects on spreading dynamics and community detection , 2013, Nature Communications.

[56]  GusfieldDan Introduction to the IEEE/ACM Transactions on Computational Biology and Bioinformatics , 2004 .

[57]  Albert-László Barabási,et al.  Aggregation of topological motifs in the Escherichia coli transcriptional regulatory network , 2004, BMC Bioinformatics.

[58]  P. Holland,et al.  A Method for Detecting Structure in Sociometric Data , 1970, American Journal of Sociology.

[59]  T. Michoel,et al.  The Index-Based Subgraph Matching Algorithm with General Symmetries (ISMAGS): Exploiting Symmetry for Faster Subgraph Enumeration , 2014, PloS one.

[60]  F. Chung Laplacians and the Cheeger Inequality for Directed Graphs , 2005 .

[61]  Luca Trevisan,et al.  Multi-way spectral partitioning and higher-order cheeger inequalities , 2011, STOC '12.

[62]  Tapio Elomaa,et al.  Principles of Data Mining and Knowledge Discovery , 2002, Lecture Notes in Computer Science.

[63]  R. Rosenfeld Nature , 2009, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[64]  Moni Naor,et al.  Proceedings of the 5th conference on Innovations in theoretical computer science , 2014, ITCS 2014.

[65]  BMC Bioinformatics , 2005 .

[66]  Klaus Jansen,et al.  Experimental and Efficient Algorithms , 2003, Lecture Notes in Computer Science.

[67]  Sunita Sarawagi,et al.  Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Las Vegas, Nevada, USA, August 24-27, 2008 , 2008, KDD.

[68]  Michael I. Jordan,et al.  Advances in Neural Information Processing Systems 30 , 1995 .

[69]  E. Kandel,et al.  Proceedings of the National Academy of Sciences of the United States of America. Annual subject and author indexes. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[70]  D. Riddle C. Elegans II , 1998 .

[71]  John Ross Computing in Science , 1992 .

[72]  Junho Lee,et al.  Nictation, a dispersal behavior of the nematode Caenorhabditis elegans, is regulated by IL2 neurons , 2011, Nature Neuroscience.

[73]  Anagha Joshi,et al.  Enrichment and aggregation of topological motifs are independent organizational principles of integrated interaction networks. , 2011, Molecular bioSystems.

[74]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[75]  Meng-Fan Chang,et al.  Editorial in IEEE Transactions on Very Large Scale Integration (VLSI) Systems , 2017 .

[76]  Sebastian Wernicke,et al.  Efficient Detection of Network Motifs , 2006, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[77]  김삼묘,et al.  “Bioinformatics” 특집을 내면서 , 2000 .

[78]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[79]  Matthieu Latapy,et al.  Main-memory triangle computations for very large (sparse (power-law)) graphs , 2008, Theor. Comput. Sci..

[80]  Shuliang Wang,et al.  Data Mining and Knowledge Discovery , 2005, Mathematical Principles of the Internet.

[81]  Igor Jurisica,et al.  Modeling interactome: scale-free or geometric? , 2004, Bioinform..

[82]  VoLUME Xxxix,et al.  THE AMERICAN JOURNAL OF SOCIOLOGY , 2010 .

[83]  Dimitrios Gunopulos,et al.  Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining , 2006, KDD 2006.

[84]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[85]  I. Ial,et al.  Nature Communications , 2010, Nature Cell Biology.

[86]  Jordi Bascompte,et al.  Interaction strength combinations and the overfishing of a marine food web. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[87]  Tamara G. Kolda,et al.  Wedge sampling for computing clustering coefficients and triangle counts on large graphs † , 2013, Stat. Anal. Data Min..

[88]  Jordi Bascompte,et al.  Disentangling the Web of Life , 2009, Science.

[89]  Shashi Shekhar,et al.  Multilevel hypergraph partitioning: applications in VLSI domain , 1999, IEEE Trans. Very Large Scale Integr. Syst..

[90]  J. Ashby References and Notes , 1999 .

[91]  Acknowledgments , 2006, Molecular and Cellular Endocrinology.

[92]  Niels Taatgen,et al.  Proceedings of ICCM - 2010- Tenth International Conference on Cognitive Modeling , 2010 .

[93]  宁北芳,et al.  疟原虫var基因转换速率变化导致抗原变异[英]/Paul H, Robert P, Christodoulou Z, et al//Proc Natl Acad Sci U S A , 2005 .