Exploring patterns in modularity of protein interaction networks across the tree of life using Spectral Entropy

Modularity and organizational hierarchy are important concepts in understanding the 1 structure and evolution of interactions in complex biological systems. In this work, we introduce 2 and use a spectral characterization measure (Spectral Entropy) to quantify modularity in protein3 to-protein interaction (PPI) networks in species across the tree of life. We evaluated the relation 4 between the size of a PPI network and its (Spectral Entropy-based) modularity, and found a 5 sigmoidal response between the two. We also found significant differences in the distribution 6 of Spectral Entropy values among the three domains of life (Bacteria, Archaea, Eukaryotes). To 7 explore further correlations with biological traits, we focused solely on bacterial PPI networks, 8 which are the most numerous among the three domains and had associated trait metadata, and 9 investigated how modularity impacts or is impacted by growth, aerobicity, selection and location 10 on the tree of life. We found no relation between maximal growth rate and Spectral Entropy, but 11 a strong dependence between G-C content (a proxy for selection) and Spectral Entropy. We also 12 discovered that Spectral Entropy is negatively affected by phylogenetic placement (evolutionary 13 distance from the last universal common ancestor). The general nature of the Spectral Entropy 14 measure of hierarchical modularity in networks suggests that it will be useful in other settings 15 where structural properties of real-world networks are being compared. 16

[1]  E. Ott,et al.  Spectral properties of networks with community structure. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[2]  A. Barabasi,et al.  Spectra of "real-world" graphs: beyond the semicircle law. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[3]  J. Fuhrman,et al.  Estimating maximal microbial growth rates from cultures, metagenomes, and single cells via codon usage patterns , 2020, Proceedings of the National Academy of Sciences.

[4]  H. Ochman,et al.  A selective force favoring increased G+C content in bacterial genes , 2012, Proceedings of the National Academy of Sciences.

[5]  Mackenzie M. Johnson,et al.  A computational exploration of resilience and evolvability of protein–protein interaction networks , 2020, Communications Biology.

[6]  Laurent Duret,et al.  GC-Content Evolution in Bacterial Genomes: The Biased Gene Conversion Hypothesis Expands , 2014, bioRxiv.

[7]  Diogo Melo,et al.  Directional selection can drive the evolution of modularity in complex traits , 2014, Proceedings of the National Academy of Sciences.

[8]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[9]  W. Ford Doolittle,et al.  The generality of Constructive Neutral Evolution , 2018 .

[10]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[11]  S. Horvath,et al.  Genetic programs in human and mouse early embryos revealed by single-cell RNA sequencing , 2013, Nature.

[12]  G. Wagner,et al.  The road to modularity , 2007, Nature Reviews Genetics.

[13]  Somwrita Sarkar,et al.  Spectral Characterization of Hierarchical Modularity in Product Architectures. , 2014, Journal of mechanical design.

[15]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[16]  M. Fiedler A property of eigenvectors of nonnegative symmetric matrices and its application to graph theory , 1975 .

[17]  Marcus Kaiser,et al.  A tutorial in connectome analysis: Topological and spatial features of brain networks , 2011, NeuroImage.

[18]  Hao Zhang,et al.  Segmentation of 3D meshes through spectral clustering , 2004, 12th Pacific Conference on Computer Graphics and Applications, 2004. PG 2004. Proceedings..

[19]  D. Caetano-Anollés,et al.  Emergence of Hierarchical Modularity in Evolving Networks Uncovered by Phylogenomic Analysis , 2019, Evolutionary bioinformatics online.

[20]  V. Hinman,et al.  Modularity and hierarchy in biological systems: Using gene regulatory networks to understand evolutionary change. , 2021, Current topics in developmental biology.

[21]  Dan Braha,et al.  The Topology of Large-Scale Engineering Problem-Solving Networks , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[22]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[23]  Anat Kreimer,et al.  The evolution of modularity in bacterial metabolic networks , 2008, Proceedings of the National Academy of Sciences.

[24]  Scott Federhen,et al.  The NCBI Taxonomy database , 2011, Nucleic Acids Res..

[25]  S. N. Dorogovtsev,et al.  Spectra of complex networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[26]  D. Gatherer,et al.  Nitrogen-fixing aerobic bacteria have higher genomic GC content than non-fixing species within the same genus. , 2004, Hereditas.

[27]  C. Priebe,et al.  Perfect Clustering for Stochastic Blockmodel Graphs via Adjacency Spectral Embedding , 2013, 1310.0532.

[28]  Alex Pothen,et al.  PARTITIONING SPARSE MATRICES WITH EIGENVECTORS OF GRAPHS* , 1990 .

[29]  Carey E. Priebe,et al.  On a two-truths phenomenon in spectral graph clustering , 2018, Proceedings of the National Academy of Sciences.

[30]  Mark C. Field,et al.  Evolution of specificity in the eukaryotic endomembrane system. , 2009, The international journal of biochemistry & cell biology.

[31]  Somwrita Sarkar,et al.  Community detection in graphs using singular value decomposition. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[32]  F. Hildebrand,et al.  Evidence of Selection upon Genomic GC-Content in Bacteria , 2010, PLoS genetics.

[33]  H. Maki,et al.  Impact of reactive oxygen species on spontaneous mutagenesis in Escherichia coli , 2006, Genes to cells : devoted to molecular & cellular mechanisms.

[34]  Feng Luo,et al.  Modular organization of protein interaction networks , 2007, Bioinform..

[35]  Steven D. Eppinger,et al.  A Network Approach to Define Modularity of Components in Complex Products , 2007 .

[36]  PothenAlex,et al.  Partitioning sparse matrices with eigenvectors of graphs , 1990 .

[37]  Hugo Naya,et al.  Aerobiosis Increases the Genomic Guanine Plus Cytosine Content (GC%) in Prokaryotes , 2002, Journal of Molecular Evolution.

[38]  B. Ames,et al.  Sunlight ultraviolet and bacterial DNA base ratios. , 1970, Science.

[39]  Manlio De Domenico,et al.  Spectral entropies as information-theoretic tools for complex network comparison , 2016, 1609.01214.

[40]  M. Fiedler Algebraic connectivity of graphs , 1973 .

[41]  Erik Hoel,et al.  Evolution leads to emergence: An analysis of protein interactomes across the tree of life , 2020, bioRxiv.

[42]  N. Lane Energetics and genetics across the prokaryote-eukaryote divide , 2011, Biology Direct.

[43]  Nathan Crilly,et al.  From modularity to emergence: a primer on the design and science of complex systems , 2016 .

[44]  Lisa R. Moore,et al.  A synthesis of bacterial and archaeal phenotypic trait data , 2020, Scientific Data.

[45]  Jérôme Kunegis,et al.  Preferential attachment in online networks: measurement and explanations , 2013, WebSci.

[46]  R. Wood DNA repair in eukaryotes. , 1996, Annual review of biochemistry.

[47]  Jure Leskovec,et al.  Evolution of resilience in protein interactomes across the tree of life , 2018, Proceedings of the National Academy of Sciences.

[48]  J. Hopfield,et al.  From molecular to modular cell biology , 1999, Nature.

[49]  Brian C. Thomas,et al.  A new view of the tree of life , 2016, Nature Microbiology.

[50]  Hunter B. Fraser,et al.  Modularity and evolutionary constraint on proteins , 2005, Nature Genetics.

[51]  Marcus W Feldman,et al.  Evolution of Hierarchy in Bacterial Metabolic Networks , 2017, bioRxiv.

[52]  Carey E. Priebe,et al.  Community Detection and Classification in Hierarchical Stochastic Blockmodels , 2015, IEEE Transactions on Network Science and Engineering.

[53]  U. Alon Biological Networks: The Tinkerer as an Engineer , 2003, Science.

[54]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[55]  Somwrita Sarkar,et al.  Spectral Characterization of Hierarchical Network Modularity and Limits of Modularity Detection , 2013, PloS one.

[56]  Gerhard Schlosser,et al.  Modularity and the units of evolution , 2002, Theory in Biosciences.

[57]  Gary L. Miller,et al.  On the performance of spectral graph partitioning methods , 1995, SODA '95.

[58]  Lan V. Zhang,et al.  Evidence for dynamically organized modularity in the yeast protein–protein interaction network , 2004, Nature.

[59]  Carey E. Priebe,et al.  A Consistent Adjacency Spectral Embedding for Stochastic Blockmodel Graphs , 2011, 1108.2228.