Measuring the Evolutionary Rewiring of Biological Networks

We have accumulated a large amount of biological network data and expect even more to come. Soon, we anticipate being able to compare many different biological networks as we commonly do for molecular sequences. It has long been believed that many of these networks change, or “rewire”, at different rates. It is therefore important to develop a framework to quantify the differences between networks in a unified fashion. We developed such a formalism based on analogy to simple models of sequence evolution, and used it to conduct a systematic study of network rewiring on all the currently available biological networks. We found that, similar to sequences, biological networks show a decreased rate of change at large time divergences, because of saturation in potential substitutions. However, different types of biological networks consistently rewire at different rates. Using comparative genomics and proteomics data, we found a consistent ordering of the rewiring rates: transcription regulatory, phosphorylation regulatory, genetic interaction, miRNA regulatory, protein interaction, and metabolic pathway network, from fast to slow. This ordering was found in all comparisons we did of matched networks between organisms. To gain further intuition on network rewiring, we compared our observed rewirings with those obtained from simulation. We also investigated how readily our formalism could be mapped to other network contexts; in particular, we showed how it could be applied to analyze changes in a range of “commonplace” networks such as family trees, co-authorships and linux-kernel function dependencies.

[1]  C. Burge,et al.  Conserved Seed Pairing, Often Flanked by Adenosines, Indicates that Thousands of Human Genes are MicroRNA Targets , 2005, Cell.

[2]  Anton J. Enright,et al.  Human MicroRNA Targets , 2004, PLoS biology.

[3]  James R. Knight,et al.  A comprehensive analysis of protein–protein interactions in Saccharomyces cerevisiae , 2000, Nature.

[4]  Michele Caselle,et al.  Ordered structure of the transcription network inherited from the yeast whole-genome duplication , 2010, BMC Systems Biology.

[5]  Bruce Stillman,et al.  Deciphering Protein Kinase Specificity through Large-scale Analysis of Materials Supplemental Deciphering Protein Kinase Specificity through Large-scale Analysis of Yeast Phosphorylation Site Motifs , 2010 .

[6]  G. Glazko,et al.  Origin of a substantial fraction of human regulatory sequences from transposable elements. , 2003, Trends in genetics : TIG.

[7]  Manolis Kellis,et al.  The evolutionary dynamics of the Saccharomyces cerevisiae protein interaction network after duplication , 2008, Proceedings of the National Academy of Sciences.

[8]  Chad A. Cowan,et al.  Rewirable gene regulatory networks in the preimplantation embryonic development of three mammalian species. , 2010, Genome research.

[9]  M. Gerstein,et al.  Global analysis of protein phosphorylation in yeast , 2005, Nature.

[10]  Jodie J. Yin,et al.  A comprehensive evolutionary classification of proteins encoded in complete eukaryotic genomes , 2004, Genome Biology.

[11]  K. Gunsalus,et al.  Combinatorial microRNA target predictions , 2005, Nature Genetics.

[12]  D. Durocher,et al.  Significant conservation of synthetic lethal genetic interaction networks between distantly related eukaryotes , 2008, Proceedings of the National Academy of Sciences.

[13]  James R. Knight,et al.  A Protein Interaction Map of Drosophila melanogaster , 2003, Science.

[14]  M. King,et al.  Evolution at two levels in humans and chimpanzees. , 1975, Science.

[15]  C. Woese,et al.  Conservation of primary structure in 16S ribosomal RNA , 1975, Nature.

[16]  Philip M. Kim,et al.  Relating Three-Dimensional Structures to Protein Networks Provides Evolutionary Insights , 2006, Science.

[17]  Mike Tyers,et al.  BioGRID: a general repository for interaction datasets , 2005, Nucleic Acids Res..

[18]  Robert D. Finn,et al.  iPfam: visualization of protein?Cprotein interactions in PDB at domain and amino acid resolutions , 2005, Bioinform..

[19]  W. Lim,et al.  Evolution of Phosphoregulation: Comparison of Phosphorylation Patterns across Yeast Species , 2009, PLoS biology.

[20]  Frances H Arnold,et al.  Structural determinants of the rate of protein evolution in yeast. , 2006, Molecular biology and evolution.

[21]  Janet M. Thornton,et al.  Microeconomic Principles Explain an Optimal Genome Size in Bacteria , 2004, Spanish Bioinformatics Conference.

[22]  P. Bork,et al.  Functional organization of the yeast proteome by systematic analysis of protein complexes , 2002, Nature.

[23]  G. Conant Rapid reorganization of the transcriptional regulatory network after genome duplication in yeast , 2010, Proceedings of the Royal Society B: Biological Sciences.

[24]  Sven Bergmann,et al.  Rewiring of the Yeast Transcriptional Network Through the Evolution of Motif Usage , 2005, Science.

[25]  B. Birren,et al.  Sequencing and comparison of yeast species to identify genes and regulatory elements , 2003, Nature.

[26]  Berend Snel,et al.  Protein Complex Evolution Does Not Involve Extensive Network Rewiring , 2008, PLoS Comput. Biol..

[27]  Mark Gerstein,et al.  Divergence of transcription factor binding sites across related yeast species. , 2007, Science.

[28]  S. Teichmann,et al.  Evolutionary dynamics of prokaryotic transcriptional regulatory networks. , 2006, Journal of molecular biology.

[29]  E. Davidson,et al.  Developmental gene regulatory network architecture across 500 million years of echinoderm evolution , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology Information: update , 2004, Nucleic acids research.

[31]  Janet M. Thornton,et al.  Evolutionary Models for Formation of Network Motifs and Modularity in the Saccharomyces Transcription Factor Network , 2007, PLoS Comput. Biol..

[32]  M. Gerstein,et al.  Annotation transfer between genomes: protein-protein interologs and protein-DNA regulogs. , 2004, Genome research.

[33]  Jean L. Chang,et al.  Initial sequence of the chimpanzee genome and comparison with the human genome , 2005, Nature.

[34]  M. Vidal,et al.  Identification of potential interaction networks using sequence-based searches for conserved protein-protein interactions or "interologs". , 2001, Genome research.

[35]  Jianrong Wang,et al.  A c-Myc regulatory subnetwork from human transposable element sequences. , 2009, Molecular bioSystems.

[36]  E. Liu,et al.  Evolution of the mammalian transcription factor binding repertoire via transposable elements. , 2008, Genome research.

[37]  R. Karp,et al.  From the Cover : Conserved patterns of protein interaction in multiple species , 2005 .

[38]  Stephen G Oliver,et al.  Posttranslational regulation impacts the fate of duplicated genes , 2010, Proceedings of the National Academy of Sciences.

[39]  David A. Nix,et al.  Large-Scale Turnover of Functional Transcription Factor Binding Sites in Drosophila , 2006, PLoS Comput. Biol..

[40]  Avi Pfeffer,et al.  Automatic genome-wide reconstruction of phylogenetic gene trees , 2007, ISMB/ECCB.

[41]  E. Krebs,et al.  Substrate specificity of the cyclic AMP-dependent protein kinase. , 1975, Proceedings of the National Academy of Sciences of the United States of America.

[42]  M. Lynch,et al.  The evolutionary fate and consequences of duplicate genes. , 2000, Science.

[43]  Wendell A. Lim,et al.  Correction: Evolution of Phosphoregulation: Comparison of Phosphorylation Patterns across Yeast Species , 2009, PLoS Biology.

[44]  B. Dujon,et al.  Genome evolution in yeasts , 2004, Nature.

[45]  Nicola J. Rinaldi,et al.  Transcriptional Regulatory Networks in Saccharomyces cerevisiae , 2002, Science.

[46]  E. Nimwegen Scaling Laws in the Functional Content of Genomes , 2003, physics/0307001.

[47]  Erik L. L. Sonnhammer,et al.  Inparanoid: a comprehensive database of eukaryotic orthologs , 2004, Nucleic Acids Res..

[48]  R. Ozawa,et al.  A comprehensive two-hybrid analysis to explore the yeast protein interactome , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[49]  Stijn van Dongen,et al.  miRBase: tools for microRNA genomics , 2007, Nucleic Acids Res..

[50]  Raymond K. Auerbach,et al.  Integrative Analysis of the Caenorhabditis elegans Genome by the modENCODE Project , 2010, Science.

[51]  S. L. Wong,et al.  Towards a proteome-scale map of the human protein–protein interaction network , 2005, Nature.

[52]  A Keith Dunker,et al.  Intrinsic disorder and protein function. , 2002, Biochemistry.

[53]  M. Vidal,et al.  Protein interaction mapping in C. elegans using proteins involved in vulval development. , 2000, Science.

[54]  S. Jeffery Evolution of Protein Molecules , 1979 .

[55]  Gary D Bader,et al.  Global Mapping of the Yeast Genetic Interaction Network , 2004, Science.

[56]  Nicola J. Rinaldi,et al.  Transcriptional regulatory code of a eukaryotic genome , 2004, Nature.

[57]  Ryan D. Hernandez,et al.  Natural selection on protein-coding genes in the human genome , 2005, Nature.

[58]  Michael Bader,et al.  Intracellular Serotonin Modulates Insulin Secretion from Pancreatic β-Cells by Protein Serotonylation , 2009, PLoS biology.

[59]  André Nantel,et al.  Evolutionary Tinkering with Conserved Components of a Transcriptional Regulatory Network , 2010, PLoS biology.

[60]  Mark Gerstein,et al.  An integrated system for studying residue coevolution in proteins , 2008, Bioinform..

[61]  M. Nóbrega,et al.  Scanning Human Gene Deserts for Long-Range Enhancers , 2003, Science.

[62]  Sean R. Collins,et al.  Conservation and Rewiring of Functional Modules Revealed by an Epistasis Map in Fission Yeast , 2008, Science.

[63]  R. Karp,et al.  Conserved pathways within bacteria and yeast as revealed by global protein network alignment , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[64]  Pedro Beltrão,et al.  Specificity and Evolvability in Eukaryotic Protein Interaction Networks , 2007, PLoS Comput. Biol..

[65]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology Information , 2021, Nucleic Acids Res..

[66]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[67]  Mark Gerstein,et al.  Comparing genomes to computer operating systems in terms of the topology and evolution of their regulatory control networks , 2010, Proceedings of the National Academy of Sciences.

[68]  Ariel S. Schwartz,et al.  An Atlas of Combinatorial Transcriptional Regulation in Mouse and Man , 2010, Cell.

[69]  Mark Gerstein,et al.  Open Access Methodology Article Motips: Automated Motif Analysis for Predicting Targets of Modular Protein Domains , 2022 .

[70]  G. Bourque,et al.  Transposable elements have rewired the core regulatory network of human embryonic stem cells , 2010, Nature Genetics.

[71]  T. Jukes CHAPTER 24 – Evolution of Protein Molecules , 1969 .

[72]  Susumu Goto,et al.  KEGG for representation and analysis of molecular networks involving diseases and drugs , 2009, Nucleic Acids Res..

[73]  Hao Li,et al.  The Evolution of Combinatorial Gene Regulation in Fungi , 2008, PLoS biology.

[74]  H. Lehrach,et al.  A Human Protein-Protein Interaction Network: A Resource for Annotating the Proteome , 2005, Cell.

[75]  Gary D Bader,et al.  Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry , 2002, Nature.

[76]  Michael D. Wilson,et al.  Five-Vertebrate ChIP-seq Reveals the Evolutionary Dynamics of Transcription Factor Binding , 2010, Science.

[77]  Junjun Zhang,et al.  BioMart Central Portal—unified access to biological data , 2009, Nucleic Acids Res..

[78]  T. Jukes,et al.  The neutral theory of molecular evolution. , 2000, Genetics.

[79]  R. Tjian,et al.  Transcriptional regulation in mammalian cells by sequence-specific DNA binding proteins. , 1989, Science.

[80]  Justin C. Fay,et al.  Testing the neutral theory of molecular evolution with genomic data from Drosophila , 2002, Nature.