Novel specificities emerge by stepwise duplication of functional modules.

A functional module can be defined as a spatially or chemically isolated set of functionally associated components that accomplishes a discrete biological process. Modularity is a key attribute of cellular systems, but the mechanisms that underlie the evolution of functional modules are largely unknown. Duplication of modules has been shown to be an efficient mechanism for the generation of functional innovation in the field of artificial intelligence, but has not been studied in biological networks. Therefore, we ask whether module duplication occurs in cellular networks. We developed a generic framework for the analysis of module duplication, and use it in a large-scale analysis of Saccharomyces cerevisiae protein complexes. Protein complexes are well defined, experimentally derived, functional modules. We observe that at least 6%-20% of the protein complexes have strong similarity to other complexes; thus a considerable fraction has evolved by duplication. Our results indicate that many complexes evolved by step-wise partial duplications. We show that duplicated complexes retain the same overall function, but have different binding specificities and regulation, revealing that duplication of these modules is associated with functional specialization.

[1]  Gary D Bader,et al.  Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry , 2002, Nature.

[2]  J. Hopfield,et al.  From molecular to modular cell biology , 1999, Nature.

[3]  S. Teichmann,et al.  Gene regulatory network growth by duplication , 2004, Nature Genetics.

[4]  C. Pál,et al.  Dosage sensitivity and the evolution of gene families in yeast , 2003, Nature.

[5]  Alexander Rives,et al.  Modular organization of cellular networks , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Stefano Nolfi,et al.  Emergence of functional modularity in robots , 1998 .

[7]  Bertrand Séraphin,et al.  Sm and Sm‐like proteins assemble in two related complexes of deep evolutionary origin , 1999, The EMBO journal.

[8]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[9]  Berend Snel,et al.  Quantifying modularity in the evolution of biomolecular systems. , 2004, Genome research.

[10]  M. Lynch,et al.  The altered evolutionary trajectories of gene duplicates. , 2004, Trends in genetics : TIG.

[11]  A. Barabasi,et al.  Hierarchical Organization of Modularity in Metabolic Networks , 2002, Science.

[12]  F. Winston,et al.  Recent advances in understanding chromatin remodeling by Swi/Snf complexes. , 2003, Current opinion in genetics & development.

[13]  A. Barabasi,et al.  Bioinformatics analysis of experimentally determined protein complexes in the yeast Saccharomyces cerevisiae. , 2003, Genome research.

[14]  F. B. Pickett,et al.  Splitting pairs: the diverging fates of duplicated genes , 2002, Nature Reviews Genetics.

[15]  J. Bonifacino,et al.  Adaptins: the final recount. , 2001, Molecular biology of the cell.

[16]  Bertrand Séraphin,et al.  A Sm‐like protein complex that participates in mRNA degradation , 2000, The EMBO journal.

[17]  C. Chothia,et al.  Structural assignments to the Mycoplasma genitalium proteins show extensive gene duplications and domain rearrangements. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[18]  Anton J. Enright,et al.  Detection of functional modules from protein interaction networks , 2003, Proteins.

[19]  S. Teichmann,et al.  Genes Encoding Subunits of Stable Complexes Are Clustered on the Yeast Chromosomes , 2004, Genetics.

[20]  A. Barabasi,et al.  Lethality and centrality in protein networks , 2001, Nature.

[21]  F. Jacob,et al.  Evolution and tinkering. , 1977, Science.

[22]  W. Pearson Rapid and sensitive sequence comparison with FASTP and FASTA. , 1990, Methods in enzymology.

[23]  Kara Dolinski,et al.  Saccharomyces Genome Database (SGD) provides tools to identify and analyze sequences from Saccharomyces cerevisiae and related sequences from other organisms , 2004, Nucleic Acids Res..

[24]  Sebastian Maurer-Stroh,et al.  Protein prenyltransferases , 2003, Genome Biology.

[25]  B. Dujon,et al.  Genome evolution in yeasts , 2004, Nature.

[26]  E. O’Shea,et al.  Global analysis of protein localization in budding yeast , 2003, Nature.

[27]  Andreas Wagner,et al.  Convergent evolution of gene circuits , 2003, Nature Genetics.

[28]  S. Munro,et al.  The Sec34/35 Golgi transport complex is related to the exocyst, defining a family of complexes involved in multiple steps of membrane traffic. , 2001, Developmental cell.

[29]  M. Lynch,et al.  The structure and early evolution of recently arisen gene duplicates in the Caenorhabditis elegans genome. , 2003, Genetics.

[30]  U. Alon Biological Networks: The Tinkerer as an Engineer , 2003, Science.

[31]  M. Lynch,et al.  The evolutionary fate and consequences of duplicate genes. , 2000, Science.

[32]  Dmitrij Frishman,et al.  MIPS: a database for genomes and protein sequences , 1999, Nucleic Acids Res..

[33]  T. F. Hansen Is modularity necessary for evolvability? Remarks on the relationship between pleiotropy and evolvability. , 2003, Bio Systems.

[34]  Anton J. Enright,et al.  BioLayout-an automatic graph layout algorithm for similarity visualization , 2001, Bioinform..

[35]  Balázs Papp,et al.  Evolution of cis-regulatory elements in duplicated genes of yeast. , 2003, Trends in genetics : TIG.

[36]  Stefano Nolfi,et al.  Duplication of Modules Facilitates the Evolution of Functional Specialization , 1999, Artificial Life.

[37]  A. Wagner The yeast protein interaction network evolves rapidly and contains few redundant duplicate genes. , 2001, Molecular biology and evolution.

[38]  K. Nakayama,et al.  Similar subunit interactions contribute to assembly of clathrin adaptor complexes and COPI complex: analysis using yeast three-hybrid system. , 2001, Biochemical and biophysical research communications.

[39]  A. Wagner,et al.  Asymmetric sequence divergence of duplicate genes. , 2003, Genome research.

[40]  K. H. Wolfe,et al.  Molecular evidence for an ancient duplication of the entire yeast genome , 1997, Nature.

[41]  Yaniv Ziv,et al.  Revealing modular organization in the yeast transcriptional network , 2002, Nature Genetics.

[42]  Cyrus Chothia,et al.  The SUPERFAMILY database in 2004: additions and improvements , 2004, Nucleic Acids Res..

[43]  P. Rahl,et al.  Conserved Structural Motifs in Intracellular Trafficking Pathways: Structure of the γCOP Appendage Domain.: Structure of the γCOP Appendage Domain. , 2003 .

[44]  R. Albert,et al.  The large-scale organization of metabolic networks , 2000, Nature.

[45]  R. Veitia,et al.  Nonlinear effects in macromolecular assembly and dosage sensitivity. , 2003, Journal of theoretical biology.

[46]  Z N Oltvai,et al.  Evolutionary conservation of motif constituents in the yeast protein interaction network , 2003, Nature Genetics.

[47]  C. Chothia,et al.  The evolution and structural anatomy of the small molecule metabolic pathways in Escherichia coli. , 2001, Journal of molecular biology.

[48]  P. Bork,et al.  Functional organization of the yeast proteome by systematic analysis of protein complexes , 2002, Nature.

[49]  B. Birren,et al.  Proof and evolutionary analysis of ancient genome duplication in the yeast Saccharomyces cerevisiae , 2004, Nature.

[50]  S. Munro,et al.  Vesicle tethering complexes in membrane traffic. , 2002, Journal of cell science.