Buffering by gene duplicates: an analysis of molecular correlates and evolutionary conservation

Background

One mechanism that can account for robustness against gene knockouts or knockdowns is buffering by gene duplicates, but the extent and general correlates of this process across organisms are still a matter of debate. To reveal general trends, we provide a comprehensive comparison of gene essentiality, duplication and buffering by duplicates across seven bacteria (Mycoplasma genitalium, Bacillus subtilis, Helicobacter pylori, Haemophilus influenzae, Mycobacterium tuberculosis, Pseudomonas aeruginosa, Escherichia coli) and four eukaryotes (Saccharomyces cerevisiae (yeast), Caenorhabditis elegans (worm), Drosophila melanogaster (fly), Mus musculus (mouse)).

Results

In nine of the eleven organisms, duplicates significantly increase the chances of survival upon gene deletion (P-value ≤ 0.05), but only by up to 13%. Given that duplicates make up as much as 80% of eukaryotic genomes, this small contribution is surprising and points to dominant roles of other buffering processes, such as alternative metabolic pathways. The buffering capacity of duplicates appears to be independent of the degree of gene essentiality and tends to be higher for genes with high expression levels. For example, buffering capacity increases to 23% amongst highly expressed genes in E. coli. Sequence similarity and the number of duplicates per gene are weak predictors of a duplicate's buffering capacity. In a case study we show that buffering gene duplicates in yeast and worm are somewhat more similar in their functions than non-buffering duplicates and have increased transcriptional and translational activity.

Conclusion

In sum, the extent of gene essentiality and of buffering by duplicates is not conserved across organisms and does not correlate with the organisms' apparent complexity. This heterogeneity goes beyond what would be expected from differences in experimental approaches alone. Buffering by duplicates contributes to robustness in several organisms, but only to a small extent, and the relatively large amount of buffering by duplicates observed in yeast and worm may be largely specific to these organisms. Thus, the only common factor of buffering by duplicates between different organisms may be that it is a by-product of duplicate retention due to demands of high dosage.
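The kind of comparison summarized above can be illustrated with a minimal sketch. It assumes, as a working definition that the abstract does not spell out, that buffering capacity is the difference between the fraction of non-essential genes among duplicates and among singletons, and it uses a one-sided Fisher's exact test as a stand-in for the significance test. The function name and all counts are hypothetical and are not taken from the study.

```python
from math import comb


def buffering_summary(dup_viable, dup_essential, single_viable, single_essential):
    """Return (buffering capacity, one-sided Fisher exact p-value).

    Assumed definition (not taken from the paper): buffering capacity is
    the fraction of non-essential genes among duplicates minus the same
    fraction among singletons.
    """
    dup_total = dup_viable + dup_essential
    single_total = single_viable + single_essential
    capacity = dup_viable / dup_total - single_viable / single_total

    # One-sided Fisher's exact test: probability of seeing at least
    # `dup_viable` non-essential genes among the duplicates if
    # essentiality were independent of duplication (hypergeometric tail).
    total = dup_total + single_total
    viable_total = dup_viable + single_viable
    essential_total = total - viable_total
    upper = min(dup_total, viable_total)
    tail = sum(
        comb(viable_total, k) * comb(essential_total, dup_total - k)
        for k in range(dup_viable, upper + 1)
    )
    p_value = tail / comb(total, dup_total)
    return capacity, p_value


if __name__ == "__main__":
    # Hypothetical counts for illustration only.
    capacity, p_value = buffering_summary(90, 10, 80, 20)
    print(f"buffering capacity: {capacity:.2%}, one-sided p-value: {p_value:.4f}")
```

Under this assumed definition, a statement such as "duplicates increase chances of survival by up to 13%" corresponds to a capacity of at most 0.13, with the p-value indicating whether the excess of viable deletions among duplicates is larger than chance would explain.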
