GC-content evolution in bacterial genomes: the biased gene conversion hypothesis expands

The characterization of functional elements in genomes relies on the identification of the footprints of natural selection. In this quest, taking into account neutral evolutionary processes such as mutation and genetic drift is crucial, because these forces can generate patterns that obscure or mimic signatures of selection. In mammals, and probably in many eukaryotes, another such confounding factor, GC-biased gene conversion (gBGC), has been documented. This mechanism generates patterns identical to those expected under selection for higher GC-content, specifically in highly recombining genomic regions. Recent results have suggested that a mysterious selective force favoring higher GC-content exists in bacteria, but the possibility that it could be gBGC has been excluded. Here, we show that gBGC is probably at work in most if not all bacterial species. First, we find a consistent positive relationship between the GC-content of a gene and evidence of intragenic recombination across a broad spectrum of bacterial clades. Second, we show that the evolutionary force responsible for this pattern acts independently of selection on codon usage, and could potentially interfere with selection in favor of optimal AU-ending codons. A comparison with data from human populations shows that the intensity of gBGC in bacteria is comparable to what has been reported in mammals. We propose that gBGC is not restricted to sexual eukaryotes but is also widespread among bacteria, and could therefore be an ancestral feature of cellular organisms. We argue that if gBGC occurs in bacteria, it can account for previously unexplained observations, such as the apparent non-equilibrium of base substitution patterns and the heterogeneity of gene composition within bacterial genomes. Because gBGC produces patterns similar to positive selection, it is essential to take this process into account when studying the evolutionary forces at work in bacterial genomes.
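The relationship between gene GC-content and evidence of recombination described above can be illustrated with a minimal sketch. It is not the authors' pipeline: the toy sequences, the boolean recombination flags, and the function names `gc_content`, `gc3_content`, and `mean` are all hypothetical, and the use of third codon positions (GC3) is an assumption made here because those positions are the least constrained by the encoded protein. In a real analysis the recombination flag would come from a dedicated recombination test applied to gene alignments.

```python
def gc_content(seq):
    """Fraction of G or C among unambiguous bases (A, C, G, T)."""
    seq = seq.upper()
    gc = sum(seq.count(b) for b in "GC")
    total = gc + sum(seq.count(b) for b in "AT")
    return gc / total if total else float("nan")


def gc3_content(cds):
    """GC-content at third codon positions of an in-frame coding sequence."""
    return gc_content(cds[2::3])


def mean(values):
    """Arithmetic mean of a non-empty iterable of numbers."""
    values = list(values)
    return sum(values) / len(values)


# Hypothetical toy data: (coding sequence, was recombination detected?)
genes = [
    ("ATGGCCGCGCTGTAA", True),
    ("ATGGCGCTCGGCTGA", True),
    ("ATGAAATTTGATTAA", False),
    ("ATGACTAAAGTATAA", False),
]

# Compare mean GC3 between genes with and without evidence of recombination.
gc3_rec = mean(gc3_content(s) for s, r in genes if r)
gc3_non = mean(gc3_content(s) for s, r in genes if not r)
print(f"mean GC3, recombining: {gc3_rec:.2f}; non-recombining: {gc3_non:.2f}")
```

Under the gBGC hypothesis, one would expect the recombining set to show the higher mean GC3, as it does in this contrived example; with real data the comparison would need a proper statistical test and control for confounders such as codon usage.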
