On the molecular mechanism of GC content variation among eubacterial genomes

BackgroundAs a key parameter of genome sequence variation, the GC content of bacterial genomes has been investigated for over half a century, and many hypotheses have been put forward to explain this GC content variation and its relationship to other fundamental processes. Previously, we classified eubacteria into dnaE-based groups (the dimeric combination of DNA polymerase III alpha subunits), according to a hypothesis where GC content variation is essentially governed by genome replication and DNA repair mechanisms. Further investigation led to the discovery that two major mutator genes, polC and dnaE2, may be responsible for genomic GC content variation. Consequently, an in-depth analysis was conducted to evaluate various potential intrinsic and extrinsic factors in association with GC content variation among eubacterial genomes.ResultsMutator genes, especially those with dominant effects on the mutation spectra, are biased towards either GC or AT richness, and they alter genomic GC content in the two opposite directions. Increased bacterial genome size (or gene number) appears to rely on increased genomic GC content; however, it is unclear whether the changes are directly related to certain environmental pressures. Certain environmental and bacteriological features are related to GC content variation, but their trends are more obvious when analyzed under the dnaE-based grouping scheme. Most terrestrial, plant-associated, and nitrogen-fixing bacteria are members of the dnaE1|dnaE2 group, whereas most pathogenic or symbiotic bacteria in insects, and those dwelling in aquatic environments, are largely members of the dnaE1|polV group.ConclusionOur studies provide several lines of evidence indicating that DNA polymerase III α subunit and its isoforms participating in either replication (such as polC) or SOS mutagenesis/translesion synthesis (such as dnaE2), play dominant roles in determining GC variability. Other environmental or bacteriological factors, such as genome size, temperature, oxygen requirement, and habitat, either play subsidiary roles or rely indirectly on different mutator genes to fine-tune the GC content. These results provide a comprehensive insight into mechanisms of GC content variation and the robustness of eubacterial genomes in adapting their ever-changing environments over billions of years.ReviewersThis paper was reviewed by Nicolas Galtier, Adam Eyre-Walker, and Eugene Koonin.

[1]  E. Gavez [Critical thoughts on the terms "Mycobacterium tuberculosis", "Mycobacterium bovis" and "Mycobacterium avium" in Bergey's Manual of Determinative Bacteriology, 8th edition, 1975, Williams-Wilkins Company, Baltimore]. , 1986, Plucne bolesti : casopis Udruzenja pneumoftiziologa Jugoslavije = the journal of Yugoslav Association of Phthisiology and Pneumology.

[2]  Clifton E. Barry,et al.  DnaE2 Polymerase Contributes to In Vivo Survival and the Emergence of Drug Resistance in Mycobacterium tuberculosis , 2003, Cell.

[3]  D. Andersson,et al.  Whole-genome mutational biases in bacteria , 2008, Proceedings of the National Academy of Sciences.

[4]  J. Lobry,et al.  Relationships Between Genomic G+C Content, RNA Secondary Structures, and Optimal Growth Temperature in Prokaryotes , 1997, Journal of Molecular Evolution.

[5]  R. C. Woodruff,et al.  Mutator genes—pacemakers of evolution , 1978, Nature.

[6]  K. Konstantinidis,et al.  Trends between gene content and genome size in prokaryotic species with larger genomes. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[7]  A P Martin,et al.  Metabolic rate and directional nucleotide substitution in animal mitochondrial DNA. , 1995, Molecular biology and evolution.

[8]  G. F. Gause,et al.  Induction of Mutants with Altered DNA Composition: Effect of Ultraviolet on Bacterium paracoli 5099 , 1967, Science.

[9]  J. Wernegreen,et al.  Genome evolution in bacterial endosymbionts of insects , 2002, Nature Reviews Genetics.

[10]  D. Gatherer,et al.  Nitrogen-fixing aerobic bacteria have higher genomic GC content than non-fixing species within the same genus. , 2004, Hereditas.

[11]  F. Taddei,et al.  DNA repair systems and bacterial evolution. , 2000, Cold Spring Harbor symposia on quantitative biology.

[12]  Jian Wang,et al.  A complete sequence of the T. tengcongensis genome. , 2002, Genome research.

[13]  J. Hinds,et al.  The majority of inducible DNA repair genes in Mycobacterium tuberculosis are induced independently of RecA , 2003, Molecular microbiology.

[14]  P. Green,et al.  Transcription-associated mutational asymmetry in mammalian evolution , 2003, Nature Genetics.

[15]  Jun Yu,et al.  Comparative Analysis of Eubacterial DNA Polymerase III Alpha Subunits , 2007, Genom. Proteom. Bioinform..

[16]  D. Mindell Fundamentals of molecular evolution , 1991 .

[17]  H. Nishida,et al.  Symbiobacterium Lost Carbonic Anhydrase in the Course of Evolution , 2009, Journal of Molecular Evolution.

[18]  N. Moran,et al.  Microbial Minimalism Genome Reduction in Bacterial Pathogens , 2002, Cell.

[19]  M. Nei,et al.  MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. , 2007, Molecular biology and evolution.

[20]  Michael Doebeli,et al.  Adaptation increases the likelihood of diversification in an experimental bacterial lineage , 2008, Proceedings of the National Academy of Sciences.

[21]  L. Rand,et al.  Definition of the Mycobacterial SOS Box and Use To Identify LexA-Regulated Genes in Mycobacterium tuberculosis , 2002, Journal of bacteriology.

[22]  R. E. Buchanan,et al.  Bergey's manual of determinative bacteriology. 8th Edition. , 1974 .

[23]  T. Gojobori,et al.  The genome stability in Corynebacterium species due to lack of the recombinational repair system. , 2003, Gene.

[24]  R. Hancock,et al.  Mutator Genes Giving Rise to Decreased Antibiotic Susceptibility in Pseudomonas aeruginosa , 2008, Antimicrobial Agents and Chemotherapy.

[25]  Martin Peifer,et al.  Transcription-induced mutational strand bias and its effect on substitution rates in human genes. , 2008, Molecular biology and evolution.

[26]  D. Petrov,et al.  Evidence That Mutation Is Universally Biased towards AT in Bacteria , 2010, PLoS genetics.

[27]  Michael Doebeli,et al.  Unparallel diversification in bacterial microcosms , 2005, Proceedings of the Royal Society B: Biological Sciences.

[28]  J. Oliver,et al.  A relationship between GC content and coding-sequence length , 1996, Journal of Molecular Evolution.

[29]  Carl T. Bergstrom,et al.  The evolution of mutator genes in bacterial populations: the roles of environmental change and timing. , 2003, Genetics.

[30]  C. Yanofsky,et al.  The unusual mutagenic specificity of an E. Coli mutator gene. , 1966, Proceedings of the National Academy of Sciences of the United States of America.

[31]  N. Sueoka On the genetic basis of variation and heterogeneity of DNA base composition. , 1962, Proceedings of the National Academy of Sciences of the United States of America.

[32]  R. E. Buchanan,et al.  Bergey's Manual of Determinative Bacteriology. , 1975 .

[33]  Michael Travisano,et al.  Adaptive radiation in a heterogeneous environment , 1998, Nature.

[34]  B. Ames,et al.  Sunlight ultraviolet and bacterial DNA base ratios. , 1970, Science.

[35]  Alfonso Valencia,et al.  Reductive genome evolution in Buchnera aphidicola , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[36]  Jun Yu,et al.  GC content variability of eubacteria is governed by the pol III alpha subunit. , 2007, Biochemical and biophysical research communications.

[37]  Zhang Zhang,et al.  Compositional dynamics of guanine and cytosine content in prokaryotic genomes. , 2007, Research in microbiology.

[38]  C. Menck,et al.  An SOS-regulated operon involved in damage-inducible mutagenesis in Caulobacter crescentus , 2005, Nucleic acids research.

[39]  Zhang Zhang,et al.  Modeling compositional dynamics based on GC and purine contents of protein-coding sequences , 2010, Biology Direct.

[40]  G. Bernardi,et al.  Genomic GC level, optimal growth temperature, and genome size in prokaryotes. , 2006, Biochemical and biophysical research communications.

[41]  G. Bernardi,et al.  Correlations between genomic GC levels and optimal growth temperatures in prokaryotes , 2004, FEBS letters.

[42]  N. Saitou,et al.  The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.

[43]  Hugo Naya,et al.  Aerobiosis Increases the Genomic Guanine Plus Cytosine Content (GC%) in Prokaryotes , 2002, Journal of Molecular Evolution.

[44]  Ivan Erill,et al.  Aeons of distress: an evolutionary perspective on the bacterial SOS response. , 2007, FEMS microbiology reviews.

[45]  N. Sueoka Directional mutation pressure and neutral molecular evolution. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[46]  A. R. Merchant,et al.  High guanine–cytosine content is not an adaptation to high temperature: a comparative analysis amongst prokaryotes , 2001, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[47]  Jun Wang,et al.  Compositional gradients in Gramineae genes. , 2002, Genome research.

[48]  T. Abe,et al.  The genome of Pelotomaculum thermopropionicum reveals niche-associated evolution in anaerobic microbiota. , 2008, Genome research.

[49]  F. Hildebrand,et al.  Evidence of Selection upon Genomic GC-Content in Bacteria , 2010, PLoS genetics.

[50]  Xuhua Xia,et al.  Effects of GC Content and Mutational Pressure on the Lengths of Exons and Coding Sequences , 2003, Journal of Molecular Evolution.

[51]  M. Bulmer,et al.  Coevolution of codon usage and transfer RNA abundance , 1987, Nature.

[52]  M. Gouy,et al.  Codon usage in bacteria: correlation with gene expressivity. , 1982, Nucleic acids research.

[53]  P. Bork,et al.  Environments shape the nucleotide composition of genomes , 2005, EMBO reports.

[54]  Jun Yu,et al.  GC content variability of eubacteria is governed by the pol III α subunit , 2007 .

[55]  A. Spirin,et al.  A Correlation between the Compositions of Deoxyribonucleic and Ribonucleic Acids , 1958, Nature.

[56]  P. Sharp,et al.  Codon usage and gene expression level in Dictyostelium discoideum: highly expressed genes do 'prefer' optimal codons. , 1989, Nucleic acids research.

[57]  G. Bernardi,et al.  Codon usage and genome composition , 2005, Journal of Molecular Evolution.

[58]  M. Marinus,et al.  Escherichia coli mutator genes. , 1999, Trends in microbiology.

[59]  C. F. Menck,et al.  Genome analysis of DNA repair genes in the alpha proteobacterium Caulobacter crescentus , 2007, BMC Microbiology.

[60]  E. Cox Bacterial mutator genes and the control of spontaneous mutation. , 1976, Annual review of genetics.

[61]  J. Miller,et al.  Mutator tRNAs are encoded by the Escherichia coli mutator genes mutA and mutC: a novel pathway for mutagenesis. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[62]  T. Tanaka,et al.  High guanine plus cytosine content in the third letter of codons of an extreme thermophile. DNA sequence of the isopropylmalate dehydrogenase of Thermus thermophilus. , 1984, The Journal of biological chemistry.

[63]  Stephen J Freeland,et al.  A simple model based on mutation and selection explains trends in codon and amino-acid usage and GC composition within and across genomes , 2001, Genome Biology.