Exponential decay of GC content detected by strand-symmetric substitution rates influences the evolution of isochore structure.

The distribution of guanine and cytosine nucleotides throughout a genome, or the GC content, is associated with numerous features in mammals; understanding the pattern and evolutionary history of GC content is crucial to our efforts to annotate the genome. The local GC content is decaying toward an equilibrium point, but the causes and rates of this decay, as well as the value of the equilibrium point, remain topics of debate. By comparing the results of 2 methods for estimating local substitution rates, we identify 620 Mb of the human genome in which the rates of the various types of nucleotide substitutions are the same on both strands. These strand-symmetric regions show an exponential decay of local GC content at a pace determined by local substitution rates. DNA segments subjected to higher rates experience disproportionately accelerated decay and are AT rich, whereas segments subjected to lower rates decay more slowly and are GC rich. Although we are unable to draw any conclusions about causal factors, the results support the hypothesis proposed by Khelifi A, Meunier J, Duret L, and Mouchiroud D (2006. GC content evolution of the human and mouse genomes: insights from the study of processed pseudogenes in regions of different recombination rates. J Mol Evol. 62:745-752.) that the isochore structure has been reshaped over time. If rate variation were a determining factor, then the current isochore structure of mammalian genomes could result from the local differences in substitution rates. We predict that under current conditions strand-symmetric portions of the human genome will stabilize at an average GC content of 30% (considerably less than the current 42%), thus confirming that the human genome has not yet reached equilibrium.

[1]  Stéphane Boissinot,et al.  Molecular evolution and tempo of amplification of human LINE-1 retrotransposons since the origin of primates. , 2005, Genome research.

[2]  H. Ellegren,et al.  Mutation rate variation in the mammalian genome. , 2003, Current opinion in genetics & development.

[3]  J. Lobry,et al.  Asymmetric substitution patterns: a review of possible underlying mutational or selective mechanisms. , 1999, Gene.

[4]  Wen-Hsiung Li,et al.  Are GC-rich isochores vanishing in mammals? , 2006, Gene.

[5]  Giorgio Bernardi,et al.  Inaccurate reconstruction of ancestral GC levels creates a "vanishing isochores" effect. , 2004, Molecular phylogenetics and evolution.

[6]  Christopher B. Burge,et al.  DNA sequence evolution with neighbor-dependent mutation , 2001, RECOMB '02.

[7]  A. von Haeseler,et al.  Distance measures in terms of substitution processes. , 1999, Theoretical population biology.

[8]  L. Duret,et al.  Vanishing GC-rich isochores in mammalian genomes. , 2002, Genetics.

[9]  A. Zharkikh Estimation of evolutionary distances between nucleotide sequences , 1994, Journal of Molecular Evolution.

[10]  Gregory R. Grant,et al.  Statistical Methods in Bioinformatics , 2001 .

[11]  J. Hartigan,et al.  Asynchronous distance between homologous DNA sequences. , 1987, Biometrics.

[12]  G Bernardi,et al.  The distribution of genes in the human genome. , 1991, Gene.

[13]  M. Gouy,et al.  Inferring pattern and process: maximum-likelihood implementation of a nonhomogeneous model of DNA sequence evolution for phylogenetic analysis. , 1998, Molecular biology and evolution.

[14]  P. Green,et al.  Transcription-associated mutational asymmetry in mammalian evolution , 2003, Nature Genetics.

[15]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[16]  T. Nagylaki Evolution of a finite population under gene conversion. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[17]  L. Duret,et al.  Statistical analysis of vertebrate sequences reveals that long genes are scarce in GC-rich isochores , 1995, Journal of Molecular Evolution.

[18]  L. Duret,et al.  GC Content Evolution of the Human and Mouse Genomes: Insights from the Study of Processed Pseudogenes in Regions of Different Recombination Rates , 2006, Journal of Molecular Evolution.

[19]  Francesca Chiaromonte,et al.  Strong and weak male mutation bias at different sites in the primate genomes: insights from the human-chimpanzee comparison. , 2006, Molecular biology and evolution.

[20]  C. Lobry,et al.  Evolution of DNA base composition under no-strand-bias conditions when the substitution rates are not constant. , 1999, Molecular biology and evolution.

[21]  W. Li,et al.  Bias-corrected paralinear and LogDet distances and tests of molecular clocks and phylogenies under nonstationary nucleotide frequencies. , 1996, Molecular biology and evolution.

[22]  H. Munro,et al.  Mammalian protein metabolism , 1964 .

[23]  Elizabeth M. Smigielski,et al.  dbSNP: the NCBI database of genetic variation , 2001, Nucleic Acids Res..

[24]  The distribution of genes in human genome , 2005 .

[25]  H. Kishino,et al.  Dating of the human-ape splitting by a molecular clock of mitochondrial DNA , 2005, Journal of Molecular Evolution.

[26]  Terence Hwa,et al.  Distinct changes of genomic biases in nucleotide substitution at the time of Mammalian radiation. , 2003, Molecular biology and evolution.

[27]  L. Duret,et al.  Recombination drives the evolution of GC-content in the human genome. , 2004, Molecular biology and evolution.

[28]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[29]  H. Voss,et al.  Non-parametric identification of non-linear oscillating systems , 2003 .

[30]  L. Hurst,et al.  The evolution of isochores: evidence from SNP frequency distributions. , 2002, Genetics.

[31]  D. Gudbjartsson,et al.  A high-resolution recombination map of the human genome , 2002, Nature Genetics.

[32]  Hongkai Ji,et al.  Why do human diversity levels vary at a megabase scale? , 2005, Genome research.

[33]  Hans Ellegren,et al.  Compositional evolution of noncoding DNA in the human and chimpanzee genomes. , 2003, Molecular biology and evolution.

[34]  Laurent Duret,et al.  The Decline of Isochores in Mammals: An Assessment of the GC ContentVariation Along the Mammalian Phylogeny , 2004, Journal of Molecular Evolution.

[35]  M. Antezana Mammalian GC Content Is Very Close to Mutational Equilibrium , 2005, Journal of Molecular Evolution.

[36]  Daniel J. Gaffney,et al.  The scale of mutational variation in the murid genome. , 2005, Genome research.

[37]  Laurent Duret,et al.  The GC Content of Primates and Rodents Genomes Is Not at Equilibrium: A Reply to Antezana , 2006, Journal of Molecular Evolution.

[38]  L. Duret,et al.  GC-content evolution in mammalian genomes: the biased gene conversion hypothesis. , 2001, Genetics.

[39]  Jens Timmer,et al.  Non-parametric identification of non-linear oscillating systems , 2003 .

[40]  J Timmer,et al.  Variations in substitution rate in human and mouse genomes. , 2004, Physical review letters.

[41]  Hans Ellegren,et al.  Male-driven biased gene conversion governs the evolution of base composition in human alu repeats. , 2005, Molecular biology and evolution.

[42]  Colin N. Dewey,et al.  Initial sequencing and comparative analysis of the mouse genome. , 2002 .

[43]  J G Sumner,et al.  Using the tangle: a consistent construction of phylogenetic distance matrices for quartets. , 2005, Mathematical biosciences.

[44]  Lisa M. D'Souza,et al.  Genome sequence of the Brown Norway rat yields insights into mammalian evolution , 2004, Nature.

[45]  Terence Hwa,et al.  Regional and Time-resolved Mutation Patterns of the Human Genome , 2004, German Conference on Bioinformatics.

[46]  Terence Hwa,et al.  Substantial Regional Variation in Substitution Rates in the Human Genome: Importance of GC Content, Gene Density, and Telomere-Specific Effects , 2005, Journal of Molecular Evolution.

[47]  James A. Cuff,et al.  Genome sequence, comparative analysis and haplotype structure of the domestic dog , 2005, Nature.

[48]  Alain Arneodo,et al.  Replication-associated strand asymmetries in mammalian genomes: toward detection of replication origins. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[49]  T. Jukes CHAPTER 24 – Evolution of Protein Molecules , 1969 .

[50]  Jotun Hein,et al.  Statistical Methods in Bioinformatics: An Introduction , 2002 .

[51]  G Bernardi,et al.  Isochores and the evolutionary genomics of vertebrates. , 2000, Gene.

[52]  Laurent Duret,et al.  A new perspective on isochore evolution. , 2006, Gene.

[53]  N. Sueoka Intrastrand parity rules of DNA base composition and usage biases of synonymous codons , 1995, Journal of Molecular Evolution.

[54]  A. Smit Interspersed repeats and other mementos of transposable elements in mammalian genomes. , 1999, Current opinion in genetics & development.

[55]  L. Duret,et al.  Adaptation or biased gene conversion? Extending the null hypothesis of molecular evolution. , 2007, Trends in genetics : TIG.

[56]  David Haussler,et al.  Covariation in frequencies of substitution, deletion, transposition, and recombination during eutherian evolution. , 2003, Genome research.

[57]  S. Pääbo,et al.  A neutral explanation for the correlation of diversity with recombination rates in humans. , 2003, American journal of human genetics.