Assessing Substitution Variation Across Sites in Grass Chloroplast DNA

We assess the similarity of base substitution processes, described by empirically derived 4 × 4 matrices, using chi-square homogeneity tests. Such significance analyses allow us to assess variation in sequence evolution across sites and we apply them to matrices derived from noncoding sites in different contexts in grass chloroplast DNA. We show that there is statistically significant variation in rates and patterns of mutation among noncoding sites in different contexts and then demonstrate a similar and significant influence of context on substitutions at fourfold degenerate sites of coding regions from grass chloroplast DNA. These results show that context has the same general effect on substitution bias in coding and noncoding DNA: the A+T content of flanking bases is correlated with rate of substitution, transition bias, and GC → AT pressure, while the number of flanking pyrimidines on a single strand is correlated with a mutational bias, or skew, toward pyrimidines. Despite the similarity in general trends, however, when we compare coding and noncoding matrices we find that there is a statistically significant difference between them even when we control for context. Most noticeably, fourfold degenerate sites in coding sequences are undergoing substitution at a higher rate and there are also significant differences in the relationship between pyrimidines skew and the number of flanking pyrimidines. Possible reasons for the differences between coding and noncoding sites are discussed. Furthermore, our analysis illustrates a simple statistical way for comparing substitution processes across sites allowing us to better study variation in evolutionary processes across a genome.

[1]  Sudhir Kumar,et al.  Neutral substitutions occur at a faster rate in exons than in noncoding DNA in primate genomes. , 2003, Genome research.

[2]  D. Hartl,et al.  Evolution of noncoding and silent coding sites in the Plasmodium falciparum and Plasmodium reichenowi genomes. , 2005, Molecular biology and evolution.

[3]  Jerrold I. Davis,et al.  Phylogeny and subfamilial classification of the grasses (Poaceae) , 2001 .

[4]  A. Zharkikh Estimation of evolutionary distances between nucleotide sequences , 1994, Journal of Molecular Evolution.

[5]  M. Clegg,et al.  The Influence of Specific Neighboring Bases on Substitution Bias in Noncoding Regions of the Plant Chloroplast Genome , 1997, Journal of Molecular Evolution.

[6]  D. Graur,et al.  Inferring the Pattern of Spontaneous Mutation from the Pattern of Substitution in Unitary Pseudogenes of Mycobacterium leprae and a Comparison of Mutation Patterns Among Distantly Related Organisms , 2005, Journal of Molecular Evolution.

[7]  B. Morton,et al.  The Role of Context-Dependent Mutations in Generating Compositional and Codon Usage Bias in Grass Chloroplast DNA , 2003, Journal of Molecular Evolution.

[8]  Christopher B. Burge,et al.  DNA sequence evolution with neighbor-dependent mutation , 2001, RECOMB '02.

[9]  Brandon S Gaut,et al.  Variation in Mutation Dynamics Across the Maize Genome as a Function of Regional and Flanking Base Composition , 2006, Genetics.

[10]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[11]  B. Morton,et al.  Neighboring base composition and transversion/transition bias in a comparison of rice and maize chloroplast noncoding regions. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Zhongming Zhao,et al.  Neighboring-nucleotide effects on single nucleotide polymorphisms: a study of 2.6 million polymorphisms across the human genome. , 2002, Genome research.

[13]  Ziheng Yang,et al.  PAML: a program package for phylogenetic analysis by maximum likelihood , 1997, Comput. Appl. Biosci..

[14]  M. Bulmer,et al.  Neighboring base effects on substitution rates in pseudogenes. , 1986, Molecular biology and evolution.

[15]  P. Lio’,et al.  Models of molecular evolution and phylogeny. , 1998, Genome research.

[16]  R. Matyášek,et al.  Cytosine methylation of plastid genome in higher plants. Fact or artefact? , 2001, Plant science : an international journal of experimental plant biology.

[17]  M Krawczak,et al.  Neighboring-nucleotide effects on the rates of germ-line single-base-pair substitution in human genes. , 1998, American journal of human genetics.

[18]  P. Andolfatto Adaptive evolution of non-coding DNA in Drosophila , 2005, Nature.

[19]  J. Felsenstein Evolutionary trees from DNA sequences: A maximum likelihood approach , 2005, Journal of Molecular Evolution.