Genomic Determinants of Protein Evolution and Polymorphism in Arabidopsis

Recent results from Drosophila suggest that positive selection has a substantial impact on genomic patterns of polymorphism and divergence. However, species with smaller population sizes and/or stronger population structure may not be expected to exhibit Drosophila-like patterns of sequence variation. We test this prediction and identify determinants of levels of polymorphism and rates of protein evolution using genomic data from Arabidopsis thaliana and the recently sequenced Arabidopsis lyrata genome. We find that, in contrast to Drosophila, there is no negative relationship between nonsynonymous divergence and silent polymorphism at any spatial scale examined. Instead, synonymous divergence is a major predictor of silent polymorphism, which suggests variation in mutation rate as the main determinant of silent variation. Variation in rates of protein divergence is mainly correlated with gene expression level and breadth, consistent with results for a broad range of taxa, and map-based estimates of recombination rate are only weakly correlated with nonsynonymous divergence. Variation in mutation rates and the strength of purifying selection seem to be major drivers of patterns of polymorphism and divergence in Arabidopsis. Nevertheless, a model allowing for varying negative and positive selection by functional gene category explains the data better than a homogeneous model, implying the action of positive selection on a subset of genes. Genes involved in disease resistance and abiotic stress display high proportions of adaptive substitution. Our results are important for a general understanding of the determinants of rates of protein evolution and the impact of selection on patterns of polymorphism and divergence.

[1]  B. Gaut,et al.  Factors that contribute to variation in evolutionary rate among Arabidopsis genes. , 2011, Molecular biology and evolution.

[2]  L. Rieseberg,et al.  Effective population size is positively correlated with levels of adaptive divergence among annual sunflowers. , 2011, Molecular biology and evolution.

[3]  Martin J. Lercher,et al.  The Effects of Network Neighbours on Protein Evolution , 2011, PloS one.

[4]  Richard M. Clark,et al.  The Arabidopsis lyrata genome sequence and the basis of rapid genome size change , 2011, Nature Genetics.

[5]  D. Halligan,et al.  Positive and negative selection on noncoding DNA close to protein-coding genes in wild house mice. , 2011, Molecular biology and evolution.

[6]  Joy Bergelson,et al.  Association mapping of local climate-sensitive quantitative trait loci in Arabidopsis thaliana , 2010, Proceedings of the National Academy of Sciences.

[7]  Laurent Duret,et al.  Detecting positive selection within genomes: the problem of biased gene conversion , 2010, Philosophical Transactions of the Royal Society B: Biological Sciences.

[8]  S. Wright,et al.  Genome-wide evidence for efficient positive and purifying selection in Capsella grandiflora, a plant species with a large effective population size. , 2010, Molecular biology and evolution.

[9]  A. Eyre-Walker,et al.  Genome wide analyses reveal little evidence for adaptive evolution in many plant species. , 2010, Molecular biology and evolution.

[10]  P. Ingvarsson Natural selection on synonymous and nonsynonymous mutations shapes patterns of polymorphism in Populus tremula. , 2010, Molecular biology and evolution.

[11]  B. Charlesworth,et al.  Elements of Evolutionary Genetics , 2010 .

[12]  J. Welch,et al.  Quantifying Adaptive Evolution in the Drosophila Immune System , 2009, PLoS genetics.

[13]  P. Keightley,et al.  Estimating the rate of adaptive molecular evolution in the presence of slightly deleterious mutations and population size change. , 2009, Molecular biology and evolution.

[14]  D. Petrov,et al.  Pervasive Natural Selection in the Drosophila Genome? , 2009, PLoS genetics.

[15]  Richard M. Clark,et al.  Sequencing of natural strains of Arabidopsis thaliana with short reads. , 2008, Genome research.

[16]  P. Andolfatto,et al.  The Impact of Natural Selection on the Genome: Emerging Patterns in Drosophila and Arabidopsis , 2008 .

[17]  P. Andolfatto,et al.  Positive and negative selection on noncoding DNA in Drosophila simulans. , 2008, Molecular biology and evolution.

[18]  Claus O. Wilke,et al.  Mistranslation-Induced Protein Misfolding as a Dominant Constraint on Coding-Sequence Evolution , 2008, Cell.

[19]  M. Nordborg,et al.  Selection on amino acid substitutions in Arabidopsis. , 2008, Molecular biology and evolution.

[20]  G. Coop,et al.  No effect of recombination on the efficacy of natural selection in primates. , 2008, Genome research.

[21]  Alex Wong,et al.  Evolution of protein-coding genes in Drosophila. , 2008, Trends in genetics : TIG.

[22]  D. Petrov,et al.  Genomewide Spatial Correspondence Between Nonsynonymous Divergence and Neutral Polymorphism Reveals Extensive Adaptation in Drosophila , 2007, Genetics.

[23]  Peter Andolfatto,et al.  Hitchhiking effects of recurrent beneficial amino acid substitutions in the Drosophila melanogaster genome. , 2007, Genome research.

[24]  Colin N. Dewey,et al.  Population Genomics: Whole-Genome Analysis of Polymorphism and Divergence in Drosophila simulans , 2007, PLoS biology.

[25]  Ziheng Yang PAML 4: phylogenetic analysis by maximum likelihood. , 2007, Molecular biology and evolution.

[26]  Richard M. Clark,et al.  Common Sequence Polymorphisms Shaping Genetic Diversity in Arabidopsis thaliana , 2007, Science.

[27]  Pär K Ingvarsson,et al.  Gene expression and protein length influence codon usage and rates of sequence evolution in Populus tremula. , 2006, Molecular biology and evolution.

[28]  A. Eyre-Walker,et al.  Synonymous codon usage in Escherichia coli: selection for translational accuracy. , 2006, Molecular biology and evolution.

[29]  Eduardo P C Rocha,et al.  The quest for the universals of protein evolution. , 2006, Trends in genetics : TIG.

[30]  John J Welch,et al.  Estimating the Genomewide Rate of Adaptive Protein Evolution in Drosophila , 2006, Genetics.

[31]  P. Andolfatto Adaptive evolution of non-coding DNA in Drosophila , 2005, Nature.

[32]  Olaf R. P. Bininda-Emonds,et al.  transAlign: using amino acids to facilitate the multiple alignment of protein-coding DNA sequences , 2005, BMC Bioinformatics.

[33]  Stefan R. Henz,et al.  A gene expression map of Arabidopsis thaliana development , 2005, Nature Genetics.

[34]  Doron Lancet,et al.  Genome-wide midrange transcription profiles reveal expression level relationships in human tissue specification , 2005, Bioinform..

[35]  Blake C Meyers,et al.  Effects of gene expression on molecular evolution in Arabidopsis thaliana and Arabidopsis lyrata. , 2004, Molecular biology and evolution.

[36]  A. Eyre-Walker,et al.  The genomic rate of adaptive amino acid substitution in Drosophila. , 2004, Molecular biology and evolution.

[37]  B. Charlesworth,et al.  Recombination and base composition: the case of the highly self-fertilizing plant Arabidopsis thaliana , 2004, Genome Biology.

[38]  Carlos D. Bustamante,et al.  The cost of inbreeding in Arabidopsis , 2002, Nature.

[39]  Adam Eyre-Walker,et al.  Adaptive protein evolution in Drosophila , 2002, Nature.

[40]  H. Akashi,et al.  Gene expression and molecular evolution. , 2001, Current opinion in genetics & development.

[41]  D. Weinreich,et al.  Contrasting patterns of nonneutral evolution in proteins encoded in nuclear and mitochondrial genomes. , 2000, Genetics.

[42]  T. Jukes,et al.  The neutral theory of molecular evolution. , 2000, Genetics.

[43]  N. Goldman,et al.  A codon-based model of nucleotide substitution for protein-coding DNA sequences. , 1994, Molecular biology and evolution.

[44]  M. Nei,et al.  Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. , 1986, Molecular biology and evolution.

[45]  M. Kimura The Neutral Theory of Molecular Evolution: Introduction , 1983 .

[46]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[47]  Carlos D. Bustamante,et al.  Bayesian Analysis Suggests that Most Amino Acid Replacements in Drosophila Are Driven by Positive Selection , 2003, Journal of Molecular Evolution.

[48]  K. Kuma,et al.  Functional constraints against variations on molecules from the tissue level: slowly evolving brain-specific genes demonstrated by protein kinase and immunoglobulin supergene families. , 1995, Molecular biology and evolution.