Unconstrained evolution in short introns? – An analysis of genome‐wide polymorphism and divergence data from Drosophila

An unconstrained reference sequence facilitates the detection of selection. In Drosophila, sequence variation in short introns seems to be least influenced by selection and dominated by mutation and drift. Here, we test this with genome‐wide sequences using an African population (Malawi) of D. melanogaster and data from the related outgroup species D. simulans, D. sechellia, D. erecta and D. yakuba. The distribution of mutations deviates from equilibrium, and the content of A and T (AT) nucleotides shows an excess of variance among introns. We explain this by a complex mutational pattern: a shift in mutational bias towards AT, leading to a slight nonequilibrium in base composition and context‐dependent mutation rates, with G or C (GC) sites mutating most frequently in AT‐rich introns. By comparing the corresponding allele frequency spectra of AT‐rich vs. GC‐rich introns, we can rule out the influence of directional selection or biased gene conversion on the mutational pattern. Compared with neutral equilibrium expectations, polymorphism spectra show an excess of low frequency and a paucity of intermediate frequency variants, irrespective of the direction of mutation. Combining the information from different outgroups with the polymorphism data and using a generalized linear model, we find evidence for shared ancestral polymorphism between D. melanogaster and D. simulans, D. sechellia, arguing against a bottleneck in D. melanogaster. Generally, we find that short introns can be used as a neutral reference on a genome‐wide level, if the spatially and temporally varying mutational pattern is accounted for.

[1]  Colin N. Dewey,et al.  Genomic Variation in Natural Populations of Drosophila melanogaster , 2012, Genetics.

[2]  Claus Vogl,et al.  The allele-frequency spectrum in a decoupled Moran model with mutation, drift, and directional selection, assuming small mutation rates , 2012, Theoretical population biology.

[3]  Philipp W. Messer,et al.  Faster than Neutral Evolution of Constrained Sequences: The Complex Interplay of Mutational Biases and Weak Selection , 2011, Genome biology and evolution.

[4]  A. Roychoudhury,et al.  Sufficiency of the number of segregating sites in the limit under finite-sites mutation. , 2010, Theoretical population biology.

[5]  Anna-Sophie Fiston-Lavier,et al.  Drosophila melanogaster recombination rate calculator. , 2010, Gene.

[6]  J. Parsch,et al.  On the utility of short intron sequences as a reference for the detection of positive and negative selection in Drosophila. , 2010, Molecular biology and evolution.

[7]  B. Charlesworth,et al.  Patterns of DNA-Sequence Divergence Between Drosophila miranda and D. pseudoobscura , 2009, Journal of Molecular Evolution.

[8]  Andrew G Clark,et al.  Strong evidence for lineage and sequence specificity of substitution rates and patterns in Drosophila. , 2009, Molecular biology and evolution.

[9]  Michael M. Desai,et al.  The Polymorphism Frequency Spectrum of Finitely Many Sites Under Selection , 2008, Genetics.

[10]  Ruth Hershberg,et al.  Selection on codon bias. , 2008, Annual review of genetics.

[11]  Nancy F. Hansen,et al.  Accurate Whole Human Genome Sequencing using Reversible Terminator Chemistry , 2008, Nature.

[12]  Hong Li,et al.  The Correlation Between Recombination Rate and Dinucleotide Bias in Drosophila melanogaster , 2008, Journal of Molecular Evolution.

[13]  P. Andolfatto,et al.  Positive and negative selection on noncoding DNA in Drosophila simulans. , 2008, Molecular biology and evolution.

[14]  G. Achaz Testing for Neutrality in Samples With Sequencing Errors , 2008, Genetics.

[15]  B. Charlesworth,et al.  Non-neutral processes drive the nucleotide composition of non-coding sequences in Drosophila , 2008, Biology Letters.

[16]  C. Schlötterer,et al.  African Drosophila melanogaster and D. simulans Populations Have Similar Levels of Sequence Variability, Suggesting Comparable Effective Population Sizes , 2008, Genetics.

[17]  Colin N. Dewey,et al.  Population Genomics: Whole-Genome Analysis of Polymorphism and Divergence in Drosophila simulans , 2007, PLoS biology.

[18]  Ryan D. Hernandez,et al.  Context-dependent mutation rates may cause spurious signatures of a fixation bias favoring higher GC-content in humans. , 2007, Molecular biology and evolution.

[19]  Ryan D. Hernandez,et al.  Context dependence, ancestral misidentification, and spurious signatures of natural selection. , 2007, Molecular biology and evolution.

[20]  D. Hartl,et al.  Inaugural Article: Prevalence of positive selection among nearly neutral amino acid replacements in Drosophila , 2007 .

[21]  Chenhui Zhang,et al.  Adaptive genic evolution in the Drosophila genomes , 2007, Proceedings of the National Academy of Sciences.

[22]  W. Stephan,et al.  Contrasting patterns of sequence divergence and base composition between Drosophila introns and intergenic regions , 2006, Biology Letters.

[23]  W. Stephan,et al.  Inferring the Demographic History and Rate of Adaptive Substitution in Drosophila , 2006, PLoS genetics.

[24]  D. Halligan,et al.  Ubiquitous selective constraints in the Drosophila genome revealed by a genome-wide interspecies comparison. , 2006, Genome research.

[25]  Piyush Goel,et al.  Molecular Evolution in the Drosophila melanogaster Species Subgroup: Frequent Parameter Fluctuations on the Timescale of Molecular Divergence , 2006, Genetics.

[26]  Eric Bazin,et al.  GC-Biased Segregation of Noncoding Polymorphisms in Drosophila , 2006, Genetics.

[27]  P. Andolfatto Adaptive evolution of non-coding DNA in Drosophila , 2005, Nature.

[28]  W. Stephan,et al.  Inferring the effects of demography and selection on Drosophila melanogaster populations from a chromosome-wide scan of DNA variation. , 2005, Molecular biology and evolution.

[29]  Brian Charlesworth,et al.  Patterns of intron sequence evolution in Drosophila are dependent upon length and GC content , 2005, Genome Biology.

[30]  D. Petrov,et al.  Codon Bias and Noncoding GC Content Correlate Negatively with Recombination Rate on the Drosophila X Chromosome , 2005, Journal of Molecular Evolution.

[31]  Kevin R. Thornton,et al.  Multilocus patterns of nucleotide variability and the demographic and selection history of Drosophila melanogaster populations. , 2005, Genome research.

[32]  Dmitri A Petrov,et al.  Genomic Heterogeneity of Background Substitutional Patterns in Drosophila melanogaster , 2005, Genetics.

[33]  Peter F. Arndt,et al.  Identification and Measurement of Neigbor Dependent Nucleotide Substitution Processes , 2005, German Conference on Bioinformatics.

[34]  Chung-I Wu,et al.  Inference of positive and negative selection on the 5' regulatory regions of Drosophila genes. , 2004, Molecular biology and evolution.

[35]  J. Wall,et al.  Linkage disequilibrium patterns across a recombination gradient in African Drosophila melanogaster. , 2003, Genetics.

[36]  W. Stephan,et al.  Demography and natural selection have shaped genetic variation in Drosophila melanogaster: a multi-locus approach. , 2003, Genetics.

[37]  Sudhir Kumar,et al.  Temporal patterns of fruit fly (Drosophila) evolution revealed by mutation clocks. , 2003, Molecular biology and evolution.

[38]  Adam Eyre-Walker,et al.  Adaptive protein evolution in Drosophila , 2002, Nature.

[39]  D. Hartl,et al.  Directional selection and the site-frequency spectrum. , 2001, Genetics.

[40]  K. J. Fryxell,et al.  Cytosine deamination plays a primary role in the evolution of mammalian isochores. , 2000, Molecular biology and evolution.

[41]  P. Andolfatto,et al.  A genome-wide departure from the standard neutral model in natural populations of Drosophila. , 2000, Genetics.

[42]  Justin C. Fay,et al.  Hitchhiking under positive Darwinian selection. , 2000, Genetics.

[43]  N. Takahata,et al.  Paleo-demography of the Drosophila melanogaster subgroup: application of the maximum likelihood method. , 1999, Genes & genetic systems.

[44]  K. Olsen,et al.  Phylogeographic studies in plants: problems and prospects , 1998 .

[45]  A. Clark,et al.  Neutral behavior of shared polymorphism. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[46]  H. Akashi,et al.  Molecular evolution between Drosophila melanogaster and D. simulans: reduced codon bias, faster rates of amino acid substitution, and larger proteins in D. melanogaster. , 1996, Genetics.

[47]  B. Charlesworth,et al.  The pattern of neutral molecular variation under the background selection model. , 1995, Genetics.

[48]  Y. Fu,et al.  Statistical properties of segregating sites. , 1995, Theoretical population biology.

[49]  M. Nei,et al.  Molecular phylogeny and divergence times of drosophilid species. , 1995, Molecular biology and evolution.

[50]  H. Akashi,et al.  Inferring weak selection from patterns of polymorphism and divergence at "silent" sites in Drosophila DNA. , 1995, Genetics.

[51]  H. Akashi Synonymous codon usage in Drosophila melanogaster: natural selection and translational accuracy. , 1994, Genetics.

[52]  B. Charlesworth,et al.  The effect of deleterious mutations on neutral molecular variation. , 1993, Genetics.

[53]  W. Li,et al.  Statistical tests of neutrality of mutations. , 1993, Genetics.

[54]  M. Kreitman,et al.  Adaptive protein evolution at the Adh locus in Drosophila , 1991, Nature.

[55]  F. Tajima Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. , 1989, Genetics.

[56]  M. Kimura,et al.  The neutral theory of molecular evolution. , 1983, Scientific American.

[57]  G. A. Watterson On the number of segregating sites in genetical models without recombination. , 1975, Theoretical population biology.

[58]  J. L. King,et al.  Non-Darwinian evolution. , 1969, Science.

[59]  M. Kimura Evolutionary Rate at the Molecular Level , 1968, Nature.

[60]  B. Charlesworth,et al.  Studying Patterns of Recent Evolution at Synonymous Sites and Intronic Sites in Drosophila melanogaster , 2009, Journal of Molecular Evolution.

[61]  Melanie A. Huntley,et al.  Evolution of genes and genomes on the Drosophila phylogeny , 2007, Nature.

[62]  A. Kern,et al.  Patterns of polymorphism and divergence from noncoding sequences of Drosophila melanogaster and D. simulans: evidence for nonequilibrium processes. , 2005, Molecular biology and evolution.

[63]  Adam Eyre-Walker,et al.  Mutation pressure, natural selection, and the evolution of base composition in Drosophila , 2004, Genetica.

[64]  Wolfgang Stephan,et al.  In vivo introduction of unpreferred synonymous codons into the Drosophila Adh gene results in reduced levels of ADH protein. , 2003, Genetics.

[65]  H. Akashi,et al.  Inferring the fitness effects of DNA mutations from polymorphism and divergence data: statistical power to detect directional selection under stationarity and free recombination. , 1999, Genetics.

[66]  O. T. Solbrig Demography and natural selection. , 1980 .

[67]  J. Haigh,et al.  The hitch-hiking effect of a favourable gene. , 1974, Genetical research.

[68]  S. Wright,et al.  Evolution in Mendelian Populations. , 1931, Genetics.