Background Selection as Baseline for Nucleotide Variation across the Drosophila Genome

The constant removal of deleterious mutations by natural selection reduces neutral diversity and the efficacy of selection at genetically linked sites (a process called Background Selection, BGS). Population genetic studies, however, often ignore BGS effects when investigating demographic events or the presence of other types of selection. To obtain a more realistic evolutionary expectation that incorporates the unavoidable consequences of deleterious mutations, we generated high-resolution landscapes of variation across the Drosophila melanogaster genome under a BGS scenario independent of polymorphism data. We find that BGS plays a significant role in shaping levels of variation across the entire genome, including long introns and intergenic regions distant from annotated genes. We also find that a very large percentage of the observed variation in diversity across autosomes can be explained by BGS alone, up to 70% across individual chromosome arms at 100-kb scale, indicating that BGS predictions can be used as a baseline to infer additional types of selection and demographic events. This approach allows the detection of several outlier regions with signals of recent adaptive events and selective sweeps. The use of a BGS baseline, however, is particularly appropriate for investigating the presence of balancing selection, and our study exposes numerous genomic regions with the predicted signature of higher polymorphism than expected when a BGS context is taken into account. Importantly, we show that these conclusions are robust to the mutation and selection parameters of the BGS model. Finally, analyses of protein evolution, together with previous comparisons of genetic maps between Drosophila species, suggest temporally variable recombination landscapes and, thus, local BGS effects that may differ between extant and past phases.
Because genome-wide BGS and temporal changes in linkage effects can skew approaches to estimate demographic and selective events, future analyses should incorporate BGS predictions and capture local recombination variation across genomes and along lineages.
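
As a rough illustration of how a BGS baseline might be used in practice, the sketch below computes the fraction of variance in observed window diversity explained by a BGS prediction, and the residuals that would flag outlier windows. It is a minimal sketch under stated assumptions, not the procedure used in the study: the function names, the toy values, and the choice of a simple least-squares linear fit are all hypothetical.

```python
from statistics import mean


def _sums(predicted, observed):
    """Return the means and centered sums of squares/products of two sequences."""
    if len(predicted) != len(observed) or len(predicted) < 2:
        raise ValueError("need two equal-length sequences with at least 2 windows")
    mp, mo = mean(predicted), mean(observed)
    sxy = sum((p - mp) * (o - mo) for p, o in zip(predicted, observed))
    sxx = sum((p - mp) ** 2 for p in predicted)
    syy = sum((o - mo) ** 2 for o in observed)
    if sxx == 0 or syy == 0:
        raise ValueError("both sequences must vary across windows")
    return mp, mo, sxy, sxx, syy


def variance_explained(predicted, observed):
    """Squared Pearson correlation between predicted and observed values."""
    _, _, sxy, sxx, syy = _sums(predicted, observed)
    return (sxy * sxy) / (sxx * syy)


def baseline_residuals(predicted, observed):
    """Observed minus fitted values from a least-squares line on the prediction.

    Large positive residuals mark windows with more diversity than the
    baseline predicts; large negative residuals mark windows with less.
    """
    mp, mo, sxy, sxx, _ = _sums(predicted, observed)
    slope = sxy / sxx
    intercept = mo - slope * mp
    return [o - (intercept + slope * p) for p, o in zip(predicted, observed)]


# Hypothetical toy data: one value per genomic window (e.g. 100-kb windows).
predicted_b = [0.2, 0.4, 0.6, 0.8]        # assumed BGS prediction per window
observed_pi = [0.002, 0.004, 0.006, 0.012]  # assumed observed diversity per window

r2 = variance_explained(predicted_b, observed_pi)
resid = baseline_residuals(predicted_b, observed_pi)
```

In this framing, windows with strongly negative residuals would be candidates for recent sweeps and windows with strongly positive residuals candidates for balancing selection; any real analysis would also need a significance threshold, which is not sketched here.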
