Pervasive Hitchhiking at Coding and Regulatory Sites in Humans

Much effort has focused on assessing the importance of natural selection, particularly positive selection, in shaping the human genome. Although genome scans have identified candidate loci that may have been targets of positive selection in humans, such scans do not indicate whether adaptation is frequent in humans in general. Studies based on the reasoning of the McDonald–Kreitman test, which in principle can be used to evaluate the extent of positive selection, have suggested that adaptation is detectable in the human genome but is less common than in Drosophila or Escherichia coli. Both positive and purifying selection at functional sites should affect levels and patterns of polymorphism at linked nonfunctional sites. Here, we search for these effects by analyzing patterns of neutral polymorphism in humans in relation to recombination rate, functional density, and functional divergence from chimpanzees. We find that levels of neutral polymorphism are lower in regions of lower recombination and in regions of higher functional density or divergence. These correlations persist after controlling for variation in GC content, density of simple repeats, selective constraint, mutation rate, and depth of sequencing coverage. We argue that these results are most plausibly explained by the effects of natural selection at functional sites (either recurrent selective sweeps or background selection) on levels of linked neutral polymorphism. Natural selection at both coding and regulatory sites appears to affect linked neutral polymorphism, reducing it by 6% genome-wide and by 11% in the gene-rich half of the human genome. These findings suggest that the effects of natural selection at linked sites cannot be ignored in the study of neutral human polymorphism.
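The idea of a correlation that "persists after controlling for" a covariate can be illustrated with a partial rank correlation. The sketch below is only an illustration under stated assumptions, not the authors' pipeline: it controls for a single covariate (GC content) using the standard first-order partial correlation formula applied to ranks, and the window values, variable names, and helper functions are all hypothetical.

```python
import math

def ranks(values):
    """Return 1-based ranks; tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def partial_spearman(x, y, z):
    """Rank correlation of x and y after controlling for one covariate z."""
    rx, ry, rz = ranks(x), ranks(y), ranks(z)
    r_xy, r_xz, r_yz = pearson(rx, ry), pearson(rx, rz), pearson(ry, rz)
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))

# Hypothetical per-window values, invented purely for illustration.
recombination = [0.2, 0.5, 0.9, 1.3, 1.8, 2.4, 3.1, 3.9]          # cM/Mb
gc_content = [0.38, 0.45, 0.36, 0.48, 0.40, 0.52, 0.41, 0.50]     # fraction GC
neutral_diversity = [6.1, 7.4, 6.9, 8.6, 8.2, 9.9, 9.1, 10.4]     # x 1e-4 per site

raw = pearson(ranks(recombination), ranks(neutral_diversity))
controlled = partial_spearman(recombination, neutral_diversity, gc_content)
print(f"Spearman rho: {raw:.3f}; partial rho given GC: {controlled:.3f}")
```

If the partial coefficient stays clearly positive, the association between recombination and diversity is not explained by the covariate alone. Controlling for several covariates at once, as the abstract describes, would need a regression-based approach rather than this single-covariate formula.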
