Human genomic regions with exceptionally high levels of population differentiation identified from 911 whole-genome sequences

BackgroundPopulation differentiation has proved to be effective for identifying loci under geographically localized positive selection, and has the potential to identify loci subject to balancing selection. We have previously investigated the pattern of genetic differentiation among human populations at 36.8 million genomic variants to identify sites in the genome showing high frequency differences. Here, we extend this dataset to include additional variants, survey sites with low levels of differentiation, and evaluate the extent to which highly differentiated sites are likely to result from selective or other processes.ResultsWe demonstrate that while sites with low differentiation represent sampling effects rather than balancing selection, sites showing extremely high population differentiation are enriched for positive selection events and that one half may be the result of classic selective sweeps. Among these, we rediscover known examples, where we actually identify the established functional SNP, and discover novel examples including the genes ABCA12, CALD1 and ZNF804, which we speculate may be linked to adaptations in skin, calcium metabolism and defense, respectively.ConclusionsWe identify known and many novel candidate regions for geographically restricted positive selection, and suggest several directions for further research.

[1]  C. Darwin The Descent of Man and Selection in Relation to Sex: INDEX , 1871 .

[2]  N. Pierce Origin of Species , 1914, Nature.

[3]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[4]  B. Weir,et al.  ESTIMATING F‐STATISTICS FOR THE ANALYSIS OF POPULATION STRUCTURE , 1984, Evolution; international journal of organic evolution.

[5]  B. Charlesworth,et al.  The effect of deleterious mutations on neutral molecular variation. , 1993, Genetics.

[6]  H. Bandelt,et al.  Median-joining networks for inferring intraspecific phylogenies. , 1999, Molecular biology and evolution.

[7]  J. Woo,et al.  Milk Supplementation of the Diet of Postmenopausal Chinese Women on a Low Calcium Intake Retards Bone Loss , 2001, Journal of bone and mineral research : the official journal of the American Society for Bone and Mineral Research.

[8]  Leena Peltonen,et al.  Identification of a variant associated with adult-type hypolactasia , 2002, Nature Genetics.

[9]  M. Olivier A haplotype map of the human genome , 2003, Nature.

[10]  M. Olivier A haplotype map of the human genome. , 2003, Nature.

[11]  Intakes of calcium and other nutrients related to bone health in Japanese female college students: a study using the duplicate portion sampling method. , 2005, The Tohoku journal of experimental medicine.

[12]  S. Gabriel,et al.  Calibrating a coalescent simulation of human genome sequence variation. , 2005, Genome research.

[13]  Keith C. Cheng,et al.  SLC24A5, a Putative Cation Exchanger, Affects Pigmentation in Zebrafish and Humans , 2005, Science.

[14]  D. Charlesworth Balancing Selection and Its Effects on Sequences in Nearby Genome Regions , 2006, PLoS genetics.

[15]  Xiaoquan Wen,et al.  Correction: A Map of Recent Positive Selection in the Human Genome , 2006, PLoS Biology.

[16]  Y. Koda,et al.  Population differences of two coding SNPs in pigmentation-related genes SLC24A5 and SLC45A2 , 2006, International Journal of Legal Medicine.

[17]  J. Pritchard,et al.  A Map of Recent Positive Selection in the Human Genome , 2006, PLoS biology.

[18]  Hans Eiberg,et al.  Blue eye color in humans may be caused by a perfectly associated founder mutation in a regulatory element located within the HERC2 gene inhibiting OCA2 expression , 2008, Human Genetics.

[19]  Pardis C Sabeti,et al.  Genome-wide detection and characterization of positive selection in human populations , 2007, Nature.

[20]  Nicholas G Martin,et al.  A single SNP in an evolutionary conserved region within intron 86 of the HERC2 gene determines human blue-brown eye color. , 2008, American journal of human genetics.

[21]  C. Spencer,et al.  Identification of loci associated with schizophrenia by genome-wide association and follow-up , 2008, Nature Genetics.

[22]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[23]  J. Stephenson 1000 Genomes Project , 2008 .

[24]  M. Akiyama,et al.  Harlequin ichthyosis model mouse reveals alveolar collapse and severe fetal skin barrier defects. , 2008, Human molecular genetics.

[25]  D. Conrad,et al.  Population Differentiation as an Indicator of Recent Positive Selection in Humans: An Empirical Evaluation , 2009, Genetics.

[26]  Gary K. Chen,et al.  Fast and flexible simulation of DNA sequence data. , 2008, Genome research.

[27]  M. Nalls,et al.  Reduced Neutrophil Count in People of African Descent Is Due To a Regulatory Variant in the Duffy Antigen Receptor for Chemokines Gene , 2009, PLoS genetics.

[28]  Joshua M Akey,et al.  Where do we go from here? Constructing genomic maps of positive selection in humans: , 2009 .

[29]  P. Green,et al.  Widespread Genomic Signatures of Natural Selection in Hominid Evolution , 2009, PLoS genetics.

[30]  Joseph K. Pickrell,et al.  Signals of recent positive selection in a worldwide sample of human populations. , 2009, Genome research.

[31]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[32]  B. Kessler,et al.  Post‐translational modification of the deubiquitinating enzyme otubain 1 modulates active RhoA levels and susceptibility to Yersinia invasion , 2010, The FEBS journal.

[33]  A. Gylfason,et al.  Fine-scale recombination rate differences between sexes, populations and individuals , 2010, Nature.

[34]  Or Zuk,et al.  A Composite of Multiple Signals Distinguishes Causal Variants in Regions of Positive Selection , 2010, Science.

[35]  G. Abecasis,et al.  MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes , 2010, Genetic epidemiology.

[36]  Tsun-Po Yang,et al.  Genevar: a database and Java application for the analysis and visualization of SNP-gene associations in eQTL studies , 2010, Bioinform..

[37]  Giovanna Morelli,et al.  Phylogenetic diversity and historical patterns of pandemic spread of Yersinia pestis , 2010, Nature Genetics.

[38]  D. Altshuler,et al.  A map of human genome variation from population-scale sequencing , 2010, Nature.

[39]  Joseph K. Pickrell,et al.  The Genetics of Human Adaptation: Hard Sweeps, Soft Sweeps, and Polygenic Adaptation , 2010, Current Biology.

[40]  Sharon R Grossman,et al.  Integrating common and rare genetic variation in diverse human populations , 2010, Nature.

[41]  Ryan D. Hernandez,et al.  Classic Selective Sweeps Were Rare in Recent Human Evolution , 2011, Science.

[42]  Ivan Nikolov,et al.  Fine mapping of ZNF 804 A and genome-wide significant evidence for its involvement in schizophrenia and bipolar disorder , 2011 .

[43]  Emmanouil Collab A map of human genome variation from population-scale sequencing , 2011, Nature.

[44]  M. Akiyama,et al.  Abca12-mediated lipid transport and Snap29-dependent trafficking of lamellar granules are crucial for epidermal morphogenesis in a zebrafish model of ichthyosis , 2011, Disease Models & Mechanisms.

[45]  G. Abecasis,et al.  Low-coverage sequencing: implications for design of complex trait association studies. , 2011, Genome research.

[46]  D. Kelsell,et al.  Harlequin ichthyosis: a review of clinical and molecular findings in 45 cases. , 2011, Archives of dermatology.

[47]  Jonathan K. Pritchard,et al.  Adaptations to Climate-Mediated Selective Pressures in Humans , 2011, PLoS genetics.

[48]  D. Rujescu,et al.  Fine mapping of ZNF804A and genome-wide significant evidence for its involvement in schizophrenia and bipolar disorder , 2011, Molecular Psychiatry.

[49]  Gonçalo R. Abecasis,et al.  The variant call format and VCFtools , 2011, Bioinform..

[50]  Gabor T. Marth,et al.  Demographic history and rare allele sharing among human populations , 2011, Proceedings of the National Academy of Sciences.

[51]  J. Mandl,et al.  Keratinocyte ATP binding cassette transporter expression is regulated by ultraviolet light. , 2012, Journal of photochemistry and photobiology. B, Biology.

[52]  Kenny Q. Ye,et al.  An integrated map of genetic variation from 1,092 human genomes , 2012, Nature.

[53]  J. Abrams,et al.  Smooth muscle caldesmon modulates peristalsis in the wild type and non‐innervated zebrafish intestine , 2012, Neurogastroenterology and motility : the official journal of the European Gastrointestinal Motility Society.

[54]  B. Maher,et al.  ZNF804a Regulates Expression of the Schizophrenia-Associated Genes PRSS16, COMT, PDE4B, and DRD2 , 2012, PloS one.

[55]  C. Scott,et al.  Harlequin ichthyosis: ABCA12 mutations underlie defective lipid transport, reduced protease regulation and skin-barrier dysfunction , 2012, Cell and Tissue Research.

[56]  Gabor T. Marth,et al.  Integrative Annotation of Variants from 1092 Humans: Application to Cancer Genomics , 2013, Science.

[57]  Eric S. Lander,et al.  Identifying Recent Adaptations in Large-Scale Genomic Data , 2013, Cell.

[58]  Hwee Tong Tan,et al.  Identification and functional validation of caldesmon as a potential gastric cancer metastasis-associated protein. , 2013, Journal of proteome research.

[59]  Min Hu,et al.  FOXP2 targets show evidence of positive selection in European populations. , 2013, American journal of human genetics.

[60]  R. Ferriani,et al.  Caldesmon: New Insights for Diagnosing Endometriosis1 , 2013, Biology of reproduction.

[61]  David Comas,et al.  Extreme Population Differences in the Human Zinc Transporter ZIP4 (SLC39A4) Are Explained by Positive Selection in Sub-Saharan Africa , 2014, PLoS genetics.

[62]  Zachary A. Szpiech,et al.  selscan: An Efficient Multithreaded Program to Perform EHH-Based Scans for Positive Selection , 2014, Molecular biology and evolution.

[63]  Sreeurpa Ray Human Evolutionary Genetics , 2014, The Yale Journal of Biology and Medicine.

[64]  Sreeurpa Ray,et al.  The Cell: A Molecular Approach , 1996 .

[65]  Pavlos Pavlidis,et al.  1000 Genomes Selection Browser 1.0: a genome browser dedicated to signatures of natural selection in modern humans , 2013, Nucleic Acids Res..

[66]  Adam Auton,et al.  The 1000 Genomes Project , 2015 .