Hierarchical Bayesian model of population structure reveals convergent adaptation to high altitude in human populations

Detecting genes involved in local adaptation is challenging and of fundamental importance in evolutionary, quantitative, and medical genetics. To this aim, a standard strategy is to perform genome scans in populations of different origins and environments, looking for genomic regions of high differentiation. Because shared population history or population sub-structure may lead to an excess of false positives, analyses are often done on multiple pairs of populations, which leads to i) a global loss of power as compared to a global analysis, and ii) the need for multiple tests corrections. In order to alleviate these problems, we introduce a new hierarchical Bayesian method to detect markers under selection that can deal with complex demographic histories, where sampled populations share part of their history. Simulations show that our approach is both more powerful and less prone to false positive loci than approaches based on separate analyses of pairs of populations or those ignoring existing complex structures. In addition, our method can identify selection occurring at different levels (i.e. population or region-specific adaptation), as well as convergent selection in different regions. We apply our approach to the analysis of a large SNP dataset from low- and high-altitude human populations from America and Asia. The simultaneous analysis of these two geographic areas allows us to identify several new candidate genome regions for altitudinal selection, and we show that convergent evolution among continents has been quite common. In addition to identifying several genes and biological processes involved in high altitude adaptation, we identify two specific biological pathways that could have evolved in both continents to counter toxic effects induced by hypoxia.

[1]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[2]  R. Durbin,et al.  Revising the human mutation rate: implications for understanding human evolution , 2012, Nature Reviews Genetics.

[3]  M. Krasnow,et al.  The Hypoxic Response: Huffing and HIFing , 1997, Cell.

[4]  Wei Wang,et al.  Natural selection on EPAS1 (HIF2α) associated with low hemoglobin concentration in Tibetan highlanders , 2010, Proceedings of the National Academy of Sciences.

[5]  M. Bennett,et al.  mGluR1-mediated potentiation of NMDA receptors involves a rise in intracellular calcium and activation of protein kinase C , 2001, Neuropharmacology.

[6]  Jacob A. Tennessen,et al.  Parallel Adaptive Divergence among Geographically Diverse Human Populations , 2011, PLoS genetics.

[7]  Yu-ping Peng,et al.  Neuroprotection of Interleukin-6 Against NMDA-induced Neurotoxicity is Mediated by JAK/STAT3, MAPK/ERK, and PI3K/AKT Signaling Pathways , 2012, Cellular and Molecular Neurobiology.

[8]  R. Wanders,et al.  Fatty acid omega‐oxidation as a rescue pathway for fatty acid oxidation disorders in humans , 2011, The FEBS journal.

[9]  Jessica L. Crisci,et al.  Recent progress in polymorphism-based population genetic inference. , 2012, The Journal of heredity.

[10]  Laurent Excoffier,et al.  Evidence for polygenic adaptation to pathogens in the human genome. , 2013, Molecular biology and evolution.

[11]  J. David,et al.  Adaptation to alcoholic fermentation in Drosophila: a parallel selection imposed by environmental ethanol and acetic acid. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[12]  J. Losos,et al.  CONVERGENCE, ADAPTATION, AND CONSTRAINT , 2011, Evolution; international journal of organic evolution.

[13]  Airat Bekmetjev,et al.  Comparing gene set analysis methods on single-nucleotide polymorphism data from Genetic Analysis Workshop 16 , 2009, BMC proceedings.

[14]  Pardis C Sabeti,et al.  Detecting recent positive selection in the human genome from haplotype structure , 2002, Nature.

[15]  Xavier Robin,et al.  pROC: an open-source package for R and S+ to analyze and compare ROC curves , 2011, BMC Bioinformatics.

[16]  S. Narum,et al.  Comparison of FST outlier tests for SNP loci under selection , 2011, Molecular ecology resources.

[17]  D. J. Funk,et al.  Divergent selection and heterogeneous genomic divergence , 2009, Molecular ecology.

[18]  Asan,et al.  Altitude adaptation in Tibet caused by introgression of Denisovan-like DNA , 2014, Nature.

[19]  Jonathan K. Pritchard,et al.  The Genetic Architecture of Adaptations to High Altitude in Ethiopia , 2012, PLoS genetics.

[20]  G. Haddad,et al.  Mechanisms underlying hypoxia-induced neuronal apoptosis , 2000, Progress in Neurobiology.

[21]  D. Balding,et al.  Identifying adaptive genetic divergence among populations from genome scans , 2004, Molecular ecology.

[22]  Asan,et al.  Sequencing of 50 Human Exomes Reveals Adaptation to High Altitude , 2010, Science.

[23]  Jinchuan Xing,et al.  Genetic Evidence for High-Altitude Adaptation in Tibet , 2010, Science.

[24]  K. Kidd,et al.  Ethnic Related Selection for an ADH Class I Variant within East Asia , 2008, PloS one.

[25]  S WRIGHT,et al.  Genetical structure of populations. , 1950, Nature.

[26]  Kenneth K Kidd,et al.  Evidence of positive selection on a class I ADH locus. , 2007, American journal of human genetics.

[27]  Anushya Muruganujan,et al.  PANTHER in 2013: modeling the evolution of gene function, and other gene attributes, in the context of phylogenetic trees , 2012, Nucleic Acids Res..

[28]  L. Moore,et al.  Interleukin-6 response to exercise and high-altitude exposure: influence of α-adrenergic blockade , 2001 .

[29]  B Rannala,et al.  Estimating gene flow in island populations. , 1996, Genetical research.

[30]  S. Tishkoff,et al.  Living the high life: high-altitude adaptation , 2010, Genome Biology.

[31]  Megan J. Wilson,et al.  Andean and Tibetan patterns of adaptation to high altitude , 2013, American journal of human biology : the official journal of the Human Biology Council.

[32]  Pardis C Sabeti,et al.  Genome-wide detection and characterization of positive selection in human populations , 2007, Nature.

[33]  A. F. Bennett,et al.  The Molecular Diversity of Adaptive Convergence , 2012, Science.

[34]  D. J. Funk,et al.  Selection and Genomic Differentiation During Ecological Speciation: Isolating the Contributions of Host Association via a Comparative Genome Scan of Neochlamisus bebbianae Leaf Beetles , 2008, Evolution; international journal of organic evolution.

[35]  O. Gaggiotti,et al.  Quantifying population structure using the F‐model , 2010, Molecular ecology resources.

[36]  Marek Kimmel,et al.  simuPOP: a forward-time population genetics simulation environment , 2005, Bioinform..

[37]  P. Snelgrove,et al.  Parallel adaptive evolution of Atlantic cod on both sides of the Atlantic Ocean in response to temperature , 2010, Proceedings of the Royal Society B: Biological Sciences.

[38]  E. Dennis,et al.  Enhanced Low Oxygen Survival in Arabidopsis through Increased Metabolic Flux in the Fermentative Pathway1 , 2003, Plant Physiology.

[39]  A. Bhatnagar Surviving hypoxia: the importance of rafts, anchors, and fluidity. , 2003, Circulation research.

[40]  Ann Marie Craig,et al.  NMDA Receptor Subunits Have Differential Roles in Mediating Excitotoxic Neuronal Death Both In Vitro and In Vivo , 2007, The Journal of Neuroscience.

[41]  Saurabh Ghosh,et al.  EGLN1 involvement in high-altitude adaptation revealed through genetic analysis of extreme constitution types defined in Ayurveda , 2010, Proceedings of the National Academy of Sciences.

[42]  David B. Witonsky,et al.  Reconstructing Native American Population History , 2012, Nature.

[43]  Sameer Soi,et al.  Genetic adaptation to high altitude in the Ethiopian highlands , 2011, Genome Biology.

[44]  Christian Schlötterer,et al.  Detecting Selective Sweeps: A New Approach Based on Hidden Markov Models , 2009, Genetics.

[45]  Joseph K. Pickrell,et al.  The Genetics of Human Adaptation: Hard Sweeps, Soft Sweeps, and Polygenic Adaptation , 2010, Current Biology.

[46]  W. Salzburger,et al.  Uninformative polymorphisms bias genome scans for signatures of selection , 2012, BMC Evolutionary Biology.

[47]  John D. Storey The positive false discovery rate: a Bayesian interpretation and the q-value , 2003 .

[48]  J. Pritchard,et al.  A Map of Recent Positive Selection in the Human Genome , 2006, PLoS biology.

[49]  Renata C. Geer,et al.  The NCBI BioSystems database , 2009, Nucleic Acids Res..

[50]  Ya-ping Zhang,et al.  Genetic adaptation of the hypoxia-inducible factor pathway to oxygen pressure among eurasian human populations. , 2012, Molecular biology and evolution.

[51]  T. Tatusova,et al.  Entrez Gene: gene-centered information at NCBI , 2006, Nucleic Acids Res..

[52]  Raimund Erbel,et al.  Two New Loci for Body-Weight Regulation Identified in a Joint Analysis of Genome-Wide Association Studies for Early-Onset Extreme Obesity in French and German Study Groups , 2010, PLoS genetics.

[53]  D. MacArthur,et al.  High altitude adaptation in Daghestani populations from the Caucasus , 2011, Human Genetics.

[54]  S. Wright THE INTERPRETATION OF POPULATION STRUCTURE BY F‐STATISTICS WITH SPECIAL REGARD TO SYSTEMS OF MATING , 1965 .

[55]  Patrik Nosil,et al.  Heterogeneous Genomic Differentiation Between Walking-Stick Ecotypes: “Isolation by Adaptation” and Multiple Roles for Divergent Selection , 2008, Evolution; international journal of organic evolution.

[56]  K. Kidd,et al.  The evolution and population genetics of the ALDH2 locus: random genetic drift, selection, and low levels of recombination , 2004, Annals of human genetics.

[57]  Matthieu Foll,et al.  Enhanced AFLP genome scans detect local adaptation in high‐altitude populations of a small rodent (Microtus arvalis) , 2011, Molecular ecology.

[58]  Joseph Lachance,et al.  Population Genomics of Human Adaptation. , 2013, Annual review of ecology, evolution, and systematics.

[59]  T. Hornbein,et al.  High Altitude : An Exploration of Human Adaptation , 2001 .

[60]  David B. Witonsky,et al.  Admixture facilitates genetic adaptations to high altitude in Tibet , 2014, Nature Communications.

[61]  R. Nielsen,et al.  On Detecting Incomplete Soft or Hard Selective Sweeps Using Haplotype Structure , 2014, Molecular biology and evolution.

[62]  L. Jorde,et al.  Genetic determinants of Tibetan high-altitude adaptation , 2011, Human Genetics.

[63]  Nicholas Stiffler,et al.  Population Genomics of Parallel Adaptation in Threespine Stickleback using Sequenced RAD Tags , 2010, PLoS genetics.

[64]  L. Bernatchez,et al.  Generic scan using AFLP markers as a means to assess the role of directional selection in the divergence of sympatric whitefish ecotypes. , 2004, Molecular biology and evolution.

[65]  O. Gaggiotti,et al.  A Genome-Scan Method to Identify Selected Loci Appropriate for Both Dominant and Codominant Markers: A Bayesian Perspective , 2008, Genetics.

[66]  Yi Peng,et al.  The ADH1B Arg47His polymorphism in East Asian populations and expansion of rice domestication in history , 2010, BMC Evolutionary Biology.

[67]  Taane G. Clark,et al.  Detecting and characterizing genomic signatures of positive selection in global populations. , 2013, American journal of human genetics.

[68]  M. Olivier A haplotype map of the human genome , 2003, Nature.

[69]  C. Beall Andean, Tibetan, and Ethiopian patterns of adaptation to high-altitude hypoxia. , 2006, Integrative and comparative biology.

[70]  M. Arnegard,et al.  The probability of genetic parallelism and convergence in natural populations , 2012, Proceedings of the Royal Society B: Biological Sciences.

[71]  G. Tseng,et al.  Comprehensive literature review and statistical considerations for GWAS meta-analysis , 2012, Nucleic acids research.

[72]  Hui Zhang,et al.  Genetic variations in Tibetan populations and high-altitude adaptation at the Himalayas. , 2011, Molecular biology and evolution.

[73]  L. Excoffier,et al.  Detecting loci under selection in a hierarchically structured population , 2009, Heredity.

[74]  H. Young,et al.  The talented interferon-gamma , 2013 .

[75]  Megan J. Wilson,et al.  Identifying positive selection candidate loci for high-altitude adaptation in Andean populations , 2009, Human Genomics.

[76]  Luca Pagani,et al.  Genetic Signatures Reveal High-Altitude Adaptation in a Set of Ethiopian Populations , 2013, Molecular biology and evolution.

[77]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[78]  L. Jorde,et al.  Metabolic insight into mechanisms of high-altitude adaptation in Tibetans. , 2012, Molecular genetics and metabolism.

[79]  Rui Mei,et al.  Identifying Signatures of Natural Selection in Tibetan and Andean Populations Using Dense Genome Scan Data , 2010, PLoS genetics.

[80]  Yiping Shen,et al.  A genome-wide search for signals of high-altitude adaptation in Tibetans. , 2011, Molecular biology and evolution.

[81]  M. Jakobsson,et al.  Joint analysis of demography and selection in population genetics: where do we stand and where could we go? , 2012, Molecular ecology.

[82]  Damian Szklarczyk,et al.  STRING v9.1: protein-protein interaction networks, with increased coverage and integration , 2012, Nucleic Acids Res..