Disentangling Signatures of Selection Before and After European Colonization in Latin Americans

Throughout human evolutionary history, large-scale migrations have led to intermixing (i.e., admixture) between previously separated human groups. While classical and recent work have shown that studying admixture can yield novel historical insights, the extent to which this process contributed to adaptation remains underexplored. Here, we introduce a novel statistical model, specific to admixed populations, that identifies loci under selection while determining whether the selection likely occurred post-admixture or prior to admixture in one of the ancestral source populations. Through extensive simulations we show that this method is able to detect selection, even in recently formed admixed populations, and to accurately differentiate between selection occurring in the ancestral or admixed population. We apply this method to genome-wide SNP data of ~4,000 individuals in five admixed Latin American cohorts from Brazil, Chile, Colombia, Mexico and Peru. Our approach replicates previous reports of selection in the HLA region that are consistent with selection post-admixture. We also report novel signals of selection in genomic regions spanning 47 genes, reinforcing many of these signals with an alternative, commonly-used local-ancestry-inference approach. These signals include several genes involved in immunity, which may reflect responses to endemic pathogens of the Americas and to the challenge of infectious disease brought by European contact. In addition, some of the strongest signals inferred to be under selection in the Native American ancestral groups of modern Latin Americans overlap with genes implicated in energy metabolism phenotypes, plausibly reflecting adaptations to novel dietary sources available in the Americas.

[1]  L. Quintana-Murci,et al.  The genomic signatures of natural selection in admixed human populations , 2021, bioRxiv.

[2]  Christopher R. Gignoux,et al.  Clotting factor genes are associated with preeclampsia in high altitude pregnant women in the Peruvian Andes , 2021, medRxiv.

[3]  Katharine L Korunes,et al.  Rapid adaptation to malaria facilitated by admixture in the human population of Cabo Verde. , 2021, eLife.

[4]  E. McDonagh,et al.  Open Targets Platform: supporting systematic drug–target identification and prioritisation , 2020, Nucleic Acids Res..

[5]  Gautier Koscielny,et al.  Open Targets Genetics: systematic identification of trait-associated genes using large-scale genetics and functional genomics , 2020, Nucleic Acids Res..

[6]  Katharine L Korunes,et al.  Rapid adaptation to malaria facilitated by admixture in the human population of Cabo Verde , 2020, bioRxiv.

[7]  S. Eyheramendy,et al.  Postadmixture Selection on Chileans Targets Haplotype Involved in Pigmentation, Thermogenesis and Immune Defense against Pathogens , 2020, Genome biology and evolution.

[8]  J. Dipierri,et al.  Fine-scale genomic analyses of admixed individuals reveal unrecognized genetic ancestry components in Argentina , 2020, bioRxiv.

[9]  William J. Astle,et al.  Trans-ethnic and Ancestry-Specific Blood-Cell Genetics in 746,667 Individuals from 5 Global Populations , 2020, Cell.

[10]  L. Liang,et al.  Shared Genetic and Experimental Links between Obesity-Related Traits and Asthma Subtypes in UK Biobank. , 2020, The Journal of allergy and clinical immunology.

[11]  I. Mathieson Limited Evidence for Selection at the FADS Locus in Native American Populations , 2019, bioRxiv.

[12]  R. Bucala,et al.  Macrophage migration inhibitory factor promoter polymorphisms are associated with disease activity in rheumatoid arthritis patients from Southern Mexico , 2019, Molecular genetics & genomic medicine.

[13]  Andrew B. Conley,et al.  Admixture-enabled selection for rapid adaptive evolution in the Americas , 2019, Genome Biology.

[14]  Wei Wang,et al.  Replication of previous genome-wide association studies of HKDC1, BACE2, SLC16A11 and TMEM163 SNPs in a gestational diabetes mellitus case–control sample from Han Chinese population , 2019, Diabetes, metabolic syndrome and obesity : targets and therapy.

[15]  Carina M. Schlebusch,et al.  Population history and genetic adaptation of the Fulani nomads: inferences from genome-wide data and the lactase persistence trait , 2019, BMC Genomics.

[16]  Deborah A. Bolnick,et al.  Comparing signals of natural selection between three Indigenous North American populations , 2019, Proceedings of the National Academy of Sciences.

[17]  Scott M. Williams,et al.  The Missing Diversity in Human Genetic Studies , 2019, Cell.

[18]  R. Nielsen,et al.  Ohana: detecting selection in multiple populations by modelling ancestral admixture components , 2019, bioRxiv.

[19]  Christopher R. Gignoux,et al.  Population History and Gene Divergence in Native Mexicans Inferred from 76 Human Exomes , 2019, bioRxiv.

[20]  R. D. del Ángel,et al.  The Role of Host Cholesterol During Flavivirus Infection , 2018, Front. Cell. Infect. Microbiol..

[21]  Laura J. Scott,et al.  Trans-ethnic association study of blood pressure determinants in over 750,000 individuals , 2018, Nature Genetics.

[22]  J. Greenbaum,et al.  Impact of Genetic Polymorphisms on Human Immune Cell Gene Expression , 2018, Cell.

[23]  C. Warinner,et al.  The genetic prehistory of the Andean highlands 7000 years BP though European contact , 2018, Science Advances.

[24]  Fernando Racimo,et al.  Identifying loci under positive selection in complex population histories , 2018, bioRxiv.

[25]  Kyle J. Gaulton,et al.  Maternal and fetal genetic effects on birth weight and their relevance to cardio-metabolic risk factors , 2018, Nature Genetics.

[26]  Philipp W. Messer,et al.  SLiM 3: Forward Genetic Simulations Beyond the Wright–Fisher Model , 2018, bioRxiv.

[27]  K. Kidd,et al.  Recent Selection on a Class I ADH Locus Distinguishes Southwest Asian Populations Including Ashkenazi Jews , 2018, Genes.

[28]  Po-Ru Loh,et al.  Mixed-model association for biobank-scale datasets , 2018, Nature Genetics.

[29]  Samuel E. Jones,et al.  Meta-analysis of genome-wide association studies for body fat distribution in 694 649 individuals of European ancestry , 2018, bioRxiv.

[30]  M. Stoneking,et al.  Strong selection during the last millennium for African ancestry in the admixed population of Madagascar , 2018, Nature Communications.

[31]  D. Balding,et al.  Latin Americans show wide-spread Converso ancestry and imprint of local Native ancestry on physical appearance , 2018, Nature Communications.

[32]  I. Ruczinski,et al.  Evolution of Hominin Polyunsaturated Fatty Acid Metabolism: From Africa to the New World , 2017, bioRxiv.

[33]  M. Bortolini,et al.  Genetic signature of natural selection in first Americans , 2017, Proceedings of the National Academy of Sciences.

[34]  He Gao,et al.  Genome-wide association analysis identifies novel blood pressure loci and offers biological insights into cardiovascular risk , 2017, Nature Genetics.

[35]  Gautier Koscielny,et al.  Open Targets: a platform for therapeutic target identification and validation , 2016, Nucleic Acids Res..

[36]  N. Risch,et al.  Genome-wide association analyses using electronic health records identify new loci influencing blood pressure variation , 2016, Nature Genetics.

[37]  M. Aldenderfer,et al.  Late Archaic–Early Formative period microbotanical evidence for potato at Jiskairumoko in the Titicaca Basin of southern Peru , 2016, Proceedings of the National Academy of Sciences.

[38]  Yancy Lo,et al.  Going global by adapting local: A review of recent human adaptation , 2016, Science.

[39]  Fernando Racimo,et al.  Signatures of Archaic Adaptive Introgression in Present-Day Human Populations , 2016, bioRxiv.

[40]  Timothy E. Reddy,et al.  HKDC1 Is a Novel Hexokinase Involved in Whole-Body Glucose Use. , 2016, Endocrinology.

[41]  V. Mohan,et al.  Hexokinase Domain Containing 1 (HKDC1) Gene Variants and their Association with Gestational Diabetes Mellitus in a South Indian Population , 2016, Annals of human genetics.

[42]  Sayan Mukherjee,et al.  Fast Principal-Component Analysis Reveals Convergent Evolution of ADH1B in Europe and East Asia. , 2016, American journal of human genetics.

[43]  Shuhua Xu,et al.  Ancestry variation and footprints of natural selection along the genome in Latin American populations , 2016, Scientific Reports.

[44]  Yongtao Guan,et al.  Correction: Strong Selection at MHC in Mexicans since Admixture , 2016, PLoS genetics.

[45]  Liran Carmel,et al.  Archaic Adaptive Introgression in TBX15/WARS2 , 2015, bioRxiv.

[46]  Carl D. Langefeld,et al.  Genomic Insights into the Ancestry and Demographic History of South America , 2015, PLoS genetics.

[47]  D. Reich,et al.  Genome-wide patterns of selection in 230 ancient Eurasians , 2015, Nature.

[48]  Gabor T. Marth,et al.  A global reference for human genetic variation , 2015, Nature.

[49]  Anders Albrechtsen,et al.  Greenlandic Inuit show genetic signatures of diet and climate adaptation , 2015, Science.

[50]  Andrew B. Conley,et al.  Ancestry, admixture and fitness in Colombian genomes , 2015, Scientific Reports.

[51]  Swapan Mallick,et al.  Massive migration from the steppe was a source for Indo-European languages in Europe , 2015, Nature.

[52]  L. Mudd,et al.  Preeclampsia and Diabetes , 2015, Current Diabetes Reports.

[53]  Christopher D. Brown,et al.  Coordinated Regulatory Variation Associated with Gestational Hyperglycemia Regulates Expression of the Novel Hexokinase HKDC1 , 2014, Nature Communications.

[54]  Carson C Chow,et al.  Second-generation PLINK: rising to the challenge of larger and richer datasets , 2014, GigaScience.

[55]  Melinda C Aldrich,et al.  Genome-wide scan of 29,141 African Americans finds no evidence of directional selection since admixture. , 2014, American journal of human genetics.

[56]  D. Balding,et al.  Admixture in Latin America: Geographic Structure, Phenotypic Diversity and Self-Perception of Ancestry Based on 7,342 Individuals , 2014, PLoS genetics.

[57]  Joseph K. Pickrell,et al.  Natural selection for the Duffy-null allele in the recently admixed people of Madagascar , 2014, Proceedings of the Royal Society B: Biological Sciences.

[58]  Christopher R. Gignoux,et al.  The genetics of Mexico recapitulates Native American substructure and affects biomedical traits , 2014, Science.

[59]  Pardis C. Sabeti,et al.  Natural selection and infectious disease in human populations , 2014, Nature Reviews Genetics.

[60]  Haitao Wen,et al.  Inflammasomes and metabolic disorders: old genes in modern diseases. , 2014, Molecular cell.

[61]  D. Falush,et al.  A Genetic Atlas of Human Admixture History , 2014, Science.

[62]  Yongtao Guan Detecting Structure of Haplotypes and Local Ancestry , 2014, Genetics.

[63]  Bonnie Berger,et al.  Ancient human genomes suggest three ancestral populations for present-day Europeans , 2013, Nature.

[64]  Timothy E. Reddy,et al.  Identification of HKDC1 and BACE2 as Genes Influencing Glycemic Traits During Pregnancy Through Genome-Wide Association Studies , 2013, Diabetes.

[65]  C. Bustamante,et al.  RFMix: a discriminative modeling approach for rapid and robust local-ancestry inference. , 2013, American journal of human genetics.

[66]  N. Patterson,et al.  Estimating and interpreting FST: The impact of rare variants , 2013, Genome research.

[67]  Jake K. Byrnes,et al.  Reconstructing the Population Genetic History of the Caribbean , 2013, PLoS genetics.

[68]  Pedro C. Avila,et al.  Analysis of Latino populations from GALA and MEC studies reveals genomic loci with biased local ancestry estimation , 2013, Bioinform..

[69]  Ellen T. Gelfand,et al.  The Genotype-Tissue Expression (GTEx) project , 2013, Nature Genetics.

[70]  F. Borrego The CD300 molecules: an emerging family of regulators of the immune system. , 2013, Blood.

[71]  Swapan Mallick,et al.  Ancient Admixture in Human History , 2012, Genetics.

[72]  O. Delaneau,et al.  A linear complexity phasing method for thousands of genomes , 2011, Nature Methods.

[73]  Gabor T. Marth,et al.  Demographic history and rare allele sharing among human populations , 2011, Proceedings of the National Academy of Sciences.

[74]  Carey N Lumeng,et al.  Inflammatory links between obesity and metabolic disease. , 2011, The Journal of clinical investigation.

[75]  M. van Dijk,et al.  STOX1: Key Player in Trophoblast Dysfunction Underlying Early Onset Preeclampsia with Growth Retardation , 2010, Journal of pregnancy.

[76]  M. Bortolini,et al.  A functional ABCA1 gene variant is associated with low HDL-cholesterol levels and shows evidence of positive selection in Native Americans. , 2010, Human molecular genetics.

[77]  Asan,et al.  Sequencing of 50 Human Exomes Reveals Adaptation to High Altitude , 2010, Science.

[78]  David B. Witonsky,et al.  Human adaptations to diet, subsistence, and ecoregion are due to subtle shifts in allele frequency , 2010, Proceedings of the National Academy of Sciences.

[79]  E. Campbell,et al.  Metabolic Shifts in Immunity and Inflammation , 2010, The Journal of Immunology.

[80]  Ryan D. Hernandez,et al.  Inferring the Joint Demographic History of Multiple Populations from Multidimensional SNP Frequency Data , 2009, PLoS genetics.

[81]  David H. Alexander,et al.  Fast model-based estimation of ancestry in unrelated individuals. , 2009, Genome research.

[82]  T. Beaty,et al.  Genetic Admixture in Brazilians Exposed to Infection with Leishmania chagasi , 2009, Annals of human genetics.

[83]  E. Boerwinkle,et al.  Genome-wide distribution of ancestry in Mexican Americans , 2008, Human Genetics.

[84]  C. Aguilar-Salinas,et al.  Association of the ATP-Binding Cassette Transporter A1 R230C Variant With Early-Onset Type 2 Diabetes in a Mexican Population , 2008, Diabetes.

[85]  Mattias Jakobsson,et al.  Genetic Variation and Population Structure in Native Americans , 2007, PLoS genetics.

[86]  Rui Mei,et al.  Recent genetic selection in the ancestral admixture of Puerto Ricans. , 2007, American journal of human genetics.

[87]  S. Zamudio High-altitude hypoxia and preeclampsia. , 2007, Frontiers in bioscience : a journal and virtual library.

[88]  Marie van Dijk,et al.  Maternal segregation of the Dutch preeclampsia locus at 10q22 with a new member of the winged helix gene family , 2005, Nature Genetics.

[89]  D. Balding,et al.  A method for quantifying differentiation between populations at multi-allelic loci and its implications for investigating identity and paternity , 2005, Genetica.

[90]  Pardis C Sabeti,et al.  Genetic signatures of strong recent positive selection at the lactase gene. , 2004, American journal of human genetics.

[91]  Toshihiro Tanaka The International HapMap Project , 2003, Nature.

[92]  K. Shuai,et al.  Regulation of JAK–STAT signalling in the immune system , 2003, Nature Reviews Immunology.

[93]  T. Calandra,et al.  Macrophage migration inhibitory factor: a regulator of innate immunity , 2003, Nature Reviews Immunology.

[94]  D. Swallow,et al.  The Causal Element for the Lactase Persistence/ non‐persistence Polymorphism is Located in a 1 Mb Region of Linkage Disequilibrium in Europeans , 2003, Annals of human genetics.

[95]  B. Sibai,et al.  Diagnosis and management of gestational hypertension and preeclampsia. , 2003, Obstetrics and gynecology.

[96]  M. Bagot,et al.  Triggering CD101 molecule on human cutaneous dendritic cells inhibits T cell proliferation via IL‐10 production , 2000, European journal of immunology.

[97]  M. A. Crook,et al.  Is Type II diabetes mellitus a disease of the innate immune system? , 1998, Diabetologia.

[98]  E. Engleman,et al.  V7 (CD101) ligation inhibits TCR/CD3-induced IL-2 production by blocking Ca2+ flux and nuclear factor of activated T cell nuclear translocation. , 1998, Journal of immunology.

[99]  B. Sibai,et al.  The relationship between abnormal glucose tolerance and hypertensive disorders of pregnancy in healthy nulliparous women , 1997 .

[100]  Michael R. Green,et al.  Gene Expression , 1993, Progress in Gene Expression.

[101]  J. Long The genetic structure of admixed populations. , 1991, Genetics.

[102]  H. Akaike A new look at the statistical model identification , 1974 .

[103]  H GRUNEBERG,et al.  Human genetics. , 1947, The Eugenics review.