Insight into the genomic history of the Near East from whole-genome sequences and genotypes of Yemenis

We report high-coverage whole-genome sequencing data from 46 Yemeni individuals as well as genome-wide genotyping data from 169 Yemenis from diverse locations. We use this dataset to characterize the genetic diversity in Yemen and how it relates to people elsewhere in the Near East. Yemen is a vast region with substantial cultural and geographic diversity, but we found little genetic structure correlating with geography among the Yemenis, probably reflecting continuous movement of people between the regions. African ancestry from admixture in the past 800 years is widespread in Yemen and is the main contributor to the country’s limited genetic structure, with some individuals in Hudayda and Hadramout having up to 20% of their genetic ancestry from Africa. In contrast, individuals from Maarib appear to have been genetically isolated from the African gene flow and thus have genomes likely to reflect Yemen’s ancestry before the admixture. This ancestry was comparable to the ancestry present during the Bronze Age in the distant northern regions of the Near East. After the Bronze Age, the south and north of the Near East therefore followed different genetic trajectories: in the north, the Levantines admixed with a Eurasian population carrying steppe ancestry whose impact never reached as far south as Yemen, where people instead admixed with Africans, leading to the genetic structure observed in the Near East today.
