Between Lake Baikal and the Baltic Sea: genomic history of the gateway to Europe

BackgroundThe history of human populations occupying the plains and mountain ridges separating Europe from Asia has been eventful, as these natural obstacles were crossed westward by multiple waves of Turkic and Uralic-speaking migrants as well as eastward by Europeans. Unfortunately, the material records of history of this region are not dense enough to reconstruct details of population history. These considerations stimulate growing interest to obtain a genetic picture of the demographic history of migrations and admixture in Northern Eurasia.ResultsWe genotyped and analyzed 1076 individuals from 30 populations with geographical coverage spanning from Baltic Sea to Baikal Lake. Our dense sampling allowed us to describe in detail the population structure, provide insight into genomic history of numerous European and Asian populations, and significantly increase quantity of genetic data available for modern populations in region of North Eurasia. Our study doubles the amount of genome-wide profiles available for this region.We detected unusually high amount of shared identical-by-descent (IBD) genomic segments between several Siberian populations, such as Khanty and Ket, providing evidence of genetic relatedness across vast geographic distances and between speakers of different language families. Additionally, we observed excessive IBD sharing between Khanty and Bashkir, a group of Turkic speakers from Southern Urals region. While adding some weight to the “Finno-Ugric” origin of Bashkir, our studies highlighted that the Bashkir genepool lacks the main “core”, being a multi-layered amalgamation of Turkic, Ugric, Finnish and Indo-European contributions, which points at intricacy of genetic interface between Turkic and Uralic populations. Comparison of the genetic structure of Siberian ethnicities and the geography of the region they inhabit point at existence of the “Great Siberian Vortex” directing genetic exchanges in populations across the Siberian part of Asia.Slavic speakers of Eastern Europe are, in general, very similar in their genetic composition. Ukrainians, Belarusians and Russians have almost identical proportions of Caucasus and Northern European components and have virtually no Asian influence. We capitalized on wide geographic span of our sampling to address intriguing question about the place of origin of Russian Starovers, an enigmatic Eastern Orthodox Old Believers religious group relocated to Siberia in seventeenth century. A comparative reAdmix analysis, complemented by IBD sharing, placed their roots in the region of the Northern European Plain, occupied by North Russians and Finno-Ugric Komi and Karelian people. Russians from Novosibirsk and Russian Starover exhibit ancestral proportions close to that of European Eastern Slavs, however, they also include between five to 10 % of Central Siberian ancestry, not present at this level in their European counterparts.ConclusionsOur project has patched the hole in the genetic map of Eurasia: we demonstrated complexity of genetic structure of Northern Eurasians, existence of East-West and North-South genetic gradients, and assessed different inputs of ancient populations into modern populations.

[1]  Mikhail S. Gelfand,et al.  Genomic study of the Ket: a Paleo-Eskimo-related ethnic group with significant ancient North Eurasian ancestry , 2015, Scientific Reports.

[2]  Luca Pagani,et al.  The GenoChip: A New Tool for Genetic Anthropology , 2012, Genome biology and evolution.

[3]  R. Mägi,et al.  Genetic Structure of Europeans: A View from the North–East , 2009, PloS one.

[4]  Bonnie Berger,et al.  Ancient human genomes suggest three ancestral populations for present-day Europeans , 2013, Nature.

[5]  Swapan Mallick,et al.  Ancient Admixture in Human History , 2012, Genetics.

[6]  K. Veeramah,et al.  Extensive genome-wide autozygosity in the population isolates of Daghestan , 2015, European Journal of Human Genetics.

[7]  Li Jin,et al.  Genome-wide signatures of male-mediated migration shaping the Indian gene pool , 2015, Journal of Human Genetics.

[8]  Mattias Jakobsson,et al.  Genomic evidence for the Pleistocene and recent population history of Native Americans , 2015, Science.

[9]  Kenneth Lange,et al.  Enhancements to the ADMIXTURE algorithm for individual ancestry estimation , 2011, BMC Bioinformatics.

[10]  Ariella L. Gladstein,et al.  No Evidence from Genome-Wide Data of a Khazar Origin for the Ashkenazi Jews , 2013, Human biology.

[11]  B. Browning,et al.  A fast, powerful method for detecting identity by descent. , 2011, American journal of human genetics.

[12]  A. Róna-Tas Hungarians and Europe in the Early Middle Ages: An Introduction to Early Hungarian History , 1999 .

[13]  L. Cavalli-Sforza The Human Genome Diversity Project: past, present and future , 2005, Nature Reviews Genetics.

[14]  R. Mägi,et al.  A Selective Sweep on a Deleterious Mutation in CPT1A in Arctic Populations. , 2014, American journal of human genetics.

[15]  Eran Halperin,et al.  A model based approach for analysis of spatial structure in genetic data , 2013 .

[16]  Josyf Mychaleckyj,et al.  Robust relationship inference in genome-wide association studies , 2010, Bioinform..

[17]  Amit R. Indap,et al.  Genes mirror geography within Europe , 2008, Nature.

[18]  V. S. Arun,et al.  Population Differentiation of Southern Indian Male Lineages Correlates with Agricultural Expansions Predating the Caste System , 2012, PloS one.

[19]  G. McVean A Genealogical Interpretation of Principal Components Analysis , 2009, PLoS genetics.

[20]  Luigi Luca Cavalli-sfroza The History and Geography of Human Genes , 1994 .

[21]  V. Stepanov,et al.  [Genetic diversity of Khakassian gene pool: subethnic differensiation and the structure of Y-chromosome haplogroups,]. , 2011, Molekuliarnaia biologiia.

[22]  V. Stepanov,et al.  Gene-pool structure of Tuvinians inferred from Y-chromosome marker data , 2013, Russian Journal of Genetics.

[23]  Gudmundur A. Thorisson,et al.  The International HapMap Project Web site. , 2005, Genome research.

[24]  August E. Woerner,et al.  Higher Levels of Neanderthal Ancestry in East Asians than in Europeans , 2013, Genetics.

[25]  R. Mägi,et al.  Upper Palaeolithic Siberian genome reveals dual ancestry of Native Americans , 2013, Nature.

[26]  Pablo Villoslada,et al.  Analysis and Application of European Genetic Substructure Using 300 K SNP Information , 2008, PLoS genetics.

[27]  Sharon R Grossman,et al.  Integrating common and rare genetic variation in diverse human populations , 2010, Nature.

[28]  Mark D Shriver,et al.  Measuring European population stratification with microarray genotype data. , 2007, American journal of human genetics.

[29]  Alkes L. Price,et al.  Reconstructing Indian Population History , 2009, Nature.

[30]  Sohini Ramachandran,et al.  Support from the relationship of genetic and geographic distance in human populations for a serial founder effect originating in Africa. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[31]  E. Balanovska,et al.  Ancient DNA Reveals Prehistoric Gene-Flow from Siberia in the Complex Human Population History of North East Europe , 2013, PLoS genetics.

[32]  Yun S. Song,et al.  The Simons Genome Diversity Project: 300 genomes from 142 diverse populations , 2016, Nature.

[33]  Peter A Underhill,et al.  The Caucasus as an asymmetric semipermeable barrier to ancient human migrations. , 2012, Molecular biology and evolution.

[34]  S. Heath,et al.  Investigation of the fine structure of European populations with applications to disease association studies , 2008, European Journal of Human Genetics.

[35]  Mait Metspalu,et al.  The Genetic Legacy of the Expansion of Turkic-Speaking Nomads across Eurasia , 2014, bioRxiv.

[36]  Eleazar Eskin,et al.  Spatial localization of recent ancestors for admixed individuals , 2014 .

[37]  Swapan Mallick,et al.  Massive migration from the steppe was a source for Indo-European languages in Europe , 2015, Nature.

[38]  Mehedi Hassan,et al.  Differential Evolution Approach to Detect Recent Admixture , 2015 .

[39]  Ajay K. Royyuru,et al.  Geographic population structure analysis of worldwide human populations infers their biogeographical origins , 2014, Nature Communications.

[40]  V. Stepanov,et al.  The origin of Yakuts: Analysis of the Y-chromosome haplotypes , 2008, Molecular Biology.

[41]  Reconstructing Genetic History of Siberian and Northeastern European Populations , 2015 .

[42]  Kenny Q. Ye,et al.  An integrated map of genetic variation from 1,092 human genomes , 2012, Nature.

[43]  Joseph K. Pickrell,et al.  Inference of population splits and mixtures from genome-wide allele frequency data , 2012 .

[44]  D. Reich,et al.  Genome-wide patterns of selection in 230 ancient Eurasians , 2015, Nature.

[45]  V. Stepanov,et al.  Genetic diversity of the Khakass gene pool: Subethnic differentiation and the structure of Y-chromosome haplogroups , 2011, Molecular Biology.

[46]  V. Stepanov,et al.  Gene pool of Buryats: Clinal variability and territorial subdivision based on data of Y-chromosome markers , 2014, Russian Journal of Genetics.

[47]  C. Tyler-Smith,et al.  Parallel evolution of genes and languages in the Caucasus region. , 2011, Molecular biology and evolution.

[48]  B. Malyarchuk,et al.  Polymorphism of the Y-Chromosome Diallelic Loci in Ethnic Groups of the Altai–Sayan Region , 2002, Russian Journal of Genetics.

[49]  M. Hammer,et al.  High Levels of Y-Chromosome Differentiation among Native Siberian Populations and the Genetic Signature of a Boreal Hunter-Gatherer Way of Life , 2002, Human biology.

[50]  M. Jakobsson,et al.  Origins and Genetic Legacy of Neolithic Farmers and Hunter-Gatherers in Europe , 2012, Science.

[51]  Joseph K. Pickrell,et al.  Signals of recent positive selection in a worldwide sample of human populations. , 2009, Genome research.

[52]  E. Eller Population substructure and isolation by distance in three continental regions. , 1999, American journal of physical anthropology.

[53]  C. Tyler-Smith,et al.  Genetic evidence for an origin of the Armenians from Bronze Age mixing of multiple populations , 2015, European Journal of Human Genetics.

[54]  Early farmers from across Europe directly descended from Neolithic Aegeans , 2015 .

[55]  David H. Alexander,et al.  Fast model-based estimation of ancestry in unrelated individuals. , 2009, Genome research.

[56]  M. Feldman,et al.  Worldwide Human Relationships Inferred from Genome-Wide Patterns of Variation , 2008 .

[57]  D. F. Roberts,et al.  The History and Geography of Human Genes , 1996 .

[58]  C. Tyler-Smith,et al.  Genetic Heritage of the Balto-Slavic Speaking Populations: A Synthesis of Autosomal, Mitochondrial and Y-Chromosomal Data , 2015, PloS one.

[59]  E. V. Balanovskaya GENETIC HERITAGE OF THE BALTO-SLAVIC SPEAKING POPULATIONS: A SYNTHESIS OF AUTOSOMAL, MITOCHONDRIAL AND Y-CHROMOSOMAL DATA , 2015 .

[60]  A. Clark,et al.  Indigenous Arabs are descendants of the earliest split from ancient Eurasian populations , 2016, Genome research.

[61]  Alexander S. Mikheyev,et al.  Toward high-resolution population genomics using archaeological samples , 2016, DNA research : an international journal for rapid publication of reports on genes and genomes.

[62]  A. Torroni,et al.  Reconciling evidence from ancient and contemporary genomes: a major source for the European Neolithic within Mediterranean Europe , 2017, Proceedings of the Royal Society B: Biological Sciences.

[63]  Shuichi Matsumura,et al.  Genetic Discontinuity Between Local Hunter-Gatherers and Central Europe’s First Farmers , 2009, Science.

[64]  M. Hammer,et al.  Coevolution of genes and languages and high levels of population structure among the highland populations of Daghestan , 2015, Journal of Human Genetics.

[65]  John Novembre,et al.  Perspectives on human population structure at the cusp of the sequencing era. , 2011, Annual review of genomics and human genetics.

[66]  B. Peter,et al.  Admixture, Population Structure, and F-Statistics , 2015, Genetics.

[67]  John Novembre,et al.  Inferring genetic ancestry: opportunities, challenges, and implications. , 2010, American journal of human genetics.

[68]  Mapping Human Genetic Diversity in Asia , 2013 .

[69]  J. Relethford Global Analysis of Regional Differences in Craniometric Diversity and Population Substructure , 2001, Human biology.

[70]  David Reich,et al.  Discerning the Ancestry of European Americans in Genetic Association Studies , 2007, PLoS genetics.

[71]  John Novembre,et al.  Visualizing spatial population structure with estimated effective migration surfaces , 2014 .

[72]  Michael C. Westaway,et al.  Genomic analyses inform on migration events during the peopling of Eurasia , 2016, Nature.

[73]  Saharon Rosset,et al.  The genome-wide structure of the Jewish people , 2010, Nature.

[74]  G. Coop,et al.  Inferring Recent Demography from Isolation by Distance of Long Shared Sequence Blocks , 2016, Genetics.

[75]  Mait Metspalu,et al.  Autosomal and uniparental portraits of the native populations of Sakha (Yakutia): implications for the peopling of Northeast Eurasia , 2013, BMC Evolutionary Biology.

[76]  D. Graur,et al.  The ‘extremely ancient’ chromosome that isn’t: a forensic bioinformatic investigation of Albert Perry’s X-degenerate portion of the Y chromosome , 2014, European Journal of Human Genetics.

[77]  Pablo Villoslada,et al.  European Population Substructure: Clustering of Northern and Southern Populations , 2006, PLoS genetics.

[78]  D. Zaridze,et al.  Combining Two Technologies for Full Genome Sequencing of Human , 2009, Acta naturae.

[79]  Nicolas Ray,et al.  Principal component analysis under population genetic models of range expansion and admixture. , 2010, Molecular biology and evolution.

[80]  Omar E. Cornejo,et al.  The genetic prehistory of the New World Arctic , 2014, Science.