Massive migration from the steppe was a source for Indo-European languages in Europe

We generated genome-wide data from 69 Europeans who lived between 8,000 and 3,000 years ago by enriching ancient DNA libraries for a target set of almost 400,000 polymorphisms. Enrichment of these positions decreases the sequencing required for genome-wide ancient DNA analysis by a median of around 250-fold, allowing us to study an order of magnitude more individuals than previous studies and to obtain new insights about the past. We show that the populations of Western and Far Eastern Europe followed opposite trajectories between 8,000 and 5,000 years ago. At the beginning of the Neolithic period in Europe, ∼8,000–7,000 years ago, closely related groups of early farmers appeared in Germany, Hungary and Spain, different from indigenous hunter-gatherers, whereas Russia was inhabited by a distinctive population of hunter-gatherers with high affinity to a ∼24,000-year-old Siberian. By ∼6,000–5,000 years ago, farmers throughout much of Europe had more hunter-gatherer ancestry than their predecessors, but in Russia, the Yamnaya steppe herders of this time were descended not only from the preceding eastern European hunter-gatherers, but also from a population of Near Eastern ancestry. Western and Eastern Europe came into contact ∼4,500 years ago, as the Late Neolithic Corded Ware people from Germany traced ∼75% of their ancestry to the Yamnaya, documenting a massive migration into the heartland of Europe from its eastern periphery. This steppe ancestry persisted in all sampled central Europeans until at least ∼3,000 years ago, and is ubiquitous in present-day Europeans. These results provide support for a steppe origin of at least some of the Indo-European languages of Europe.
