Mapping the Origins and Expansion of the Indo-European Language Family

A Family of Languages English is part of the large Indo-European language family, which includes Celtic, Germanic, Italic, Balto-Slavic, and Indo-Iranian languages. The origin of this family is hotly debated: one hypothesis places the origin north of the Caspian Sea in the Pontic steppes, from where it was disseminated by Kurgan semi-nomadic pastoralists; a second suggests that Anatolia, in modern-day Turkey, is the source, and the language radiated with the spread of agriculture. Bouckaert et al. (p. 957) used phylogenetic methods and modeling to assess the geographical spread of the Indo-European language group. The findings support the suggestion that the origin of the language family was indeed Anatolia 7 to 10 thousand years ago—contemporaneous with the spread of agriculture. Spatial models of language lineage evolution support an Anatolian homeland for Indo-European languages. There are two competing hypotheses for the origin of the Indo-European language family. The conventional view places the homeland in the Pontic steppes about 6000 years ago. An alternative hypothesis claims that the languages spread from Anatolia with the expansion of farming 8000 to 9500 years ago. We used Bayesian phylogeographic approaches, together with basic vocabulary data from 103 ancient and contemporary Indo-European languages, to explicitly model the expansion of the family and test these hypotheses. We found decisive support for an Anatolian origin over a steppe origin. Both the inferred timing and root location of the Indo-European language trees fit with an agricultural expansion from Anatolia beginning 8000 to 9500 years ago. These results highlight the critical role that phylogeographic inference can play in resolving debates about human prehistory.

[1]  W. V. Snyder,et al.  Algorithm 531: Contour Plotting [J6] , 1978, TOMS.

[2]  Joseph Felsenstein,et al.  The number of evolutionary trees , 1978 .

[3]  Alice B. Kehoe Archaeology and Language: The Puzzle of Indo-European Origins , 1989, American Antiquity.

[4]  S. E. Hills,et al.  Illustration of Bayesian Inference in Normal Data Models Using Gibbs Sampling , 1990 .

[5]  Gerald M. Moser Changing Africa: The First Literary Generation of Independent Cape Verde , 1992 .

[6]  J. Kruskal,et al.  An Indoeuropean classification : a lexicostatistical experiment , 1992 .

[7]  D. Harris The Origins and Spread of Agriculture and Pastoralism in Eurasia , 1996 .

[8]  M. Steel,et al.  Modeling the covarion hypothesis of nucleotide substitution. , 1998, Mathematical biosciences.

[9]  W. H. Neill,et al.  Modelling animal movement as a persistent random walk in two dimensions: expected magnitude of net displacement , 2000 .

[10]  Andrew Rambaut,et al.  Estimating the rate of molecular evolution: incorporating non-contemporaneous sequences into maximum likelihood phylogenies , 2000, Bioinform..

[11]  Jonathan P. Bollback,et al.  Bayesian Inference of Phylogeny and Its Impact on Evolutionary Biology , 2001, Science.

[12]  M. Suchard,et al.  Bayesian selection of continuous-time Markov chain evolutionary models. , 2001, Molecular biology and evolution.

[13]  Tandy Warnow,et al.  Indo‐European and Computational Cladistics , 2002 .

[14]  C. Holden,et al.  Bantu language trees reflect the spread of farming across sub-Saharan Africa: a maximum-parsimony analysis , 2002, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[15]  Michael Balter Early Date for the Birth of Indo-European Languages , 2003, Science.

[16]  James Steele,et al.  Neolithic transition in Europe: the radiocarbon record revisited , 2003, Antiquity.

[17]  M. Suchard,et al.  Hierarchical phylogenetic models for analyzing multipartite sequence data. , 2003, Systematic biology.

[18]  J. Diamond,et al.  Farmers and Their Languages: The First Expansions , 2003, Science.

[19]  R. Gray,et al.  Language-tree divergence times support the Anatolian theory of Indo-European origin , 2003, Nature.

[20]  M. Pagel,et al.  Bayesian estimation of ancestral character states on phylogenies. , 2004, Systematic biology.

[21]  H. Kishino,et al.  Dating of the human-ape splitting by a molecular clock of mitochondrial DNA , 2005, Journal of Molecular Evolution.

[22]  O. Pybus,et al.  Bayesian coalescent inference of past population dynamics from molecular sequences. , 2005, Molecular biology and evolution.

[23]  Frederic Bartumeus,et al.  ANIMAL SEARCH STRATEGIES: A QUANTITATIVE RANDOM‐WALK ANALYSIS , 2005 .

[24]  M. Suchard,et al.  Joint Bayesian estimation of alignment and phylogeny. , 2005, Systematic biology.

[25]  G. Nicholls,et al.  FROM WORDS TO DATES: WATER INTO WINE, MATHEMAGIC OR PHYLOGENETIC INFERENCE? , 2005 .

[26]  J. Felsenstein Evolutionary trees from DNA sequences: A maximum likelihood approach , 2005, Journal of Molecular Evolution.

[27]  P. Forster,et al.  Phylogenetic Methods and the Prehistory of Languages , 2006 .

[28]  James Mallory,et al.  The Oxford Introduction to Proto-Indo-European and the Proto-Indo-European World , 2006 .

[29]  S. Ho,et al.  Relaxed Phylogenetics and Dating with Confidence , 2006, PLoS biology.

[30]  Geoff K. Nicholls,et al.  Dated ancestral trees from binary trait data and their application to the diversification of languages , 2007, 0711.1874.

[31]  M. Pagel,et al.  Frequency of word-use predicts rates of lexical evolution throughout Indo-European history , 2007, Nature.

[32]  Christopher J. Lee,et al.  Wagner and Dollo: a stochastic duet by composing two parsimonious solos. , 2008, Systematic biology.

[33]  David W. Anthony,et al.  The Horse, the Wheel, and Language: How Bronze-Age Riders from the Eurasian Steppes Shaped the Modern World , 2008 .

[34]  Simon J. Greenhill,et al.  Languages Evolve in Punctuational Bursts , 2008, Science.

[35]  Christopher Ehret,et al.  Bayesian phylogenetic analysis of Semitic languages identifies an Early Bronze Age origin of Semitic in the Near East , 2009, Proceedings of the Royal Society B: Biological Sciences.

[36]  L. Chikhi Update to Chikhi et al.'s “Clinal Variation in the Nuclear DNA of Europeans” (1998): Genetic Data and Storytelling—From Archaeogenetics to Astrologenetics? , 2009, Human biology.

[37]  Simon J. Greenhill,et al.  Does horizontal transmission invalidate cultural phylogenies? , 2009, Proceedings of the Royal Society B: Biological Sciences.

[38]  Simon J. Greenhill,et al.  Language Phylogenies Reveal Expansion Pulses and Pauses in Pacific Settlement , 2009, Science.

[39]  Alexei J. Drummond,et al.  Bayesian Phylogeography Finds Its Roots , 2009, PLoS Comput. Biol..

[40]  Simon J. Greenhill,et al.  The shape and tempo of language evolution , 2010, Proceedings of the Royal Society B: Biological Sciences.

[41]  William Martin,et al.  Networks uncover hidden lexical borrowing in Indo-European language evolution , 2010, Proceedings of the Royal Society B: Biological Sciences.

[42]  Simon J. Greenhill,et al.  Rise and fall of political complexity in island South-East Asia and the Pacific , 2010, Nature.

[43]  R. Villems,et al.  Ancient DNA from European Early Neolithic Farmers Reveals Their Near Eastern Affinities , 2010, PLoS biology.

[44]  M. Suchard,et al.  Phylogeography takes a relaxed random walk in continuous space and time. , 2010, Molecular biology and evolution.

[45]  N. Brucato,et al.  Ancient DNA reveals male diffusion through the Neolithic Mediterranean route , 2011, Proceedings of the National Academy of Sciences.

[46]  Rebecca R. Gray,et al.  Testing spatiotemporal hypothesis of bacterial evolution using methicillin-resistant Staphylococcus aureus ST239 genome-wide data within a bayesian framework. , 2011, Molecular biology and evolution.

[47]  R. Pinhasi,et al.  Craniometric data support a mosaic model of demic and cultural Neolithic diffusion to outlying regions of Europe , 2011, Proceedings of the Royal Society B: Biological Sciences.

[48]  Marc A Suchard,et al.  A Bayesian phylogenetic method to estimate unknown sequence ages. , 2011, Molecular biology and evolution.

[49]  M. Suchard,et al.  Bayesian Phylogenetics with BEAUti and the BEAST 1.7 , 2012, Molecular biology and evolution.

[50]  Uri Tadmor,et al.  Borrowability and the notion of basic vocabulary , 2012 .

[51]  Michael D. Hendy,et al.  Mathematical Elegance with Biochemical Realism: The Covarion Model of Molecular Evolution , 2001, Journal of Molecular Evolution.