Principal component analysis under population genetic models of range expansion and admixture.

In a series of highly influential publications, Cavalli-Sforza and colleagues used principal component (PC) analysis to produce maps depicting how human genetic diversity varies across geographic space. Within Europe, the first axis of variation (PC1) was interpreted as evidence for the demic diffusion model of agriculture, in which farmers expanded from the Near East approximately 10,000 years ago and replaced the resident hunter-gatherer populations with little or no interbreeding. These interpretations of the PC maps have been recently questioned as the original results can be reproduced under models of spatially covarying allele frequencies without any expansion. Here, we study PC maps for data simulated under models of range expansion and admixture. Our simulations include a spatially realistic model of Neolithic farmer expansion and assume various levels of interbreeding between farmer and resident hunter-gatherer populations. An important result is that under a broad range of conditions, the gradients in PC1 maps are oriented along a direction perpendicular to the axis of the expansion, rather than along the same axis as the expansion. We propose that this surprising pattern is an outcome of the "allele surfing" phenomenon, which creates sectors of high allele-frequency differentiation that align perpendicular to the direction of the expansion.

[1]  Robert R. Sokal,et al.  Spatial Autocorrelations of HLA Frequencies in Europe Support Demic Diffusion of Early Farmers , 1982, The American Naturalist.

[2]  L. Cavalli-Sforza,et al.  Demic expansions and human evolution , 1993, Science.

[3]  M. Feldman,et al.  Worldwide Human Relationships Inferred from Genome-Wide Patterns of Variation , 2008 .

[4]  D. F. Roberts,et al.  The History and Geography of Human Genes , 1996 .

[5]  M. Stephens,et al.  Interpreting principal component analyses of spatial population genetic variation , 2008, Nature Genetics.

[6]  Amit R. Indap,et al.  Genes mirror geography within Europe , 2008, Nature.

[7]  Shuichi Matsumura,et al.  Genetic Discontinuity Between Local Hunter-Gatherers and Central Europe’s First Farmers , 2009, Science.

[8]  Alberto Piazza,et al.  Simulation and Separation by Principal Components of Multiple Demic Expansions in Europe , 1986, The American Naturalist.

[9]  Lounès Chikhi,et al.  Estimating the impact of prehistoric admixture on the genome of Europeans. , 2004, Molecular biology and evolution.

[10]  Paul Mellars,et al.  Neanderthals and the modern human colonization of Europe , 2004, Nature.

[11]  Newton E. Morton,et al.  The Neolithic Transition and the Genetics of Populations in Europe. , 2022 .

[12]  Olivier François,et al.  Bayesian clustering algorithms ascertaining spatial population structure: a new computer program and a comparison study , 2007 .

[13]  Oskar Hallatschek,et al.  Gene surfing in expanding populations. , 2007, Theoretical population biology.

[14]  C. Hoggart,et al.  Genome-wide association analysis of metabolic traits in a birth cohort from a founder population , 2008, Nature Genetics.

[15]  Flora Jay,et al.  Spatial inference of admixture proportions and secondary contact zones. , 2009, Molecular biology and evolution.

[16]  Ole A. Andreassen,et al.  The Impact of Divergence Time on the Nature of Population Structure: An Example from Iceland , 2009, PLoS genetics.

[17]  P. Rowley-Conwy Human Prehistory: Hunting for the Earliest Farmers , 2009, Current Biology.

[18]  R R Sokal,et al.  Indo-European origins: a computer-simulation test of five hypotheses. , 1995, American journal of physical anthropology.

[19]  G. Barbujani,et al.  Genetic evidence on origin and dispersal of human populations speaking languages of the Nostratic macrofamily. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Justin M. J. Travis,et al.  Landscape structure and boundary effects determine the fate of mutations occurring during range expansions , 2008, Heredity.

[21]  T. Sluckin,et al.  Modelling Paleoindian dispersals , 1998 .

[22]  G. McVean A Genealogical Interpretation of Principal Components Analysis , 2009, PLoS genetics.

[23]  Noel A Cressie,et al.  Statistics for Spatial Data. , 1992 .

[24]  J. Diamond,et al.  Farmers and Their Languages: The First Expansions , 2003, Science.

[25]  Mark D Shriver,et al.  Measuring European population stratification with microarray genotype data. , 2007, American journal of human genetics.

[26]  B. Rannala,et al.  The Bayesian revolution in genetics , 2004, Nature Reviews Genetics.

[27]  Lounès Chikhi,et al.  Y genetic data support the Neolithic demic diffusion model , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[28]  R. Cann The history and geography of human genes , 1995, The Journal of Asian Studies.

[29]  Laurent Excoffier,et al.  The effect of the Neolithic expansion on European molecular diversity , 2005, Proceedings of the Royal Society B: Biological Sciences.

[30]  V. Pascali,et al.  Clines of nuclear DNA markers suggest a largely neolithic ancestry of the European gene pool. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[31]  S. Heath,et al.  Investigation of the fine structure of European populations with applications to disease association studies , 2008, European Journal of Human Genetics.

[32]  C. Tyler-Smith,et al.  Human Evolutionary Genetics , 2004 .

[33]  Robert Haining,et al.  Statistics for spatial data: by Noel Cressie, 1991, John Wiley & Sons, New York, 900 p., ISBN 0-471-84336-9, US $89.95 , 1993 .

[34]  Ron Pinhasi,et al.  Tracing the Origin and Spread of Agriculture in Europe , 2005, PLoS biology.

[35]  Mike Rees,et al.  5. Statistics for Spatial Data , 1993 .

[36]  D. Reich,et al.  Population Structure and Eigenanalysis , 2006, PLoS genetics.

[37]  Guido Barbujani,et al.  Africans and Asians abroad: genetic diversity in Europe. , 2004, Annual review of genomics and human genetics.

[38]  Laurent Excoffier,et al.  splatche: a program to simulate genetic diversity taking into account environmental heterogeneity , 2004 .

[39]  R R Sokal,et al.  A Problem with Synthetic Maps , 1999, Human biology.

[40]  Pablo Villoslada,et al.  Analysis and Application of European Genetic Substructure Using 300 K SNP Information , 2008, PLoS genetics.

[41]  R J Mitchell,et al.  Y-chromosomal diversity in Europe is clinal and influenced primarily by geography, rather than by language. , 2000, American journal of human genetics.

[42]  Norman Davies,et al.  Europe: A History , 1996 .

[43]  Seraina Klopfstein,et al.  The fate of mutations surfing on the wave of a range expansion. , 2006, Molecular biology and evolution.

[44]  R R Sokal,et al.  Spatial patterns of human gene frequencies in Europe. , 1989, American journal of physical anthropology.

[45]  Zachary A. Szpiech,et al.  Genotype, haplotype and copy-number variation in worldwide human populations , 2008, Nature.

[46]  S. Zegura,et al.  Human Evolutionary Genetics: Origins, Peoples and Disease. , 2005 .

[47]  P. Underhill,et al.  Y chromosome diversity, human expansion, drift, and cultural evolution , 2009, Proceedings of the National Academy of Sciences.

[48]  John Novembre,et al.  Global distribution of genomic diversity underscores rich complex history of continental human populations. , 2009, Genome research.

[49]  P. Mellars Archeology and the dispersal of modern humans in Europe: Deconstructing the “Aurignacian” , 2006 .

[50]  Montgomery Slatkin,et al.  ISOLATION BY DISTANCE IN EQUILIBRIUM AND NON‐EQUILIBRIUM POPULATIONS , 1993, Evolution; international journal of organic evolution.

[51]  Christian Gieger,et al.  Correlation between Genetic and Geographic Structure in Europe , 2008, Current Biology.

[52]  M. Slatkin,et al.  Estimation of levels of gene flow from DNA sequence data. , 1992, Genetics.

[53]  P. Donnelly,et al.  Inference of population structure using multilocus genotype data. , 2000, Genetics.

[54]  L. Excoffier,et al.  Surfing during population expansions promotes genetic revolutions and structuration. , 2008, Trends in ecology & evolution.

[55]  P. Menozzi,et al.  Synthetic maps of human gene frequencies in Europeans. , 1978, Science.

[56]  L. Excoffier,et al.  The Hidden Side of Invasions: Massive Introgression by Local Genes , 2008, Evolution; international journal of organic evolution.

[57]  D. Nelson,et al.  Genetic drift at expanding frontiers promotes gene segregation , 2007, Proceedings of the National Academy of Sciences.

[58]  L. Excoffier,et al.  Modern Humans Did Not Admix with Neanderthals during Their Range Expansion into Europe , 2004, PLoS biology.

[59]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[60]  Christopher A. Edmonds,et al.  Mutations arising in the wave front of an expanding population. , 2004, Proceedings of the National Academy of Sciences of the United States of America.