Signatures of Introgression across the Allele Frequency Spectrum

The detection of introgression from genomic data is transforming our view of species and the origins of adaptive variation. Among the most widely used approaches to detect introgression is the so-called ABBA-BABA test, or D statistic, which identifies excess allele sharing between non-sister taxa. Part of the appeal of D is its simplicity, but this also limits its informativeness, particularly about the timing and direction of introgression. Here we present a simple extension, the D frequency spectrum or DFS, in which D is partitioned according to the frequencies of derived alleles. We use simulations over a large parameter space to show how DFS carries information about various factors. In particular, recent introgression reliably leads to a peak in DFS among low-frequency derived alleles, whereas violation of model assumptions can lead to a lack of signal at low frequencies. We also reanalyse published empirical data from six different animal and plant taxa and interpret the results in the light of our simulations, showing how DFS provides novel insights. We currently see DFS as a descriptive tool that will augment both simple and sophisticated tests for introgression, but in the future it may be usefully incorporated into probabilistic inference frameworks.
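To make the idea of partitioning D by derived allele frequency concrete, the following is a minimal sketch and not the authors' implementation. It assumes three populations P1, P2 and P3 with an outgroup fixed for the ancestral allele, and uses the standard frequency-based ABBA and BABA weights. The binning rule (ABBA weights assigned to the derived allele count in P2, BABA weights to the count in P1) is an assumption made for illustration, and the function names `d_statistic` and `d_frequency_spectrum` are hypothetical.

```python
from collections import defaultdict


def d_statistic(sites):
    """Genome-wide D from per-site derived allele frequencies.

    sites: iterable of (p1, p2, p3) tuples, the derived allele
    frequencies in P1, P2 and P3. The outgroup is assumed to be
    fixed for the ancestral allele at every site.
    """
    abba = baba = 0.0
    for p1, p2, p3 in sites:
        abba += (1 - p1) * p2 * p3  # derived allele shared by P2 and P3
        baba += p1 * (1 - p2) * p3  # derived allele shared by P1 and P3
    total = abba + baba
    return (abba - baba) / total if total else float("nan")


def d_frequency_spectrum(counts, n):
    """D partitioned by derived allele count (illustrative binning only).

    counts: iterable of (k1, k2, p3), where k1 and k2 are derived allele
    counts in P1 and P2 (each sampled with n haplotypes) and p3 is the
    derived allele frequency in P3.
    Assumption: ABBA weights are binned by k2 and BABA weights by k1.
    Returns {k: D_k} for every bin with a non-zero denominator.
    """
    abba = defaultdict(float)
    baba = defaultdict(float)
    for k1, k2, p3 in counts:
        p1, p2 = k1 / n, k2 / n
        if k2 > 0:
            abba[k2] += (1 - p1) * p2 * p3
        if k1 > 0:
            baba[k1] += p1 * (1 - p2) * p3
    spectrum = {}
    for k in range(1, n + 1):
        total = abba[k] + baba[k]
        if total > 0:
            spectrum[k] = (abba[k] - baba[k]) / total
    return spectrum


if __name__ == "__main__":
    # Toy data: four sites, n = 4 haplotypes sampled in P1 and in P2.
    toy = [(0, 1, 1.0), (0, 1, 1.0), (1, 0, 1.0), (2, 2, 1.0)]
    print(d_frequency_spectrum(toy, n=4))
    print(d_statistic([(k1 / 4, k2 / 4, p3) for k1, k2, p3 in toy]))
```

In this toy input the excess of P2 and P3 sharing sits entirely in the lowest frequency bin, which is the kind of pattern the abstract associates with recent introgression; real analyses would need many sites and an appropriate significance test.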
