Inference of recombination maps from a single pair of genomes and its application to archaic samples

Understanding the causes and consequences of recombination rate evolution is a fundamental goal in genetics that requires recombination maps from across the tree of life. Since statistical inference of recombination maps typically depends on large samples, reaching out studies to non-model organisms requires alternative tools. Here we extend the sequentially Markovian coalescent model to jointly infer demography and the variation in recombination along a pair of genomes. Using extensive simulations and sequence data from humans, fruit-flies and a fungal pathogen, we demonstrate that iSMC accurately infers recombination maps under a wide range of scenarios – remarkably, even from a single pair of unphased genomes. We exploit this possibility and reconstruct the recombination maps of archaic hominids. We report that the evolution of the recombination landscape follows the established phylogeny of Neandertals, Denisovans and modern human populations, as expected if the genomic distribution of crossovers in hominids is largely neutral.

[1]  W. G. Hill,et al.  The effect of linkage on limits to artificial selection. , 1966, Genetical research.

[2]  Paul Marjoram,et al.  Fast "coalescent" simulation , 2006, BMC Genetics.

[3]  G. McVean,et al.  A genealogical interpretation of linkage disequilibrium. , 2002, Genetics.

[4]  R. Hudson Two-locus sampling distributions and their application. , 2001, Genetics.

[5]  T. Wiehe,et al.  Topological linkage disequilibrium calculated from coalescent genealogies. , 2018, Theoretical population biology.

[6]  S. Bensch,et al.  Comparative analysis examining patterns of genomic differentiation across multiple episodes of population divergence in birds , 2017, bioRxiv.

[7]  Sriram Sankararaman,et al.  A genetic method for dating ancient genomes provides a direct estimate of human generation interval in the last 45,000 years , 2016, Proceedings of the National Academy of Sciences.

[8]  J. Sekelsky,et al.  Evolution of an MCM Complex in Flies That Promotes Meiotic Crossovers by Blocking BLM Helicase , 2012, Science.

[9]  D. Cutler,et al.  Population demographic history can cause the appearance of recombination hotspots. , 2012, American journal of human genetics.

[10]  K. Paigen,et al.  PRDM9 and Its Role in Genetic Recombination. , 2018, Trends in genetics : TIG.

[11]  T. Mailund,et al.  A fine-scale recombination map of the human–chimpanzee ancestor reveals faster change in humans than in chimpanzees and a strong impact of GC-biased gene conversion , 2014, Genome research.

[12]  Russell B. Corbett-Detig,et al.  The Drosophila Genome Nexus: A Population Genomic Resource of 623 Drosophila melanogaster Genomes, Including 197 from a Single Ancestral Range Population , 2015, Genetics.

[13]  Asger Hobolth,et al.  The SMC′ Is a Highly Accurate Approximation to the Ancestral Recombination Graph , 2015, Genetics.

[14]  N L Kaplan,et al.  The coalescent process in models with selection and recombination. , 1988, Genetics.

[15]  P. Donnelly,et al.  Estimating recombination rates from population genetic data. , 2001, Genetics.

[16]  Richard R. Hudson,et al.  The coalescent process in models with selection, recombination and geographic subdivision. , 1991, Genetical research.

[17]  B. Payseur,et al.  Genetic Analysis of Genome-Scale Recombination Rate Evolution in House Mice , 2011, PLoS genetics.

[18]  Yun S. Song,et al.  Robust and scalable inference of population history from hundreds of unphased whole genomes , 2016, Nature Genetics.

[19]  R. Durbin,et al.  Inference of human population history from individual whole-genome sequences. , 2011, Nature.

[20]  B. McDonald,et al.  Genome‐wide evidence for divergent selection between populations of a major agricultural pathogen , 2018, Molecular ecology.

[21]  Thomas Mailund,et al.  zipHMMlib: a highly optimised HMM library exploiting repetitions in the input to speed up the forward algorithm , 2013, BMC Bioinformatics.

[22]  R. Durbin,et al.  Inferring human population size and separation history from multiple genome sequences , 2014, Nature Genetics.

[23]  A. Auton,et al.  Recombination rate estimation in the presence of hotspots. , 2007, Genome research.

[24]  Gil McVean,et al.  Stable recombination hotspots in birds , 2015, Science.

[25]  Philip L. F. Johnson,et al.  A Draft Sequence of the Neandertal Genome , 2010, Science.

[26]  Yun S. Song,et al.  High-throughput inference of pairwise coalescence times identifies signals of selection and enriched disease heritability , 2018, bioRxiv.

[27]  J. Dutheil,et al.  Fine-Scale Recombination Maps of Fungal Plant Pathogens Reveal Dynamic Recombination Landscapes and Intragenic Hotspots , 2017, Genetics.

[28]  P. Visscher,et al.  10 Years of GWAS Discovery: Biology, Function, and Translation. , 2017, American journal of human genetics.

[29]  S. Kingan,et al.  Molecular Evolution at a Meiosis Gene Mediates Species Differences in the Rate and Patterning of Recombination , 2018, Current Biology.

[30]  R. Hudson,et al.  Statistical properties of the number of recombination events in the history of a sample of DNA sequences. , 1985, Genetics.

[31]  Carsten Wiuf,et al.  Gene Genealogies, Variation and Evolution - A Primer in Coalescent Theory , 2004 .

[32]  H. Ellegren,et al.  Determinants of genetic diversity , 2016, Nature Reviews Genetics.

[33]  Philipp W. Messer,et al.  Quantification of GC-biased gene conversion in the human genome , 2014, bioRxiv.

[34]  Jonathan Terhorst,et al.  High-throughput inference of pairwise coalescence times identifies signals of selection and enriched disease heritability , 2018, Nature Genetics.

[35]  Arne Ludwig,et al.  Ancient genomic changes associated with domestication of the horse , 2017, Science.

[36]  Lex E. Flagel,et al.  Speciation and Introgression between Mimulus nasutus and Mimulus guttatus , 2013, bioRxiv.

[37]  M. Slatkin,et al.  Ancient DNA and human history , 2016, Proceedings of the National Academy of Sciences.

[38]  Ethan L. Stewart,et al.  The Impact of Recombination Hotspots on Genome Evolution of a Fungal Plant Pathogen , 2015, Genetics.

[39]  Katherine S. Pollard,et al.  The Role of GC-Biased Gene Conversion in Shaping the Fastest Evolving Regions of the Human Genome , 2011, Molecular biology and evolution.

[40]  Camilo Salazar,et al.  Recombination rate variation shapes barriers to introgression across butterfly genomes , 2018, bioRxiv.

[41]  L. Duret,et al.  The Red Queen Model of Recombination Hotspots Evolution in the Light of Archaic and Modern Human Genomes , 2014, PLoS genetics.

[42]  P. Donnelly,et al.  Drive Against Hotspot Motifs in Primates Implicates the PRDM9 Gene in Meiotic Recombination , 2010, Science.

[43]  G. McVean,et al.  Estimating recombination rates from population-genetic data , 2003, Nature Reviews Genetics.

[44]  John Maynard Smith,et al.  The hitch-hiking effect of a favourable gene. , 1974, Genetical research.

[45]  G. Coop,et al.  Live Hot, Die Young: Transmission Distortion in Recombination Hotspots , 2007, PLoS genetics.

[46]  Sean R. Eddy,et al.  Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids , 1998 .

[47]  Adrian W. Briggs,et al.  A High-Coverage Genome Sequence from an Archaic Denisovan Individual , 2012, Science.

[48]  Andrew H. Chan,et al.  Genome-Wide Fine-Scale Recombination Rate Variation in Drosophila melanogaster , 2012, PLoS genetics.

[49]  Simon H. Martin,et al.  Interpreting the genomic landscape of introgression. , 2017, Current opinion in genetics & development.

[50]  Yun S. Song,et al.  The Simons Genome Diversity Project: 300 genomes from 142 diverse populations , 2016, Nature.

[51]  H. Akaike A new look at the statistical model identification , 1974 .

[52]  J. Dutheil Hidden Markov Models in Population Genomics. , 2017, Methods in molecular biology.

[53]  Heng Li,et al.  Genome sequence of a 45,000-year-old modern human from western Siberia , 2014, Nature.

[54]  B. Payseur,et al.  Genomic signatures of selection at linked sites: unifying the disparity among species , 2013, Nature Reviews Genetics.

[55]  S. Martin,et al.  Patterns of Z chromosome divergence among Heliconius species highlight the importance of historical demography , 2017, bioRxiv.

[56]  L. Keller,et al.  Introgression from Domestic Goat Generated Variation at the Major Histocompatibility Complex of Alpine Ibex , 2014, PLoS genetics.

[57]  R. Griffiths,et al.  An ancestral recombination graph , 1997 .

[58]  Daniel L. Powell,et al.  Natural selection interacts with recombination to shape the evolution of hybrid genomes , 2018, Science.

[59]  Laurent Duret,et al.  Biased gene conversion and the evolution of mammalian genomic landscapes. , 2009, Annual review of genomics and human genetics.

[60]  G. Coop,et al.  PRDM9 Is a Major Determinant of Meiotic Recombination Hotspots in Humans and Mice , 2010, Science.

[61]  C. Smadja,et al.  Variation in recombination frequency and distribution across eukaryotes: patterns and processes , 2017, Philosophical Transactions of the Royal Society B: Biological Sciences.

[63]  H. Ellegren,et al.  Recombination Rate Variation Modulates Gene Sequence Evolution Mainly via GC-Biased Gene Conversion, Not Hill–Robertson Interference, in an Avian System , 2015, Molecular biology and evolution.

[64]  P. Marjoram,et al.  Ancestral Inference from Samples of DNA Sequences with Recombination , 1996, J. Comput. Biol..

[65]  R. Redfield,et al.  The hotspot conversion paradox and the evolution of meiotic recombination. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[66]  Philip L. F. Johnson,et al.  The complete genome sequence of a Neandertal from the Altai Mountains , 2013, Nature.

[67]  L. Duret,et al.  The Red Queen model of recombination hot-spot evolution: a theoretical investigation , 2017, Philosophical Transactions of the Royal Society B: Biological Sciences.

[68]  B. Payseur,et al.  Effects of Demographic History on the Detection of Recombination Hotspots from Linkage Disequilibrium , 2017, Molecular biology and evolution.

[69]  John A. Kamm,et al.  Two-Locus Likelihoods Under Variable Population Size and Fine-Scale Recombination Rate Estimation , 2015, Genetics.

[70]  Montgomery Slatkin,et al.  Linkage disequilibrium — understanding the evolutionary past and mapping the medical future , 2008, Nature Reviews Genetics.

[71]  M. Noor,et al.  Recombining without Hotspots: A Comprehensive Evolutionary Portrait of Recombination in Two Closely Related Species of Drosophila , 2015, bioRxiv.

[72]  Paul R. Staab,et al.  scrm: efficiently simulating long sequences using the approximated coalescent with recombination , 2015, Bioinform..

[73]  Noah A. Rosenberg,et al.  Genealogical trees, coalescent theory and the analysis of genetic polymorphisms , 2002, Nature Reviews Genetics.

[74]  P. Keightley,et al.  Interference among deleterious mutations favours sex and recombination in finite populations , 2006, Nature.

[75]  T. Mailund,et al.  Unraveling recombination rate evolution using ancestral recombination maps , 2014, BioEssays : news and reviews in molecular, cellular and developmental biology.

[76]  H. Ellegren,et al.  Whole‐genome patterns of linkage disequilibrium across flycatcher populations clarify the causes and consequences of fine‐scale recombination rate variation in birds , 2017, Molecular ecology.

[77]  P. Donnelly,et al.  The Fine-Scale Structure of Recombination Rate Variation in the Human Genome , 2004, Science.

[78]  Nathan K. Schaefer,et al.  Natural selection shaped the rise and fall of passenger pigeon genomic diversity , 2017, Science.

[79]  Ryan D. Hernandez,et al.  A Fine-Scale Chimpanzee Genetic Map from Population Sequencing , 2012, Science.

[80]  F. Zhao,et al.  Population Genomics Reveals Speciation and Introgression between Brown Norway Rats and Their Sibling Species , 2017, Molecular biology and evolution.

[81]  G. McVean,et al.  Approximating the coalescent with recombination , 2005, Philosophical Transactions of the Royal Society B: Biological Sciences.

[82]  A. Gylfason,et al.  Fine-scale recombination rate differences between sexes, populations and individuals , 2010, Nature.

[83]  J. Hein,et al.  Recombination as a point process along sequences. , 1999, Theoretical population biology.

[84]  Douglas G. Scofield,et al.  Natural Selection and Recombination Rate Variation Shape Nucleotide Polymorphism Across the Genomes of Three Related Populus Species , 2015, Genetics.

[85]  Morris Swertz,et al.  Genome-wide patterns and properties of de novo mutations in humans , 2015, Nature Genetics.

[86]  Nicolas C. Rochette,et al.  Bio++: efficient extensible libraries and tools for computational molecular evolution. , 2013, Molecular biology and evolution.

[87]  T. Speed,et al.  Biological Sequence Analysis , 1998 .