Widespread selection and gene flow shape the genomic landscape during a radiation of monkeyflowers

Speciation genomic studies aim to interpret patterns of genome-wide variation in light of the processes that give rise to new species. However, interpreting the genomic ‘landscape’ of speciation is difficult, because many evolutionary processes can impact levels of variation. Facilitated by the first chromosome-level assembly for the group, we use whole-genome sequencing and simulations to shed light on the processes that have shaped the genomic landscape during a recent radiation of monkeyflowers. After inferring the phylogenetic relationships among the nine taxa in this radiation, we show that highly similar diversity (π) and differentiation (FST) landscapes have emerged across the group. Variation in these landscapes was strongly predicted by the local density of functional elements and the recombination rate, suggesting that the landscapes have been shaped by widespread natural selection. Using the varying divergence times between pairs of taxa, we show that the correlations between FST and genome features arose almost immediately after a population split and have become stronger over time. Simulations of genomic landscape evolution suggest that background selection (i.e., selection against deleterious mutations) alone is too subtle to generate the observed patterns, but scenarios that involve positive selection and genetic incompatibilities are plausible alternative explanations. Finally, tests for introgression among these taxa reveal widespread evidence of heterogeneous selection against gene flow during this radiation. Thus, combined with existing evidence for adaptation in this system, we conclude that the correlation in FST among these taxa informs us about the genomic basis of adaptation and speciation in this system.What can patterns of genome-wide variation tell us about the speciation process? The answer to this question depends upon our ability to infer the evolutionary processes underlying these patterns. This, however, is difficult, because many processes can leave similar footprints, but some have nothing to do with speciation per se. For example, many studies have found highly heterogeneous levels of genetic differentiation when comparing the genomes of emerging species. These patterns are often referred to as differentiation ‘landscapes’ because they appear as a rugged topography of ‘peaks’ and ‘valleys’ as one scans across the genome. It has often been argued that selection against deleterious mutations, a process referred to as background selection, is primarily responsible for shaping differentiation landscapes early in speciation. If this hypothesis is correct, then it is unlikely that patterns of differentiation will reveal much about the genomic basis of speciation. However, using genome sequences from nine emerging species of monkeyflower coupled with simulations of genomic divergence, we show that it is unlikely that background selection is the primary architect of these landscapes. Rather, differentiation landscapes have probably been shaped by adaptation and gene flow, which are processes that are central to our understanding of speciation. Therefore, our work has important implications for our understanding of what patterns of differentiation can tell us about the genetic basis of adaptation and speciation.

[1]  C. Jiggins,et al.  Glittering gold and the quest for Isla de Muerta , 2017, Journal of evolutionary biology.

[2]  Timothy B Sackton,et al.  Natural Selection Constrains Neutral Diversity across A Wide Range of Species , 2014, bioRxiv.

[3]  N L Kaplan,et al.  Deleterious background selection with recombination. , 1995, Genetics.

[4]  B. Browning,et al.  Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. , 2007, American journal of human genetics.

[5]  David Reich,et al.  Testing for ancient admixture between closely related populations. , 2011, Molecular biology and evolution.

[6]  Colin N. Dewey,et al.  Genomic Variation in Natural Populations of Drosophila melanogaster , 2012, Genetics.

[7]  M. Murray,et al.  All that's gold does not glitter , 2002 .

[8]  B. Charlesworth Measures of divergence between populations and the effect of forces that reduce variability. , 1998, Molecular biology and evolution.

[9]  Nicholas Stiffler,et al.  Population Genomics of Parallel Adaptation in Threespine Stickleback using Sequenced RAD Tags , 2010, PLoS genetics.

[10]  J. Gillespie Genetic drift in an infinite population. The pseudohitchhiking model. , 2000, Genetics.

[11]  J. Kohn,et al.  CONTRASTING PATTERNS OF FLORAL AND MOLECULAR VARIATION ACROSS A CLINE IN MIMULUS AURANTIACUS , 2005, Evolution; international journal of organic evolution.

[12]  A. Yoder,et al.  What is Speciation Genomics? The roles of ecology, gene flow, and genomic architecture in the formation of species , 2018, Biological Journal of the Linnean Society.

[13]  Camilo Salazar,et al.  Recombination rate variation shapes barriers to introgression across butterfly genomes , 2018, bioRxiv.

[14]  D. J. Funk,et al.  Divergent selection and heterogeneous genomic divergence , 2009, Molecular ecology.

[15]  Miriam Liedvogel,et al.  Correlated patterns of genetic diversity and differentiation across an avian family , 2017, bioRxiv.

[16]  James M Sobel,et al.  Strong premating reproductive isolation drives incipient speciation in Mimulus aurantiacus , 2015, Evolution; international journal of organic evolution.

[17]  R. Burri Dissecting differentiation landscapes: a linked selection's perspective , 2017, Journal of evolutionary biology.

[18]  Matthew W. Hahn,et al.  Toward a Selection Theory of Molecular Evolution , 2008, Evolution; international journal of organic evolution.

[19]  S. Bensch,et al.  Comparative analysis examining patterns of genomic differentiation across multiple episodes of population divergence in birds , 2017, bioRxiv.

[20]  Chung-I Wu The genic view of the process of speciation , 2001 .

[21]  Evgeny M. Zdobnov,et al.  BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs , 2015, Bioinform..

[22]  Keith Bradnam,et al.  CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes , 2007, Bioinform..

[23]  U. Dieckmann,et al.  Hybridization and speciation , 2013, Journal of evolutionary biology.

[24]  R. Faria,et al.  Interpreting the genomic landscape of speciation: a road map for finding barriers to gene flow , 2017, Journal of evolutionary biology.

[25]  R. Durrett,et al.  Spatial models for hybrid zones , 2000, Heredity.

[26]  E. Birney,et al.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs. , 2008, Genome research.

[27]  R. Gibbs,et al.  Mind the Gap: Upgrading Genomes with Pacific Biosciences RS Long-Read Sequencing Technology , 2012, PloS one.

[28]  Simon H. Martin,et al.  Evaluating the Use of ABBA–BABA Statistics to Locate Introgressed Loci , 2014, bioRxiv.

[29]  R. Burri Interpreting differentiation landscapes in the light of long‐term linked selection , 2017 .

[30]  S. Baird Exploring linkage disequilibrium , 2015, Molecular ecology resources.

[31]  John Maynard Smith,et al.  The hitch-hiking effect of a favourable gene. , 1974, Genetical research.

[32]  H. A. Orr,et al.  PATTERNS OF SPECIATION IN DROSOPHILA , 1989, Evolution; international journal of organic evolution.

[33]  Alexandros Stamatakis,et al.  RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies , 2014, Bioinform..

[34]  Andrew Josey Updates , 2003, login Usenix Mag..

[35]  A. Amores,et al.  Stacks: Building and Genotyping Loci De Novo From Short-Read Sequences , 2011, G3: Genes | Genomes | Genetics.

[36]  J. Pease,et al.  MORE ACCURATE PHYLOGENIES INFERRED FROM LOW‐RECOMBINATION REGIONS IN THE PRESENCE OF INCOMPLETE LINEAGE SORTING , 2013, Evolution; international journal of organic evolution.

[37]  D. Marques,et al.  A Combinatorial View on Speciation and Adaptive Radiation. , 2019, Trends in ecology & evolution.

[38]  N. Barton,et al.  Analysis of Hybrid Zones , 1985 .

[39]  N. Nagarajan,et al.  OPERA-LG: efficient and exact scaffolding of large, repeat-rich eukaryotic genomes with performance guarantees , 2016, Genome Biology.

[40]  Matthew W. Hahn,et al.  Phylogenomics Reveals Three Sources of Adaptive Variation during a Rapid Radiation , 2016, PLoS biology.

[41]  H. Ellegren,et al.  Making sense of genomic islands of differentiation in light of speciation , 2016, Nature Reviews Genetics.

[42]  W. Cresko,et al.  Repeated Selection of Alternatively Adapted Haplotypes Creates Sweeping Genomic Remodeling in Stickleback , 2018, Genetics.

[43]  V. Grant Effects of hybridization and selection on floral isolation. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[44]  Remco R. Bouckaert,et al.  DensiTree: making sense of sets of phylogenetic trees , 2010, Bioinform..

[45]  W. Salzburger,et al.  The genomics of organismal diversification illuminated by adaptive radiations. , 2015, Trends in genetics : TIG.

[46]  M. Nei,et al.  Mathematical model for studying genetic variation in terms of restriction endonucleases. , 1979, Proceedings of the National Academy of Sciences of the United States of America.

[47]  A. Amores,et al.  A RAD-Tag Genetic Map for the Platyfish (Xiphophorus maculatus) Reveals Mechanisms of Karyotype Evolution Among Teleost Fish , 2014, Genetics.

[48]  Rolf Apweiler,et al.  InterProScan: protein domains identifier , 2005, Nucleic Acids Res..

[49]  Lex E. Flagel,et al.  Speciation and Introgression between Mimulus nasutus and Mimulus guttatus , 2013, bioRxiv.

[50]  M. Whitlock,et al.  Background selection and the statistics of population differentiation: consequences for detecting local adaptation , 2018, bioRxiv.

[51]  M. Wikelski,et al.  The genomic landscape underlying phenotypic integrity in the face of gene flow in crows , 2014, Science.

[52]  V. Grant Origin of floral isolation between ornithophilous and sphingophilous plant species. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[53]  Matthew W. Hahn,et al.  Genomic Islands of Speciation in Anopheles gambiae , 2005, PLoS biology.

[54]  Simon H. Martin,et al.  Genome-wide evidence for speciation with gene flow in Heliconius butterflies , 2013, Genome research.

[55]  D. Schluter,et al.  Natural Selection on a Major Armor Gene in Threespine Stickleback , 2008, Science.

[56]  Philip L. F. Johnson,et al.  A Draft Sequence of the Neandertal Genome , 2010, Science.

[57]  M. Slatkin Inbreeding coefficients and coalescence times. , 1991, Genetical research.

[58]  Matthew A. Streisfeld,et al.  Divergent Selection Drives Genetic Differentiation in an R2R3-MYB Transcription Factor That Contributes to Incipient Speciation in Mimulus aurantiacus , 2013, PLoS genetics.

[59]  J. M. Smith,et al.  The hitch-hiking effect of a favourable gene. , 1974, Genetical research.

[60]  Pasi Rastas,et al.  Construction of Ultradense Linkage Maps with Lep-MAP2: Stickleback F2 Recombinant Crosses as an Example , 2015, Genome biology and evolution.

[61]  B. Grant,et al.  Gene flow, ancient polymorphism, and ecological adaptation shape the genomic landscape of divergence among Darwin's finches , 2017, Genome research.

[62]  H. Ellegren,et al.  How Linked Selection Shapes the Diversity Landscape in Ficedula Flycatchers , 2019, Genetics.

[63]  Heng Li Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM , 2013, 1303.3997.

[64]  Kevin R. Thornton,et al.  Efficient pedigree recording for fast population genetics simulation , 2018, bioRxiv.

[65]  Carson C Chow,et al.  Second-generation PLINK: rising to the challenge of larger and richer datasets , 2014, GigaScience.

[66]  J. Felsenstein Phylogenies and the Comparative Method , 1985, The American Naturalist.

[67]  Matthew A. Streisfeld,et al.  Variation in ecophysiological traits might contribute to ecogeographic isolation and divergence between parapatric ecotypes of Mimulus aurantiacus , 2019, Journal of evolutionary biology.

[68]  Madeline A. Chase,et al.  Genomewide variation provides insight into evolutionary relationships in a monkeyflower species complex (Mimulus sect. Diplacus). , 2017, American journal of botany.

[69]  Angel Amores,et al.  Stacks: an analysis tool set for population genomics , 2013, Molecular ecology.

[70]  M. Rockman THE QTN PROGRAM AND THE ALLELES THAT MATTER FOR EVOLUTION: ALL THAT'S GOLD DOES NOT GLITTER , 2012, Evolution; international journal of organic evolution.

[71]  R. Hudson Gene genealogies and the coalescent process. , 1990 .

[72]  M. Kimura Evolutionary Rate at the Molecular Level , 1968, Nature.

[73]  Wei Zhang,et al.  Hybridization Reveals the Evolving Genomic Architecture of Speciation , 2013, Cell reports.

[74]  Mark Yandell,et al.  MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects , 2011, BMC Bioinformatics.

[75]  Dan M. Bolser,et al.  Ensembl Genomes 2016: more genomes, more complexity , 2015, Nucleic Acids Res..

[76]  Pall I. Olason,et al.  Linked selection and recombination rate variation drive the evolution of the genomic landscape of differentiation across the speciation continuum of Ficedula flycatchers , 2015, Genome research.

[77]  C. Alex Buerkle,et al.  Stick Insect Genomes Reveal Natural Selection’s Role in Parallel Speciation , 2014, Science.

[78]  Matthew A. Streisfeld,et al.  Introgressive hybridization facilitates adaptive divergence in a recent radiation of monkeyflowers , 2015, Proceedings of the Royal Society B: Biological Sciences.

[79]  Sofia M. C. Robb,et al.  MAKER: an easy-to-use annotation pipeline designed for emerging model organism genomes. , 2007, Genome research.

[80]  J. Gale Hitch-Hiking Effect of a Favourable Gene , 1980 .

[81]  B. Charlesworth,et al.  The effect of deleterious mutations on neutral molecular variation. , 1993, Genetics.

[82]  D. M. Thompson Systematics of Mimulus subgenus Schizoplacus (Scrophulariaceae) , 2005 .

[83]  Aurélien Grosdidier,et al.  APDB: a novel measure for benchmarking sequence alignment methods without reference alignments , 2003, ISMB.

[84]  A. Gnirke,et al.  High-quality draft assemblies of mammalian genomes from massively parallel sequence data , 2010, Proceedings of the National Academy of Sciences.

[85]  D. Schluter,et al.  The Ecology of Adaptive Radiation , 2000 .

[86]  T. Mitchell-Olds,et al.  Molecular systematics of the Brassicaceae: evidence from coding plastidic matK and nuclear Chs sequences. , 2001, American journal of botany.

[87]  T. Ohta Slightly Deleterious Mutant Substitutions in Evolution , 1973, Nature.

[88]  M. Grabherr,et al.  Evolution of Darwin’s finches and their beaks revealed by genome sequencing , 2015, Nature.

[89]  J. Jurka,et al.  Repbase Update, a database of eukaryotic repetitive elements , 2005, Cytogenetic and Genome Research.

[90]  Brook T. Moyers,et al.  Genomic islands of divergence are not affected by geography of speciation in sunflowers , 2013, Nature Communications.

[91]  Matthew W. Hahn,et al.  Reanalysis suggests that genomic islands of speciation are due to reduced diversity, not reduced gene flow , 2014, Molecular ecology.

[92]  Philipp W. Messer,et al.  SLiM 3: Forward Genetic Simulations Beyond the Wright–Fisher Model , 2018, bioRxiv.

[93]  Ancient genomic variation underlies repeated ecological adaptation in young stickleback populations , 2017 .

[94]  R. Hudson,et al.  A statistical test for detecting geographic subdivision. , 1992, Molecular biology and evolution.

[95]  Inna Dubchak,et al.  The genome portal of the Department of Energy Joint Genome Institute: 2014 updates , 2013, Nucleic Acids Res..

[96]  P. Nosil,et al.  The genomics of speciation-with-gene-flow. , 2012, Trends in genetics : TIG.

[97]  Matthew A. Streisfeld,et al.  Geographic cline analysis as a tool for studying genome‐wide variation: a case study of pollinator‐mediated divergence in a monkeyflower , 2017, Molecular ecology.

[98]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[99]  Vickery,et al.  Speciation by aneuploidy and polyploidy in Mimulus (Scrophulariaceae) , 1995 .

[100]  Matthew A. Streisfeld,et al.  The geography of divergence with gene flow facilitates multitrait adaptation and the evolution of pollinator isolation in Mimulus aurantiacus , 2015, Evolution; international journal of organic evolution.

[101]  Richard J. Challis,et al.  Genomic islands of speciation separate cichlid ecomorphs in an East African crater lake , 2015, Science.

[102]  J. Kohn,et al.  Environment and pollinator‐mediated selection on parapatric floral races of Mimulus aurantiacus , 2007, Journal of evolutionary biology.

[103]  L. Rieseberg,et al.  Plant Speciation , 2007, Science.

[104]  Pall I. Olason,et al.  The genomic landscape of species divergence in Ficedula flycatchers , 2012, Nature.

[105]  Peter L. Ralph,et al.  Patterns of Neutral Diversity Under General Models of Selective Sweeps , 2011, Genetics.

[106]  Philipp W. Messer,et al.  Tree‐sequence recording in SLiM opens new horizons for forward‐time simulation of whole genomes , 2019, Molecular ecology resources.

[107]  Mario Stanke,et al.  Gene prediction with a hidden Markov model and a new intron submodel , 2003, ECCB.

[108]  Daniel L. Powell,et al.  Natural selection interacts with recombination to shape the evolution of hybrid genomes , 2018, Science.

[109]  Jerome Kelleher,et al.  Efficient Coalescent Simulation and Genealogical Analysis for Large Sample Sizes , 2015, bioRxiv.

[110]  Ian Korf,et al.  Gene finding in novel genomes , 2004, BMC Bioinformatics.

[111]  H. Mcminn STUDIES IN THE GENUS DIPLACUS, SCROPHULARIACEAE , 1951 .

[112]  David M. Goodstein,et al.  Phytozome: a comparative platform for green plant genomics , 2011, Nucleic Acids Res..

[113]  Korbinian Strimmer,et al.  APE: Analyses of Phylogenetics and Evolution in R language , 2004, Bioinform..