Exploring a Pool‐seq‐only approach for gaining population genomic insights in nonmodel species

Abstract Developing genomic insights is challenging in nonmodel species for which resources are often scarce and prohibitively costly. Here, we explore the potential of a recently established approach using Pool‐seq data to generate a de novo genome assembly for mining exons, upon which Pool‐seq data are used to estimate population divergence and diversity. We do this for two pairs of sympatric populations of brown trout (Salmo trutta): one naturally sympatric set of populations and another pair of populations introduced to a common environment. We validate our approach by comparing the results to those from markers previously used to describe the populations (allozymes and individual‐based single nucleotide polymorphisms [SNPs]) and from mapping the Pool‐seq data to a reference genome of the closely related Atlantic salmon (Salmo salar). We find that genomic differentiation (F ST) between the two introduced populations exceeds that of the naturally sympatric populations (F ST = 0.13 and 0.03 between the introduced and the naturally sympatric populations, respectively), in concordance with estimates from the previously used SNPs. The same level of population divergence is found for the two genome assemblies, but estimates of average nucleotide diversity differ (π¯ ≈ 0.002 and π¯ ≈ 0.001 when mapping to S. trutta and S. salar, respectively), although the relationships between population values are largely consistent. This discrepancy might be attributed to biases when mapping to a haploid condensed assembly made of highly fragmented read data compared to using a high‐quality reference assembly from a divergent species. We conclude that the Pool‐seq‐only approach can be suitable for detecting and quantifying genome‐wide population differentiation, and for comparing genomic diversity in populations of nonmodel species where reference genomes are lacking.

[1]  L. Bernatchez,et al.  Comparing Pool‐seq, Rapture, and GBS genotyping for inferring weak population structure: The American lobster (Homarus americanus) as a case study , 2019, Ecology and evolution.

[2]  R. Guyomard,et al.  The Chromosome-Level Genome Assembly of European Grayling Reveals Aspects of a Unique Genome Evolution Process Within Salmonids , 2019, G3: Genes, Genomes, Genetics.

[3]  B. Koop,et al.  The Arctic charr (Salvelinus alpinus) genome and transcriptome assembly , 2018, PloS one.

[4]  S. Nylin,et al.  Genetic variation underlying local adaptation of diapause induction along a cline in a butterfly , 2018, Molecular ecology.

[5]  M. Leitwein,et al.  Genomic consequences of a recent three‐way admixture in supplemented wild brown trout populations revealed by local ancestry tracts , 2018, Molecular ecology.

[6]  A. Maass,et al.  Genomic variation underlying complex life-history traits revealed by genome sequencing in Chinook salmon , 2018, Proceedings of the Royal Society B: Biological Sciences.

[7]  D. Bekkevold,et al.  Population genetic structure after 125 years of stocking in sea trout (Salmo trutta L.) , 2018, Conservation Genetics.

[8]  S. Nylin,et al.  Microevolutionary selection dynamics acting on immune genes of the green‐veined white butterfly, Pieris napi , 2018, Molecular ecology.

[9]  B. Saint-Jean,et al.  A transposable element annotation pipeline and expression analysis reveal potentially active elements in the microalga Tisochrysis lutea , 2018, BMC Genomics.

[10]  B. Koop,et al.  Chinook salmon (Oncorhynchus tshawytscha) genome and transcriptome , 2018, PloS one.

[11]  M. Gautier,et al.  Measuring Genetic Differentiation from Pool-seq Data , 2018, Genetics.

[12]  B. Koop,et al.  Convergence in form and function overcomes non-parallel evolutionary histories in Arctic charr , 2018, bioRxiv.

[13]  C. Adams,et al.  De novo transcriptome assembly, annotation and comparison of four ecological and evolutionary model salmonid fish species , 2018, BMC genomics.

[14]  L. A. Vøllestad Understanding Brown Trout Population Genetic Structure: A Northern-European Perspective , 2017 .

[15]  C. Wheat,et al.  Investigating the genomic basis of discrete phenotypes using a Pool‐Seq‐only approach: New insights into the genetics underlying colour variation in diverse taxa , 2017, Molecular ecology.

[16]  D. Ruzzante,et al.  Whole‐genome sequencing approaches for conservation biology: Advantages, limitations and practical recommendations , 2017, Molecular ecology.

[17]  F. Johansson,et al.  Lack of trophic polymorphism despite substantial genetic differentiation in sympatric brown trout (Salmo trutta) populations , 2017 .

[18]  D. Edwards,et al.  Theory, practice, and conservation in the age of genomics: The Galápagos giant tortoise as a case study , 2017, bioRxiv.

[19]  A. Aguilar,et al.  Population genomics reveals high gene flow in grass rockfish (Sebastes rastrelliger). , 2017, Marine genomics.

[20]  M. Kent,et al.  Complex genetic diversity patterns of cryptic, sympatric brown trout (Salmo trutta) populations in tiny mountain lakes , 2017, Conservation Genetics.

[21]  Nina Overgaard Therkildsen,et al.  Practical low‐coverage genomewide sequencing of hundreds of individually barcoded samples for population and evolutionary genomics in nonmodel species , 2017, Molecular ecology resources.

[22]  C. Wheat,et al.  Advances in finding Alba: the locus affecting life history and color polymorphism in a Colias butterfly , 2017, Journal of evolutionary biology.

[23]  M. Leitwein,et al.  A Dense Brown Trout (Salmo trutta) Linkage Map Reveals Recent Chromosomal Rearrangements in the Salmo Genus and the Impact of Selection on Linked Neutral Diversity , 2016, G3: Genes, Genomes, Genetics.

[24]  Santhosh Girirajan,et al.  Novel metrics to measure coverage in whole exome sequencing datasets reveal local and global non-uniformity , 2016, Scientific Reports.

[25]  G. Pazour,et al.  Ror2 signaling regulates Golgi structure and transport through IFT20 for tumor invasiveness , 2017, Scientific Reports.

[26]  C. Nugent,et al.  A SNP Based Linkage Map of the Arctic Charr (Salvelinus alpinus) Genome Provides Insights into the Diploidization Process After Whole Genome Duplication , 2016, G3: Genes, Genomes, Genetics.

[27]  L. Bernatchez On the maintenance of genetic variation and adaptation to environmental change: considerations from population genomics in fishes. , 2016, Journal of fish biology.

[28]  Roberta Bordoni,et al.  Next Generation Sequencing of Pooled Samples: Guideline for Variants’ Filtering , 2016, Scientific Reports.

[29]  M. Leitwein,et al.  Genome-wide nucleotide diversity of hatchery-reared Atlantic and Mediterranean strains of brown trout Salmo trutta compared to wild Mediterranean populations. , 2016, Journal of fish biology.

[30]  C. Schlötterer,et al.  Suitability of Different Mapping Algorithms for Genome-Wide Polymorphism Scans with Pool-Seq Data , 2016, G3: Genes, Genomes, Genetics.

[31]  Steven J. M. Jones,et al.  The Atlantic salmon genome provides insights into rediploidization , 2016, Nature.

[32]  Yvonne Feierabend,et al.  Population Genetics A Concise Guide , 2016 .

[33]  F. Volckaert Applying the hidden treasures of population biodiversity to decision making and management , 2015 .

[34]  P. Herzyk,et al.  Double‐digest RAD sequencing using Ion Proton semiconductor platform (ddRADseq‐ion) with nonmodel organisms , 2015, Molecular ecology resources.

[35]  Liliana Florea,et al.  Rcorrector: efficient and accurate error correction for Illumina RNA-seq reads , 2015, GigaScience.

[36]  Evgeny M. Zdobnov,et al.  BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs , 2015, Bioinform..

[37]  C. Dichmont,et al.  Ocean's eleven: a critical evaluation of the role of population, evolutionary and molecular genetics in the management of wild fisheries , 2015 .

[38]  C. Schlötterer,et al.  Sequencing pools of individuals — mining genome-wide polymorphism data without big funding , 2014, Nature Reviews Genetics.

[39]  D. Chalopin,et al.  The rainbow trout genome provides novel insights into evolution after whole-genome duplication in vertebrates , 2014, Nature Communications.

[40]  A. Snoj,et al.  Molecular phylogeny of Salmo of the western Balkans, based upon multiple nuclear loci , 2014, Genetics Selection Evolution.

[41]  D. Bekkevold,et al.  Local Adaptation at the Transcriptome Level in Brown Trout: Evidence from Early Life History Temperature Genomic Reaction Norms , 2014, PloS one.

[42]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[43]  Arndt von Haeseler,et al.  NextGenMap: fast and accurate read mapping in highly polymorphic genomes , 2013, Bioinform..

[44]  S. Tishkoff,et al.  SNP ascertainment bias in population genetic analyses: Why it is important, and how to correct it , 2013, BioEssays : news and reviews in molecular, cellular and developmental biology.

[45]  L. Bernatchez,et al.  THE GENETIC ARCHITECTURE OF REPRODUCTIVE ISOLATION DURING SPECIATION‐WITH‐GENE‐FLOW IN LAKE WHITEFISH SPECIES PAIRS ASSESSED BY RAD SEQUENCING , 2013, Evolution; international journal of organic evolution.

[46]  U. Schliewen,et al.  Genetic variation in brown trout Salmo trutta across the Danube, Rhine, and Elbe headwaters: a failure of the phylogeographic paradigm? , 2013, BMC Evolutionary Biology.

[47]  A. Palmé,et al.  Monitoring reveals two genetically distinct brown trout populations remaining in stable sympatry over 20 years in tiny mountain lakes , 2013, Conservation Genetics.

[48]  C. Wedekind,et al.  Testing for local adaptation in brown trout using reciprocal transplants , 2012, BMC Evolutionary Biology.

[49]  Joaquín Dopazo,et al.  Qualimap: evaluating next-generation sequencing alignment data , 2012, Bioinform..

[50]  S. Lien,et al.  Chromosomal differences between European and North American Atlantic salmon discovered by linkage mapping and supported by fluorescence in situ hybridization analysis , 2012, BMC Genomics.

[51]  Dmitri A. Petrov,et al.  Empirical Validation of Pooled Whole Genome Population Re-Sequencing in Drosophila melanogaster , 2012, PloS one.

[52]  J. Charlier,et al.  Genetic monitoring reveals temporal stability over 30 years in a small, lake-resident brown trout population , 2012, Heredity.

[53]  Robert Kofler,et al.  PoPoolation2: identifying differentiation between populations using sequencing of pooled DNA samples (Pool-Seq) , 2011, Bioinform..

[54]  M. Blaxter,et al.  Genome-wide genetic marker discovery and genotyping using next-generation sequencing , 2011, Nature Reviews Genetics.

[55]  N. Friedman,et al.  Trinity : reconstructing a full-length transcriptome without a genome from RNA-Seq data , 2016 .

[56]  D. Bekkevold,et al.  An assessment of the spatial scale of local adaptation in brown trout (Salmo trutta L.): footprints of selection at microsatellite DNA loci , 2011, Heredity.

[57]  A. Futschik,et al.  PoPoolation: A Toolbox for Population Genetic Analysis of Next Generation Sequencing Data from Pooled Individuals , 2011, PloS one.

[58]  Helga Thorvaldsdóttir,et al.  Integrative Genomics Viewer , 2011, Nature Biotechnology.

[59]  R. Nielsen,et al.  Ascertainment biases in SNP chips affect measures of population divergence. , 2010, Molecular biology and evolution.

[60]  Steven J. M. Jones,et al.  Sequencing the genome of the Atlantic salmon (Salmo salar) , 2010, Genome Biology.

[61]  Aaron R. Quinlan,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2022 .

[62]  F. Allendorf,et al.  In the Light of Evolution III: Two Centuries of Darwin Sackler Colloquium: Human-induced evolution caused by unnatural selection through harvest of wild animals , 2009 .

[63]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[64]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[65]  A. Punt,et al.  Integrating genetic data into management of marine resources: how can we do it better? , 2008 .

[66]  Osamu Gotoh,et al.  Direct mapping and alignment of protein sequences onto genomic sequence , 2008, Bioinform..

[67]  F. Rousset genepop’007: a complete re‐implementation of the genepop software for Windows and Linux , 2008, Molecular ecology resources.

[68]  Peter B. McGarvey,et al.  UniRef: comprehensive and non-redundant UniProt reference clusters , 2007, Bioinform..

[69]  M. Stephens,et al.  Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. , 2003, Genetics.

[70]  M. M. Hansen Estimating the long‐term effects of stocking domesticated trout into wild brown trout (Salmo trutta) populations: an approach using microsatellite DNA analysis of historical and contemporary samples , 2002, Molecular ecology.

[71]  P. Ráb,et al.  Chromosome evolution in the Salmonidae (Pisces): an update , 2001, Biological reviews of the Cambridge Philosophical Society.

[72]  P. Donnelly,et al.  Inference of population structure using multilocus genotype data. , 2000, Genetics.

[73]  E. Nielsen,et al.  Microsatellite and mitochondrial DNA polymorphism reveals life‐history dependent interbreeding between hatchery and wild brown trout (Salmo trutta L.) , 2000, Molecular ecology.

[74]  N. Ryman,et al.  Genetic basis of phenotypic differences between transplanted stocks of brown trout , 1999 .

[75]  N. Ryman,et al.  TEMPORAL CHANGE OF MITOCHONDRIAL DNA HAPLOTYPE FREQUENCIES AND FEMALE EFFECTIVE SIZE IN A BROWN TROUT (SALMO TRUTTA) POPULATION , 1998, Evolution; international journal of organic evolution.

[76]  N. Ryman,et al.  Demographic genetics of brown trout (Salmo trutta) and estimation of effective population size from temporal change of allele frequencies. , 1996, Genetics.

[77]  K. Hindar,et al.  Genetic Effects of Cultured Fish on Natural Fish Populations , 1991 .

[78]  F. Tajima Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. , 1989, Genetics.

[79]  B. Weir,et al.  ESTIMATING F‐STATISTICS FOR THE ANALYSIS OF POPULATION STRUCTURE , 1984, Evolution; international journal of organic evolution.

[80]  F. Tajima Evolutionary relationship of DNA sequences in finite populations. , 1983, Genetics.

[81]  N. Ryman,et al.  Simple mendelian inheritance at a locus coding for alpha-glycerophosphate dehydrogenase in brown trout (Salmo trutta). , 2008, Hereditas.

[82]  D. Simberloff,et al.  Conservation Biology: An Evolutionary-Ecological Perspective , 1980 .

[83]  F. Allendorf,et al.  Reproductive isolation with little genetic divergence in sympatric populations of brown trout (Salmo trutta). , 1979, Genetics.

[84]  M. Nei Analysis of gene diversity in subdivided populations. , 1973, Proceedings of the National Academy of Sciences of the United States of America.