De novo SNP discovery and genetic linkage mapping in poplar using restriction site associated DNA and whole-genome sequencing technologies

BackgroundRestriction site associated DNA sequencing (RAD-seq), a next-generation sequencing technology, has greatly facilitated genetic linkage mapping studies in outbred species. RAD-seq is capable of discovering thousands of genetic markers for linkage mapping across many individuals, and can be applied in species with or without a reference genome. Although several analytical tools are available for RAD-seq data, alternative strategies are necessary for improving the marker quality and hence the genetic mapping accuracy.ResultsWe demonstrate a strategy for constructing dense genetic linkage maps in hybrid forest trees by combining RAD-seq and whole-genome sequencing technologies. We performed RAD-seq of 150 progeny and whole-genome sequencing of the two parents in an F1 hybrid population of Populus deltoides × P. simonii. Two rough references were assembled from the whole-genome sequencing reads of the two parents separately. Based on the parental reference sequences, 3442 high-quality single nucleotide polymorphisms (SNPs) were identified that segregate in the ratio of 1:1. The maternal linkage map of P. deltoides was constructed with 2012 SNPs, containing 19 linkage groups and spanning 4067.16 cM of the genome with an average distance of 2.04 cM between adjacent markers, while the male map of P. simonii consisted of 1430 SNPs and the same number of linkage groups with a total length of 4356.04 cM and an average interval distance of 3.09 cM. Collinearity between the parental linkage maps and the reference genome of P. trichocarpa was also investigated. Compared with the result on the basis of the existing reference genome, our strategy identified more high-quality SNPs and generated parental linkage groups that nicely match the karyotype of Populus.ConclusionsThe strategy of simultaneously using RAD and whole-genome sequencing technologies can be applied to constructing high-density genetic maps in forest trees regardless of whether a reference genome exists. The two parental linkage maps constructed here provide more accurate genetic resources for unraveling quantitative trait loci and accelerating molecular breeding programs, as well as for comparative genomics in Populus.

[1]  R. Wu,et al.  Detection of quantitative trait loci influencing growth trajectories of adventitious roots in Populus using functional mapping , 2009, Tree Genetics & Genomes.

[2]  Tianzhen Zhang,et al.  Molecular Mapping of Restriction-Site Associated DNA Markers in Allotetraploid Upland Cotton , 2015, PloS one.

[3]  Dongyuan Liu,et al.  SLAF-seq: An Efficient Method of Large-Scale De Novo SNP Discovery and Genotyping Using High-Throughput Sequencing , 2013, PloS one.

[4]  C. Nusbaum,et al.  ALLPATHS: de novo assembly of whole-genome shotgun microreads. , 2008, Genome research.

[5]  Inanç Birol,et al.  Assembling the 20 Gb white spruce (Picea glauca) genome from whole-genome shotgun sequencing data , 2013, Bioinform..

[6]  R. Sederoff,et al.  Genetic linkage maps of Eucalyptus grandis and Eucalyptus urophylla using a pseudo-testcross: mapping strategy and RAPD markers. , 1994, Genetics.

[7]  Katsutoshi Watanabe,et al.  A RAD-based linkage map and comparative genomics in the gudgeons (genus Gnathopogon, Cyprinidae) , 2013, BMC Genomics.

[8]  C T Falk,et al.  A simple scheme for preliminary ordering of multiple loci: application to 45 CF families. , 1989, Progress in clinical and biological research.

[9]  H. Xin,et al.  Construction of a high-density genetic map for grape using next generation restriction-site associated DNA sequencing , 2012, BMC Plant Biology.

[10]  Christophe Klopp,et al.  High-resolution genetic maps of Eucalyptus improve Eucalyptus grandis genome assembly. , 2015, The New phytologist.

[11]  Patrick M Hayes,et al.  Construction and application for QTL analysis of a Restriction Site Associated DNA (RAD) linkage map in barley , 2011, BMC Genomics.

[12]  Mark L. Blaxter,et al.  Linkage Mapping and Comparative Genomics Using Next-Generation RAD Sequencing of a Non-Model Organism , 2011, PloS one.

[13]  E. Pahlich,et al.  A rapid DNA isolation procedure for small quantities of fresh leaf tissue , 1980 .

[14]  J. Ooijen,et al.  JoinMap® 4, Software for the calculation of genetic linkage maps in experimental populations , 2006 .

[15]  T. Cezard,et al.  Special features of RAD Sequencing data: implications for genotyping , 2012, Molecular ecology.

[16]  Rongling Wu,et al.  Statistical Genetics of Quantitative Traits: Linkage, Maps and QTL , 2007 .

[17]  Mukesh Jain,et al.  NGS QC Toolkit: A Toolkit for Quality Control of Next Generation Sequencing Data , 2012, PloS one.

[18]  Detlef Weigel,et al.  Paired-end RAD-seq for de novo assembly and marker design without available reference , 2011, Bioinform..

[19]  A. Amores,et al.  Stacks: Building and Genotyping Loci De Novo From Short-Read Sequences , 2011, G3: Genes | Genomes | Genetics.

[20]  Falk Ct,et al.  A simple scheme for preliminary ordering of multiple loci: application to 45 CF families. , 1989 .

[21]  Richard D. Hayes,et al.  The genome of Eucalyptus grandis , 2014, Nature.

[22]  J. D. Matthews Forest Genetics , 1951, Nature.

[23]  Deren A. R. Eaton,et al.  PyRAD: assembly of de novo RADseq loci for phylogenetic analyses , 2013, bioRxiv.

[24]  G. Tuskan,et al.  Genome structure and emerging evidence of an incipient sex chromosome in Populus. , 2008, Genome research.

[25]  James C. Schnable,et al.  ALLMAPS: robust scaffold ordering based on multiple maps , 2015, Genome Biology.

[26]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[27]  Janna L. Fierst,et al.  Using linkage maps to correct and scaffold de novo genome assemblies: methods, challenges, and computational tools , 2015, Front. Genet..

[28]  Jeffrey Ross-Ibarra,et al.  Genetic Data Analysis II. Methods for Discrete Population Genentic Data , 2002 .

[29]  D. Neale,et al.  Forest tree genomics: growing resources and applications , 2011, Nature Reviews Genetics.

[30]  A. Kilian,et al.  A reference linkage map for Eucalyptus , 2012, BMC Genomics.

[31]  M. DePristo,et al.  A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.

[32]  Marco Marra,et al.  High-throughput BAC fingerprinting. , 2004, Methods in molecular biology.

[33]  Sebastian Kloska,et al.  A complete BAC-based physical map of the Arabidopsis thaliana genome , 1999, Nature Genetics.

[34]  Zechen Chong,et al.  Rainbow: an integrated tool for efficient clustering and assembling RAD-seq reads , 2012, Bioinform..

[35]  Robert W. Sykes,et al.  High-resolution genetic mapping of allelic variants associated with cell wall chemistry in Populus , 2015, BMC Genomics.

[36]  Riccardo Velasco,et al.  Fast and Cost-Effective Genetic Mapping in Apple Using Next-Generation Sequencing , 2014, G3: Genes, Genomes, Genetics.

[37]  P. Etter,et al.  Rapid SNP Discovery and Genetic Mapping Using Sequenced RAD Markers , 2008, PloS one.

[38]  Nicholas H. Putnam,et al.  A physical map of the highly heterozygous Populus genome: integration with the genome sequence and genetic map and analysis of haplotype variation. , 2007, The Plant journal : for cell and molecular biology.

[39]  M. Gribskov,et al.  The Genome of Black Cottonwood, Populus trichocarpa (Torr. & Gray) , 2006, Science.

[40]  J. Jansen,et al.  Linkage analysis in a full-sib family of an outbreeding plant species: overview and consequences for applications , 1997 .

[41]  Douglas G. Scofield,et al.  The Norway spruce genome sequence and conifer genome evolution , 2013, Nature.

[42]  Huanming Yang,et al.  Erratum: Genomic insights into salt adaptation in a desert poplar , 2013, Nature Communications.

[43]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[44]  S. Salzberg,et al.  Sequencing and Assembly of the 22-Gb Loblolly Pine Genome , 2014, Genetics.

[45]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[46]  Huanming Yang,et al.  De novo assembly of human genomes with massively parallel short read sequencing. , 2010, Genome research.

[47]  Robert J. Elshire,et al.  A Robust, Simple Genotyping-by-Sequencing (GBS) Approach for High Diversity Species , 2011, PloS one.

[48]  C. Tong,et al.  Construction of High-Density Linkage Maps of Populus deltoides × P. simonii Using Restriction-Site Associated DNA Sequencing , 2016, PloS one.

[49]  Chunfa Tong,et al.  A hidden Markov model approach to multilocus linkage analysis in a full-sib family , 2010, Tree Genetics & Genomes.

[50]  S. Berlin,et al.  High-density linkage mapping and evolution of paralogs and orthologs in Salix and Populus , 2010, BMC Genomics.

[51]  Roeland E. Voorrips,et al.  Software for the calculation of genetic linkage maps , 2001 .

[52]  Eric A. Johnson,et al.  Mapping with RAD (restriction-site associated DNA) markers to rapidly identify QTL for stem rust resistance in Lolium perenne , 2011, Theoretical and Applied Genetics.