SNP discovery in black cottonwood (Populus trichocarpa) by population transcriptome resequencing

The western black cottonwood (Populus trichocarpa) was the first tree to have its genome fully sequenced and has emerged as the model species for the study of secondary growth and wood formation. It is also a good candidate species for the production of lignocellulosic biofuels. Here, we present and make available to the research community the results of the sequencing of the transcriptome of developing xylem in 20 accessions with high‐throughput next generation sequencing technology. We found over 0.5 million putative single nucleotide polymorphisms (SNPs) in 26 595 genes that are expressed in developing secondary xylem. More than two‐thirds of all SNPs were found in annotated exons, with 18% and 14% in regions of the genome annotated as introns and intergenic, respectively, where only 3% and 4% of sequence reads mapped. This suggests that the current annotation of the poplar genome is remarkably incomplete and that there are many transcripts and novel genes waiting to be annotated. We hope that this resource will stimulate further research in expression profiling, detection of alternative splicing and adaptive evolution in poplar.

[1]  M. Nowrousian Next-Generation Sequencing Techniques for Eukaryotic Microorganisms: Sequencing-Based Solutions to Biological Problems , 2010, Eukaryotic Cell.

[2]  Cole Trapnell,et al.  Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. , 2010, Nature biotechnology.

[3]  G. Ast,et al.  Alternative splicing and evolution: diversification, exon definition and function , 2010, Nature Reviews Genetics.

[4]  P. Tiffin,et al.  Nucleotide diversity and linkage disequilibrium in balsam poplar (Populus balsamifera). , 2010, The New phytologist.

[5]  S. Keller,et al.  Genomic diversity, population structure, and migration following rapid range expansion in the Balsam Poplar, Populus balsamifera , 2010, Molecular ecology.

[6]  G. Malerba,et al.  Characterization of Transcriptional Complexity during Berry Development in Vitis vinifera Using RNA-Seq1[W] , 2010, Plant Physiology.

[7]  Henry D. Priest,et al.  Genome-wide mapping of alternative splicing in Arabidopsis thaliana. , 2010, Genome research.

[8]  M. Marra,et al.  Applications of new sequencing technologies for transcriptome analysis. , 2009, Annual review of genomics and human genetics.

[9]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[10]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[11]  M. Olson,et al.  Nucleotide diversity among natural populations of a North American poplar (Populus balsamifera, Salicaceae). , 2009, The New phytologist.

[12]  C. Ying,et al.  Ecotypic mode of regional differentiation caused by restricted gene migration: a case in black cottonwood (Populus trichocarpa) along the Pacific Northwest coast , 2009 .

[13]  Chris Somerville,et al.  Cellulosic biofuels. , 2009, Annual review of plant biology.

[14]  M. Gerstein,et al.  RNA-Seq: a revolutionary tool for transcriptomics , 2009, Nature Reviews Genetics.

[15]  R. Durbin,et al.  Mapping Quality Scores Mapping Short Dna Sequencing Reads and Calling Variants Using P

, 2022 .

[16]  Eric T. Wang,et al.  Alternative Isoform Regulation in Human Tissue Transcriptomes , 2008, Nature.

[17]  Pär K Ingvarsson,et al.  Multilocus Patterns of Nucleotide Polymorphism and the Demographic History of Populus tremula , 2008, Genetics.

[18]  B. Williams,et al.  Mapping and quantifying mammalian transcriptomes by RNA-Seq , 2008, Nature Methods.

[19]  M. Gerstein,et al.  The Transcriptional Landscape of the Yeast Genome Defined by RNA Sequencing , 2008, Science.

[20]  S. Ranade,et al.  Stem cell transcriptome profiling via massive-scale mRNA sequencing , 2008, Nature Methods.

[21]  C. Douglas,et al.  Populus: a model system for plant biology. , 2007, Annual review of plant biology.

[22]  T. Pyhäjärvi,et al.  Genomic diversity in forest trees. , 2007, Current opinion in plant biology.

[23]  M. Gribskov,et al.  The Genome of Black Cottonwood, Populus trichocarpa (Torr. & Gray) , 2006, Science.

[24]  B. Potts,et al.  A framework for community and ecosystem genetics: from genes to ecosystems , 2006, Nature Reviews Genetics.

[25]  S. Otto,et al.  Use of Ecotilling as an efficient SNP discovery tool to survey genetic variation in wild populations of Populus trichocarpa , 2006, Molecular ecology.

[26]  A. Groover What genes make a tree a tree? , 2005, Trends in plant science.

[27]  Q. Cronk Plant eco-devo: the potential of poplar as a model organism. , 2005, The New phytologist.

[28]  S Rozen,et al.  Primer3 on the WWW for general users and for biologist programmers. , 2000, Methods in molecular biology.

[29]  P Green,et al.  Base-calling of automated sequencer traces using phred. II. Error probabilities. , 1998, Genome research.

[30]  P. Green,et al.  Base-calling of automated sequencer traces using phred. I. Accuracy assessment. , 1998, Genome research.

[31]  D. Nickerson,et al.  PolyPhred: automating the detection and genotyping of single nucleotide substitutions using fluorescence-based resequencing. , 1997, Nucleic acids research.

[32]  J. Doyle,et al.  A rapid DNA isolation procedure for small amounts of fresh leaf tissue , 1987 .

[33]  G. A. Watterson On the number of segregating sites in genetical models without recombination. , 1975, Theoretical population biology.

[34]  S. Wright,et al.  Genetical Structure of Populations , 1950, Nature.