De Novo Transcriptome Sequence Assembly and Analysis of RNA Silencing Genes of Nicotiana benthamiana

Background: Nicotiana benthamiana has been widely used for transient gene expression assays and as a model plant in the study of plant-microbe interactions, lipid engineering and RNA silencing pathways. Assembling the sequence of its transcriptome provides information that, in conjunction with the genome sequence, will help explain the plant's capacity for high-level transient transgene expression, generation of mobile gene silencing signals, and hyper-susceptibility to viral infection.

Methodology/Results: RNA-seq libraries from 9 different tissues were deep sequenced and assembled de novo into a representation of the transcriptome. The assembly, of 16 GB of sequence, yielded 237,340 contigs, clustering into 119,014 transcripts (unigenes). Between 80% and 85% of reads from all tissues could be mapped back to the full transcriptome. Approximately 63% of the unigenes exhibited a match to the Solgenomics tomato predicted proteins database. Approximately 94% of the Solgenomics N. benthamiana unigene set (16,024 sequences) matched our unigene set (119,014 sequences). Using homology searches, we identified 31 homologues of genes that are involved in RNAi-associated pathways in Arabidopsis thaliana, and show that they possess the domains characteristic of these proteins. Of these genes, the RNA-dependent RNA polymerase gene, Rdr1, is transcribed but has a 72 nt insertion in exon 1 that would cause premature termination of translation. Dicer-like 3 (DCL3) appears to lack both the DEAD helicase motif and the second dsRNA binding motif, and DCL2 and AGO4b have unexpectedly high levels of transcription.

Conclusions: The assembled and annotated representation of the transcriptome and the list of RNAi-associated sequences are accessible at www.benthgenome.com alongside a draft genome assembly. These genomic resources will be useful for further study of the developmental, metabolic and defense pathways of N. benthamiana and for understanding the mechanisms behind the features that have made it such a well-used model plant.
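
As a rough sanity check on the assembly figures quoted above, the short Python sketch below turns the reported percentages into approximate sequence counts and computes the average number of contigs per unigene. The counts and percentages are taken from the abstract; the variable names and the helper `approx_count` are illustrative assumptions, not part of the original analysis. Because the abstract gives the percentages as approximate, the derived counts are estimates only.

```python
# Figures quoted in the abstract. The percentages are stated there as
# approximate, so every derived count below is an estimate.
CONTIGS = 237_340                  # contigs produced by the de novo assembly
UNIGENES = 119_014                 # transcripts (unigenes) after clustering
SOLGENOMICS_NB_UNIGENES = 16_024   # Solgenomics N. benthamiana unigene set


def approx_count(total: int, percent: float) -> int:
    """Hypothetical helper: convert a percentage of `total` to a rounded count."""
    return round(total * percent / 100)


# Average number of contigs that collapse into one unigene.
contigs_per_unigene = CONTIGS / UNIGENES

# Roughly 63% of unigenes matched the tomato predicted proteins database.
tomato_hits = approx_count(UNIGENES, 63)

# Roughly 94% of the Solgenomics N. benthamiana unigenes matched the new set.
solgenomics_covered = approx_count(SOLGENOMICS_NB_UNIGENES, 94)

if __name__ == "__main__":
    print(f"contigs per unigene: {contigs_per_unigene:.2f}")
    print(f"unigenes with a tomato match (approx.): {tomato_hits}")
    print(f"Solgenomics unigenes covered (approx.): {solgenomics_covered}")
```

On these numbers, clustering roughly halves the contig set: on average just under two contigs collapse into each unigene.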
