Massively parallel sequencing of the polyadenylated transcriptome of C. elegans.

Using massively parallel sequencing by synthesis methods, we have surveyed the polyA+ transcripts from four stages of the nematode Caenorhabditis elegans to an unprecedented depth. Using novel statistical approaches, we evaluated the coverage of annotated features of the genome and of candidate processed transcripts, including splice junctions, trans-spliced leader sequences, and polyadenylation tracts. The data provide experimental support for >85% of the annotated protein-coding transcripts in WormBase (WS170) and confirm additional details of processing. For example, the total number of confirmed splice junctions was raised from 70,911 to over 98,000. The data also suggest thousands of modifications to WormBase annotations and identify new spliced junctions and genes not part of any WormBase annotation, including at least 80 putative genes not found in any of three predicted gene sets. The quantitative nature of the data also suggests that mRNA levels may be measured by this approach with unparalleled precision. Although most sequences align with protein-coding genes, a small fraction falls in introns and intergenic regions. One notable region on the X chromosome encodes a noncoding transcript of >10 kb localized to somatic nuclei.

[1]  Ryan D. Morin,et al.  Profiling the HeLa S3 transcriptome using randomly primed cDNA and massively parallel short-read sequencing. , 2008, BioTechniques.

[2]  Chaochun Wei,et al.  Closing in on the C. elegans ORFeome by cloning TWINSCAN predictions. , 2005, Genome research.

[3]  V. Reinke,et al.  A global profile of germline gene expression in C. elegans. , 2000, Molecular cell.

[4]  D. Slonim,et al.  Composition and dynamics of the Caenorhabditis elegans early embryonic transcriptome , 2003, Development.

[5]  Oliver Hobert,et al.  A transcriptional regulatory cascade that controls left/right asymmetry in chemosensory neurons of C. elegans. , 2003, Genes & development.

[6]  S. Ranade,et al.  Stem cell transcriptome profiling via massive-scale mRNA sequencing , 2008, Nature Methods.

[7]  H. Hardy Letter: Correction on the number of presumed beryllium-induced osteosarcomas in human beings. , 1976, The New England journal of medicine.

[8]  Donna K Slonim,et al.  Synthetic lethal analysis of Caenorhabditis elegans posterior embryonic patterning genes identifies conserved genetic interactions , 2005, Genome Biology.

[9]  K. Kinzler,et al.  Serial Analysis of Gene Expression , 1995, Science.

[10]  M. Rosbash,et al.  Number and distribution of polyadenylated RNA sequences in yeast , 1977, Cell.

[11]  D. Riddle,et al.  Control of C. elegans Larval Development by Neuronal Expression of a TGF-β Homolog , 1996, Science.

[12]  N. Halloran,et al.  A survey of expressed genes in Caenorhabditis elegans , 1992, Nature Genetics.

[13]  M. Gerstein,et al.  The Transcriptional Landscape of the Yeast Genome Defined by RNA Sequencing , 2008, Science.

[14]  B. Coupar,et al.  Quantification of hepatic transcribing RNA polymerase molecules, polyribonucleotide elongation rates and messenger RNA complexity in fed and fasted rats. , 1978, European journal of biochemistry.

[15]  G C Johnston,et al.  Regulation of cell size in the yeast Saccharomyces cerevisiae , 1979, Journal of bacteriology.

[16]  Y. Kohara,et al.  [Large scale analysis of C. elegans cDNA]. , 1996, Tanpakushitsu kakusan koso. Protein, nucleic acid, enzyme.

[17]  J. Hudson,et al.  C. elegans ORFeome version 1.1: experimental verification of the genome annotation and resource for proteome-scale protein expression , 2003, Nature Genetics.

[18]  I. Goodhead,et al.  Dynamic repertoire of a eukaryotic transcriptome surveyed at single-nucleotide resolution , 2008, Nature.

[19]  T. Moore,et al.  Open-reading-frame sequence tags (OSTs) support the existence of at least 17,300 genes in C. elegans , 2001, Nature Genetics.

[20]  V. Reinke,et al.  Genome-wide analysis of developmental and sex-regulated gene expression profiles in Caenorhabditis elegans. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[21]  Michael J MacCoss,et al.  Use of shotgun proteomics for the identification, confirmation, and correction of C. elegans gene annotations. , 2008, Genome research.

[22]  Wei Zhou,et al.  Mapping the C. elegans noncoding transcriptome with a whole-genome tiling microarray. , 2007, Genome research.

[23]  Ian Korf,et al.  Integrating genomic homology into gene structure prediction , 2001, ISMB.

[24]  R. Durbin,et al.  Mapping Quality Scores Mapping Short Dna Sequencing Reads and Calling Variants Using P

, 2022 .

[25]  Joshua M. Stuart,et al.  A Gene Expression Map for Caenorhabditis elegans , 2001, Science.

[26]  B. Williams,et al.  Mapping and quantifying mammalian transcriptomes by RNA-Seq , 2008, Nature Methods.