Organization of the Caenorhabditis elegans small non-coding transcriptome: genomic features, biogenesis, and expression.

Recent evidence points to considerable transcription occurring in non-protein-coding regions of eukaryote genomes. However, their lack of conservation and demonstrated function have created controversy over whether these transcripts are functional. Applying a novel cloning strategy, we have cloned 100 novel and 61 known or predicted Caenorhabditis elegans full-length ncRNAs. Studying the genomic environment and transcriptional characteristics have shown that two-thirds of all ncRNAs, including many intronic snoRNAs, are independently transcribed under the control of ncRNA-specific upstream promoter elements. Furthermore, the transcription levels of at least 60% of the ncRNAs vary with developmental stages. We identified two new classes of ncRNAs, stem-bulge RNAs (sbRNAs) and snRNA-like RNAs (snlRNAs), both featuring distinct internal motifs, secondary structures, upstream elements, and high and developmentally variable expression. Most of the novel ncRNAs are conserved in Caenorhabditis briggsae, but only one homolog was found outside the nematodes. Preliminary estimates indicate that the C. elegans transcriptome contains approximately 2700 small non-coding RNAs, potentially acting as regulatory elements in nematode development.

[1]  R. Cortese,et al.  Promoter of a eukaryotic tRNAPro gene is composed of three noncontiguous regions. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[2]  C. Guthrie,et al.  A subset of yeast snRNA's contains functional binding sites for the highly conserved Sm antigen. , 1987, Science.

[3]  J. Thomas,et al.  The spliceosomal snRNAs of Caenorhabditis elegans. , 1990, Nucleic acids research.

[4]  R. Singh,et al.  Characterization of U6 small nuclear RNA cap-specific antibodies. Identification of gamma-monomethyl-GTP cap structure in 7SK and several other human small RNAs. , 1990, The Journal of biological chemistry.

[5]  Charles Elkan,et al.  Fitting a Mixture Model By Expectation Maximization To Discover Motifs In Biopolymer , 1994, ISMB.

[6]  Charles Elkan,et al.  The Value of Prior Knowledge in Discovering Motifs with MEME , 1995, ISMB.

[7]  William Noble Grundy,et al.  Meta-MEME: motif-based hidden Markov models of protein families , 1997, Comput. Appl. Biosci..

[8]  S. Eddy,et al.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. , 1997, Nucleic acids research.

[9]  Temple F. Smith,et al.  Comparison of the complete protein sets of worm and yeast: orthology and divergence. , 1998, Science.

[10]  Kathleen R. Noon,et al.  Posttranscriptional Modifications in 16 S and 23 S rRNAs of the Archaeal Hyperthermophile Sulfolobus solfataricus , 1998 .

[11]  Phillip D. Zamore,et al.  RNA Interference , 2000, Science.

[12]  A. Hüttenhofer,et al.  Identification of brain-specific and imprinted small nucleolar RNA genes exhibiting an unusual genomic organization. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[13]  N. Hernandez,et al.  Small Nuclear RNA Genes: a Model System to Study Fundamental Mechanisms of Transcription* , 2001, The Journal of Biological Chemistry.

[14]  T. Tuschl,et al.  RNA interference is mediated by 21- and 22-nucleotide RNAs. , 2001, Genes & development.

[15]  A. Hüttenhofer,et al.  RNomics: an experimental approach that identifies 201 candidates for novel, small, non‐messenger RNAs in mouse , 2001, The EMBO journal.

[16]  S. Eddy Non–coding RNA genes and the modern RNA world , 2001, Nature Reviews Genetics.

[17]  A. Hüttenhofer,et al.  Identification of 86 candidates for small non-messenger RNAs from the archaeon Archaeoglobus fulgidus , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[18]  Tatsuo Tanaka,et al.  Location of 2(')-O-methyl nucleotides in 26S rRNA and methylation guide snoRNAs in Caenorhabditis elegans. , 2002, Biochemical and biophysical research communications.

[19]  Jürgen Brosius,et al.  Experimental RNomics Identification of 140 Candidates for Small Non-Messenger RNAs in the Plant Arabidopsis thaliana , 2002, Current Biology.

[20]  E. Birney,et al.  Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs , 2002, Nature.

[21]  Juancarlos Chan,et al.  WormBase: a cross-species database for comparative genomics , 2003, Nucleic Acids Res..

[22]  Manuel Echeverria,et al.  Plant snoRNAs: functional evolution and new modes of gene expression. , 2003, Trends in plant science.

[23]  Stuart K. Kim,et al.  Global analysis of dauer gene expression in Caenorhabditis elegans , 2003, Development.

[24]  A. Marchfelder,et al.  Plant dicistronic tRNA–snoRNA genes: a new mode of expression of the small nucleolar RNAs processed by RNase Z , 2003, The EMBO journal.

[25]  J. Vogel,et al.  RNomics in Escherichia coli detects new sRNA species and indicates parallel transcriptional output in bacteria. , 2003, Nucleic acids research.

[26]  J. Mattick Challenging the dogma: the hidden layer of non-protein-coding RNAs in complex organisms. , 2003, BioEssays : news and reviews in molecular, cellular and developmental biology.

[27]  Joseph M. Dale,et al.  Empirical Analysis of Transcriptional Activity in the Arabidopsis Genome , 2003, Science.

[28]  Matthew Purdy,et al.  Cloning and characterization of the Drosophila U7 small nuclear RNA , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[29]  Jürgen Brosius,et al.  RNomics in Drosophila melanogaster: identification of 66 candidates for novel non-messenger RNAs , 2003, Nucleic acids research.

[30]  Zhihua Zhang,et al.  Conservation analysis of small RNA genes in Escherichia coli , 2004, Bioinform..

[31]  Ram Samudrala,et al.  Mouse transcriptome: Neutral evolution of ‘non-coding’ complementary DNAs , 2004, Nature.

[32]  D. Haussler,et al.  Ultraconserved Elements in the Human Genome , 2004, Science.

[33]  S. Cawley,et al.  Unbiased Mapping of Transcription Factor Binding Sites along Human Chromosomes 21 and 22 Points to Widespread Regulation of Noncoding RNAs , 2004, Cell.

[34]  I. Bozzoni,et al.  TOP promoter elements control the relative ratio of intron-encoded snoRNA versus spliced mRNA biosynthesis. , 2004, Journal of molecular biology.

[35]  Thomas E. Royce,et al.  Global Identification of Human Transcribed Sequences with Genome Tiling Arrays , 2004, Science.

[36]  C. Burge,et al.  Patterns of flanking sequence conservation and a characteristic upstream motif for microRNA gene identification. , 2004, RNA.

[37]  J. Steitz,et al.  Guide RNAs with 5′ Caps and Novel Box C/D snoRNA-like Domains for Modification of snRNAs in Metazoa , 2004, Current Biology.

[38]  J. Mattick RNA regulation: a new genetics? , 2004, Nature Reviews Genetics.

[39]  Cgtcctaccgcagtatctttcgaacacta Tgaaatgaccc,et al.  Is a Ubiquitous snoRNA with Two Conserved Sequence Motifs Essential for 18 S rRNA Production , 2004 .

[40]  S. Cawley,et al.  Novel RNAs identified from an in-depth analysis of the transcriptome of human chromosomes 21 and 22. , 2004, Genome research.

[41]  Pontus Larsson,et al.  Novel non-coding RNAs in Dictyostelium discoideum and their expression during development. , 2004, Nucleic Acids Research.

[42]  J. Lukeš,et al.  SmD1 Is Required for Spliced Leader RNA Biogenesis , 2004, Eukaryotic Cell.

[43]  A. Muto,et al.  Isolation of eight novel Caenorhabditis elegans small RNAs. , 2004, Gene.

[44]  T. Kiss,et al.  U17/snR30 Is a Ubiquitous snoRNA with Two Conserved Sequence Motifs Essential for 18S rRNA Production , 2004, Molecular and Cellular Biology.

[45]  A. Hüttenhofer,et al.  Non-coding RNAs: hope or hype? , 2005, Trends in genetics : TIG.

[46]  Boris Lenhard,et al.  RNAdb—a comprehensive mammalian noncoding RNA database , 2004, Nucleic Acids Res..

[47]  Yi Zhao,et al.  NONCODE: an integrated knowledge database of non-coding RNAs , 2004, Nucleic Acids Res..

[48]  Klaudia Walter,et al.  Open access, freely available online PLoS BIOLOGY Highly Conserved Non-Coding Sequences Are Associated with Vertebrate Development , 2022 .

[49]  G. Phillips,et al.  Identification of transcribed sequences in Arabidopsis thaliana by using high-resolution genome tiling arrays. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[50]  Sean R Eddy,et al.  C. elegans noncoding RNA genes. , 2005, WormBook : the online review of C. elegans biology.

[51]  G. Helt,et al.  Transcriptional Maps of 10 Human Chromosomes at 5-Nucleotide Resolution , 2005, Science.

[52]  T. Rognes,et al.  Predicting non-coding RNA genes in Escherichia coli with boosted genetic programming , 2005, Nucleic acids research.