Post-transcriptional processing generates a diversity of 5′-modified long and short RNAs

The transcriptomes of eukaryotic cells are incredibly complex. Non-coding RNAs far outnumber protein-coding genes, and include classes that are well understood as well as classes whose nature, extent and functional roles remain obscure. Deep sequencing of small RNAs (<200 nucleotides) from human HeLa and HepG2 cells revealed a remarkable breadth of species. These arose both from within annotated genes and from unannotated intergenic regions. Overall, small RNAs tended to align with CAGE (cap-analysis of gene expression) tags, which mark the 5′ ends of capped, long RNA transcripts. Many small RNAs, including the previously described promoter-associated small RNAs, appeared to possess cap structures. Members of an extensive class of both small RNAs and CAGE tags were distributed across internal exons of annotated protein-coding and non-coding genes, sometimes crossing exon–exon junctions. Here we show that processing of mature mRNAs through an as yet unknown mechanism may generate complex populations of both long and short RNAs whose apparently capped 5′ ends coincide. Supplying synthetic promoter-associated small RNAs corresponding to the c-MYC transcriptional start site reduced MYC messenger RNA abundance. The studies presented here expand the catalogue of cellular small RNAs and demonstrate a biological impact for at least one class of non-canonical small RNAs.
