Simultaneous mapping of transcript ends at single-nucleotide resolution and identification of widespread promoter-associated non-coding RNA governed by TATA elements

Understanding the relationships between regulatory factor binding, chromatin structure, cis-regulatory elements and RNA-regulation mechanisms relies on accurate information about transcription start sites (TSS) and polyadenylation sites (PAS). Although several approaches have identified transcript ends in yeast, limitations of resolution and coverage have remained, and definitive identification of TSS and PAS with single-nucleotide resolution has not yet been achieved. We developed SMORE-seq (simultaneous mapping of RNA ends by sequencing) and used it to simultaneously identify the strongest TSS for 5207 (90%) genes and PAS for 5277 (91%) genes. The new transcript annotations identified by SMORE-seq showed improved distance relationships with TATA-like regulatory elements, nucleosome positions and active RNA polymerase. We found 150 genes whose TSS were downstream of the annotated start codon, and additional analysis of evolutionary conservation and ribosome footprinting suggests that these protein-coding sequences are likely to be mis-annotated. SMORE-seq detected short non-coding RNAs transcribed divergently from more than a thousand promoters in wild-type cells under normal conditions. These divergent non-coding RNAs were less evident at promoters containing canonical TATA boxes, suggesting a model where transcription initiation at promoters by RNAPII is bidirectional, with TATA elements serving to constrain the directionality of initiation.

[1]  B. Pugh,et al.  Genome-wide Nucleosome Specificity and Directionality of Chromatin Remodelers , 2012, Cell.

[2]  Wolfgang Huber,et al.  A high-resolution map of transcription in the yeast genome. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[4]  R. Elkon,et al.  The Poly(A)-Binding Protein Nuclear 1 Suppresses Alternative Cleavage and Polyadenylation Sites , 2012, Cell.

[5]  S. Loeillet,et al.  XUTs are a class of Xrn1-sensitive antisense regulatory non-coding RNA in yeast , 2011, Nature.

[6]  Struan C. Murray,et al.  A pre-initiation complex at the 3′-end of genes drives antisense transcription independent of divergent sense transcription , 2011, Nucleic acids research.

[7]  Peter J. Shepard,et al.  Complex and dynamic landscape of RNA polyadenylation revealed by PAS-Seq. , 2011, RNA.

[8]  N. Friedman,et al.  Strand-specific RNA sequencing reveals extensive regulated long antisense transcripts that are conserved across yeast species , 2010, Genome Biology.

[9]  V. Iyer,et al.  Absolute mRNA levels and transcriptional initiation rates in Saccharomyces cerevisiae. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[10]  F. Dietrich,et al.  Mapping of transcription start sites in Saccharomyces cerevisiae using 5′ SAGE , 2005, Nucleic acids research.

[11]  Chong-Jian Chen,et al.  Differential genome-wide profiling of tandem 3' UTRs among human breast cancer and normal cells by high-throughput sequencing. , 2011, Genome research.

[12]  L. Steinmetz,et al.  Bidirectional promoters generate pervasive transcription in yeast , 2009, Nature.

[13]  B. Pugh,et al.  Genome-wide structure and organization of eukaryotic pre-initiation complexes , 2011, Nature.

[14]  Steven J. M. Jones,et al.  Dynamic Remodeling of Individual Nucleosomes Across a Eukaryotic Genome in Response to Transcriptional Perturbation , 2007, PLoS biology.

[15]  Marcel Martin Cutadapt removes adapter sequences from high-throughput sequencing reads , 2011 .

[16]  Nicholas T. Ingolia,et al.  Genome-Wide Analysis in Vivo of Translation with Nucleotide Resolution Using Ribosome Profiling , 2009, Science.

[17]  K. Struhl,et al.  Species-specific factors mediate extensive heterogeneity of mRNA 3′ ends in yeasts , 2013, Proceedings of the National Academy of Sciences.

[18]  Jing Zhao,et al.  Formation of mRNA 3′ Ends in Eukaryotes: Mechanism, Regulation, and Interrelationships with Other Steps in mRNA Synthesis , 1999, Microbiology and Molecular Biology Reviews.

[19]  Y. Pilpel,et al.  Determinants of translation efficiency and accuracy , 2011, Molecular systems biology.

[20]  B. Birren,et al.  Sequencing and comparison of yeast species to identify genes and regulatory elements , 2003, Nature.

[21]  R. Tjian,et al.  Orchestrated response: a symphony of transcription factors for gene control. , 2000, Genes & development.

[22]  Hong Ma,et al.  Stable and dynamic nucleosome states during a meiotic developmental process. , 2011, Genome research.

[23]  L. Steinmetz,et al.  Extensive transcriptional heterogeneity revealed by isoform profiling , 2013, Nature.

[24]  L. Fulton,et al.  Finding Functional Features in Saccharomyces Genomes by Phylogenetic Footprinting , 2003, Science.

[25]  M. Gerstein,et al.  The Transcriptional Landscape of the Yeast Genome Defined by RNA Sequencing , 2008, Science.

[26]  Cizhong Jiang,et al.  A compiled and systematic reference map of nucleosome positions across the Saccharomyces cerevisiae genome , 2009, Genome Biology.

[27]  G. Yehia,et al.  Analysis of alterative cleavage and polyadenylation by 3′ region extraction and deep sequencing , 2012, Nature Methods.

[28]  C. Mello,et al.  CapSeq and CIP-TAP Identify Pol II Start Sites and Reveal Capped Small RNAs as C. elegans piRNA Precursors , 2012, Cell.

[29]  M. Hattori,et al.  A large-scale full-length cDNA analysis to explore the budding yeast transcriptome , 2006, Proceedings of the National Academy of Sciences.

[30]  S. Malik,et al.  Mediator-regulated transcription through the +1 nucleosome. , 2012, Molecular cell.

[31]  B. Pugh,et al.  Comprehensive Genome-wide Protein-DNA Interactions Detected at Single-Nucleotide Resolution , 2011, Cell.

[32]  Christopher B. Burge,et al.  Promoter directionality is controlled by U1 snRNP and polyadenylation signals , 2013, Nature.

[33]  D. Haussler,et al.  Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. , 2005, Genome research.

[34]  D. Bartel,et al.  Formation, Regulation and Evolution of Caenorhabditis elegans 3′UTRs , 2010, Nature.

[35]  Piero Carninci,et al.  High-throughput verification of transcriptional starting sites by Deep-RACE. , 2009, BioTechniques.

[36]  P. Kapranov,et al.  Comprehensive Polyadenylation Site Maps in Yeast and Human Reveal Pervasive Alternative Polyadenylation , 2010, Cell.

[37]  Vishwanath R Iyer,et al.  Nucleosome positioning: bringing order to the eukaryotic genome. , 2012, Trends in cell biology.

[38]  Judith B. Zaugg,et al.  Gene Loops Enhance Transcriptional Directionality , 2012, Science.

[39]  A. Shyu,et al.  Mechanisms of deadenylation‐dependent decay , 2011, Wiley interdisciplinary reviews. RNA.

[40]  M. Kozak Point mutations define a sequence flanking the AUG initiator codon that modulates translation by eukaryotic ribosomes , 1986, Cell.

[41]  Kenta Nakai,et al.  Genome-wide characterization of transcriptional start sites in humans by integrative transcriptome analysis. , 2011, Genome research.

[42]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[43]  Piero Carninci,et al.  Automated Workflow for Preparation of cDNA for Cap Analysis of Gene Expression on a Single Molecule Sequencer , 2012, PloS one.

[44]  C. Gissi,et al.  Untranslated regions of mRNAs , 2002, Genome Biology.

[45]  Christophe Malabat,et al.  Widespread bidirectional promoters are the major source of cryptic transcripts in yeast , 2009, Nature.

[46]  S. Letovsky,et al.  Quantification of the yeast transcriptome by single-molecule sequencing , 2009, Nature Biotechnology.

[47]  Kevin Struhl,et al.  Nucleosome depletion at yeast terminators is not intrinsic and can occur by a transcriptional mechanism linked to 3’-end formation , 2010, Proceedings of the National Academy of Sciences.

[48]  G. Crooks,et al.  WebLogo: a sequence logo generator. , 2004, Genome research.

[49]  Michael Hampsey,et al.  Molecular Genetics of the RNA Polymerase II General Transcriptional Machinery , 1998, Microbiology and Molecular Biology Reviews.

[50]  B. Tian,et al.  Alternative cleavage and polyadenylation: the long and short of it. , 2013, Trends in biochemical sciences.

[51]  T. Babak,et al.  A quantitative atlas of polyadenylation in five mammals , 2012, Genome research.

[52]  B. Pugh,et al.  Identification and Distinct Regulation of Yeast TATA Box-Containing Genes , 2004, Cell.