Simple and efficient measurement of transcription initiation and transcript levels with STRIPE-seq

Accurate mapping of transcription start sites (TSSs) is key for understanding transcriptional regulation. However, current protocols for genome-wide TSS profiling are laborious and/or expensive. We present Survey of TRanscription Initiation at Promoter Elements with high-throughput sequencing (STRIPE-seq), a simple, rapid, and cost-effective protocol for sequencing capped RNA 5’ ends from as little as 50 ng total RNA. Including depletion of uncapped RNA and SPRI bead cleanups, a STRIPE-seq library can be constructed in about five hours. We demonstrate application of STRIPE-seq to TSS profiling in yeast and human cells and show that it can also be effectively used for measuring transcript levels and differential gene expression analysis. In conjunction with our ready-to-use computational analysis workflows, STRIPE-seq is a straightforward, efficient means by which to probe the landscape of transcriptional initiation.

[1]  Qixiang Zhang,et al.  Genome-wide characterization of PEBP family genes in nine Rosaceae tree species and their expression analysis in P. mume , 2021, BMC Ecology and Evolution.

[2]  Jiang Qian,et al.  EnhancerAtlas 2.0: an updated resource with enhancer annotation in 586 tissue/cell types across nine species , 2019, Nucleic Acids Res..

[3]  Ivan R. Corrêa,et al.  Non-templated addition and template switching by Moloney murine leukemia virus (MMLV)-based reverse transcriptases co-occur and compete with each other , 2019, The Journal of Biological Chemistry.

[4]  A. Sandelin,et al.  CAGEfightR: analysis of 5′-end data using R/Bioconductor , 2019, BMC Bioinformatics.

[5]  S. Nechaev,et al.  Genome-wide RNA pol II initiation and pausing in neural progenitors of the rat , 2019, BMC Genomics.

[6]  Hatice S. Kaya-Okur,et al.  CUT&Tag for efficient epigenomic profiling of small samples and single cells , 2019, Nature Communications.

[7]  M. Kellner,et al.  NanoPARE: parallel analysis of RNA 5′ ends from low-input RNA , 2018, Genome research.

[8]  Zhenguo Lin,et al.  Pervasive and dynamic transcription initiation in Saccharomyces cerevisiae , 2018, bioRxiv.

[9]  Gordon K Smyth,et al.  The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads , 2018, bioRxiv.

[10]  Piero Carninci,et al.  SLIC-CAGE: high-resolution transcription start site mapping using nanogram-levels of total RNA , 2018, bioRxiv.

[11]  Shintaro Iwasaki,et al.  Transcripts from downstream alternative transcription start sites evade uORF-mediated inhibition of gene expression in Arabidopsis , 2018, Proceedings of the National Academy of Sciences.

[12]  Aviv Regev,et al.  Comprehensive comparative analysis of 5’ end RNA sequencing methods , 2018, Nature Methods.

[13]  P. Tassone,et al.  MALAT1: a druggable long non-coding RNA for targeted anti-cancer approaches , 2018, Journal of Hematology & Oncology.

[14]  Anders Gorm Pedersen,et al.  Characterization of the enhancer and promoter landscape of inflammatory bowel disease from human colon biopsies , 2018, Nature Communications.

[15]  B. Lenhard,et al.  Distinct core promoter codes drive transcription initiation at key developmental transitions in a marine chordate , 2017, BMC Genomics.

[16]  D. Weinberg,et al.  Application of a Schizosaccharomyces pombe Edc1-fused Dcp1–Dcp2 decapping enzyme for transcription start site mapping , 2018, RNA.

[17]  Wolfgang Huber,et al.  Alternative start and termination sites of transcription drive most transcript isoform differences across human tissues , 2017, Nucleic acids research.

[18]  G. M. Kurtzer,et al.  Enhancing reproducibility in scientific computing: Metrics and registry for Singularity containers , 2017, PloS one.

[19]  C. Vollmers,et al.  Tn5Prime, a Tn5 based 5′ capture method for single cell RNA-seq , 2017, bioRxiv.

[20]  Piero Carninci,et al.  Systematic analysis of transcription start sites in avian development , 2017, PLoS biology.

[21]  Nuno A. Fonseca,et al.  A Pan-Cancer Transcriptome Analysis Reveals Pervasive Regulation through Tumor-Associated Alternative Promoters , 2018 .

[22]  Thomas J. Ha,et al.  Relatively frequent switching of transcription start sites during cerebellar development , 2017, BMC Genomics.

[23]  Vanessa Sochat,et al.  Singularity: Scientific containers for mobility of compute , 2017, PloS one.

[24]  Long Vo Ngoc,et al.  The human initiator is a distinct and abundant element that is precisely positioned in focused core promoters , 2017, Genes & development.

[25]  Matthew R. Krause,et al.  Single-cell profiling reveals that eRNA accumulation at enhancer–promoter loops is not required to sustain transcription , 2016, Nucleic acids research.

[26]  M. Harbers,et al.  NanoCAGE: A Method for the Analysis of Coding and Noncoding 5'-Capped Transcriptomes. , 2017, Methods in molecular biology.

[27]  平林茂樹,et al.  Cap Analysis of Gene Expression法 , 2017 .

[28]  Leighton J. Core,et al.  Base-pair-resolution genome-wide mapping of active RNA polymerases using precision nuclear run-on (PRO-seq) , 2016, Nature Protocols.

[29]  Wei Chen,et al.  Pervasive isoform‐specific translational regulation via alternative transcription start sites in mammals , 2016, Molecular systems biology.

[30]  A. Heger,et al.  UMI-tools: modeling sequencing errors in Unique Molecular Identifiers to improve quantification accuracy , 2016, bioRxiv.

[31]  Fidel Ramírez,et al.  deepTools2: a next generation web server for deep-sequencing data analysis , 2016, Nucleic Acids Res..

[32]  Sean Davis,et al.  Statistical Genomics. Methods and Protocols. , 2016, Anticancer research.

[33]  Bernd Bukau,et al.  Accurate prediction of cellular co-translational folding indicates proteins can switch from post- to co-translational folding , 2016, Nature Communications.

[34]  Florian Hahne,et al.  Visualizing Genomic Data Using Gviz and Bioconductor , 2016, Statistical Genomics.

[35]  A. Doseff,et al.  Core Promoter Plasticity Between Maize Tissues and Genotypes Contrasts with Predominance of Sharp Transcription Initiation Sites[OPEN] , 2015, Plant Cell.

[36]  Jason S. Cumbie,et al.  NanoCAGE-XL and CapFilter: an approach to genome wide identification of high confidence transcription start sites , 2015, BMC Genomics.

[37]  Qing-Yu He,et al.  ChIPseeker: an R/Bioconductor package for ChIP peak annotation, comparison and visualization , 2015, Bioinform..

[38]  Christophe Malabat,et al.  Quality control of transcription start site selection by nonsense-mediated-mRNA decay , 2015, eLife.

[39]  M. Rowicka,et al.  Strategies for Achieving High Sequencing Accuracy for Low Diversity Samples and Avoiding Sample Bleeding Using Illumina Platform , 2015, PloS one.

[40]  Boris Lenhard,et al.  CAGEr: precise TSS data retrieval and high-resolution promoterome mining for integrative analyses , 2015, Nucleic acids research.

[41]  Timo Lassmann,et al.  TagDust2: a generic method to extract reads from sequencing data , 2015, BMC Bioinformatics.

[42]  André L. Martins,et al.  Analysis of nascent RNA identifies a unified architecture of initiation regions at mammalian promoters and enhancers , 2014, Nature Genetics.

[43]  D. Corcoran,et al.  Paired-End Analysis of Transcription Start Sites in Arabidopsis Reveals Plant-Specific Promoter Signatures[C][W] , 2014, Plant Cell.

[44]  P. Lichter,et al.  Capture and Amplification by Tailing and Switching (CATS) , 2014, RNA biology.

[45]  C. Glass,et al.  Enhancer RNAs and regulated transcriptional programs. , 2014, Trends in biochemical sciences.

[46]  Björn Usadel,et al.  Trimmomatic: a flexible trimmer for Illumina sequence data , 2014, Bioinform..

[47]  T. Meehan,et al.  An atlas of active enhancers across human cell types and tissues , 2014, Nature.

[48]  Cesare Furlanello,et al.  A promoter-level mammalian expression atlas , 2015 .

[49]  Vishwanath R. Iyer,et al.  Simultaneous mapping of transcript ends at single-nucleotide resolution and identification of widespread promoter-associated non-coding RNA governed by TATA elements , 2014, Nucleic acids research.

[50]  Nan Li,et al.  Two independent transcription initiation codes overlap on vertebrate core promoters , 2014, Nature.

[51]  Piero Carninci,et al.  Detecting expressed genes using CAGE. , 2014, Methods in molecular biology.

[52]  Pawel Zajac,et al.  Base Preferences in Non-Templated Nucleotide Incorporation by MMLV-Derived Reverse Transcriptases , 2013, PloS one.

[53]  Åsa K. Björklund,et al.  Smart-seq2 for sensitive full-length transcriptome profiling in single cells , 2013, Nature Methods.

[54]  T. Gingeras,et al.  RAMPAGE: Promoter Activity Profiling by Paired‐End Sequencing of 5′‐Complete cDNAs , 2013, Current protocols in molecular biology.

[55]  Piero Carninci,et al.  Comparison of RNA- or LNA-hybrid oligonucleotides in template-switching reactions for high-speed sequencing library preparation , 2013, BMC Genomics.

[56]  Boris Lenhard,et al.  Dynamic regulation of the transcription initiation landscape at single nucleotide resolution during vertebrate embryogenesis , 2013, Genome research.

[57]  L. Romão,et al.  Gene Expression Regulation by Upstream Open Reading Frames and Human Disease , 2013, PLoS genetics.

[58]  Joshua A. Arribere,et al.  Roles for transcript leaders in translation and mRNA decay revealed by transcript leader sequencing , 2013, Genome research.

[59]  L. Steinmetz,et al.  Extensive transcriptional heterogeneity revealed by isoform profiling , 2013, Nature.

[60]  Piero Carninci,et al.  High-fidelity promoter profiling reveals widespread alternative promoter usage and transposon-driven developmental gene expression , 2013, Genome research.

[61]  Piero Carninci,et al.  Suppression of artifacts and barcode bias in high-throughput transcriptome analyses utilizing template switching , 2012, Nucleic acids research.

[62]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[63]  C. Mello,et al.  CapSeq and CIP-TAP Identify Pol II Start Sites and Reveal Capped Small RNAs as C. elegans piRNA Precursors , 2012, Cell.

[64]  M. Gut,et al.  Supplemental information for : “ CpG islands and GC content dictate nucleosome depletion in a transcription independent manner at mammalian promoters ” , 2012 .

[65]  Nadav S. Bar,et al.  Landscape of transcription in human cells , 2012, Nature.

[66]  N. Friedman,et al.  Systematic Dissection of Roles for Chromatin Regulators in a Yeast Stress Response , 2012, PLoS biology.

[67]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[68]  Piero Carninci,et al.  5′ end–centered expression profiling using cap-analysis gene expression and next-generation sequencing , 2012, Nature Protocols.

[69]  S. Linnarsson,et al.  Counting absolute numbers of molecules using unique molecular identifiers , 2011, Nature Methods.

[70]  Carsten Wiuf,et al.  Tumor-specific usage of alternative transcription start sites in colorectal cancer identified by genome-wide exon array analysis , 2011, BMC Genomics.

[71]  R. Sachidanandam,et al.  Identification and remediation of biases in the activity of RNA ligases in small-RNA deep sequencing , 2011, Nucleic acids research.

[72]  S. Luo,et al.  RNA-ligase-dependent biases in miRNA representation in deep-sequenced small RNA cDNA libraries. , 2011, RNA.

[73]  P. Scacheri,et al.  Epigenetic signatures distinguish multiple classes of enhancers with distinct cellular functions. , 2011, Genome research.

[74]  S. Linnarsson,et al.  Characterization of the single-cell transcriptional landscape by highly multiplex RNA-seq. , 2011, Genome research.

[75]  G. Veenstra,et al.  TBP-related factors: a paradigm of diversity in transcription initiation , 2011, Cell & Bioscience.

[76]  Marcel Martin Cutadapt removes adapter sequences from high-throughput sequencing reads , 2011 .

[77]  Kenta Nakai,et al.  Genome-wide characterization of transcriptional start sites in humans by integrative transcriptome analysis. , 2011, Genome research.

[78]  Ryan A. Flynn,et al.  A unique chromatin signature uncovers early developmental enhancers in humans , 2011, Nature.

[79]  Piero Carninci,et al.  Genome-wide analysis of promoter architecture in Drosophila melanogaster. , 2011, Genome research.

[80]  Cameron S. Osborne,et al.  Large Scale Loss of Data in Low-Diversity Illumina Sequencing Libraries Can Be Recovered by Deferred Cluster Calling , 2011, PloS one.

[81]  Hideaki Sugawara,et al.  The Sequence Read Archive , 2010, Nucleic Acids Res..

[82]  Andrew C. Adey,et al.  Rapid, low-input, low-bias construction of shotgun fragment libraries by high-density in vitro transposition , 2010, Genome Biology.

[83]  R. Young,et al.  Histone H3K27ac separates active from poised enhancers and predicts developmental state , 2010, Proceedings of the National Academy of Sciences.

[84]  Carsten O. Daub,et al.  Linking promoters to functional transcripts in small samples with nanoCAGE and CAGEscan , 2010, Nature Methods.

[85]  Uwe Ohler,et al.  A paired-end sequencing strategy to map the complex landscape of transcription initiation , 2010, Nature Methods.

[86]  J. Ragoussis,et al.  A Large Fraction of Extragenic RNA Pol II Transcription Sites Overlap Enhancers , 2010, PLoS biology.

[87]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[88]  D. Gang,et al.  Incorporation of non-natural nucleotides into template-switching oligonucleotides reduces background and improves cDNA synthesis from very small RNA samples , 2010, BMC Genomics.

[89]  M. Robinson,et al.  A scaling normalization method for differential expression analysis of RNA-seq data , 2010, Genome Biology.

[90]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[91]  V. Mootha,et al.  Upstream open reading frames cause widespread reduction of protein expression and are polymorphic among humans , 2009, Proceedings of the National Academy of Sciences.

[92]  Nathaniel D. Heintzman,et al.  Histone modifications at human enhancers reflect global cell-type-specific gene expression , 2009, Nature.

[93]  Jun Kawai,et al.  Genome-wide detection and analysis of hippocampus core promoters using DeepCAGE. , 2009, Genome research.

[94]  Sumio Sugano,et al.  The functional consequences of alternative promoter use in mammalian genomes. , 2008, Trends in genetics : TIG.

[95]  A. Krogh,et al.  A code for transcription initiation in mammalian genomes. , 2007, Genome research.

[96]  Kenta Nakai,et al.  DBTSS: database of transcription start sites, progress report 2008 , 2007, Nucleic Acids Res..

[97]  C. Kai,et al.  CAGE: cap analysis of gene expression , 2006, Nature Methods.

[98]  F. Dietrich,et al.  Mapping of transcription start sites in Saccharomyces cerevisiae using 5′ SAGE , 2005, Nucleic acids research.

[99]  M. Simonen,et al.  Dual regulation by heat and nutrient stress of the yeast HSP150 gene encoding a secretory glycoprotein , 1993, Molecular and General Genetics MGG.

[100]  J. Kawai,et al.  Cap analysis gene expression for high-throughput analysis of transcriptional starting point and identification of promoter usage , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[101]  F. J. Livesey Strategies for microarray analysis of limiting amounts of RNA. , 2003, Briefings in functional genomics & proteomics.

[102]  Sumio Sugano,et al.  Construction of a full-length enriched and a 5'-end enriched cDNA library using the oligo-capping method. , 2003, Methods in molecular biology.

[103]  T. Zhou Characterization of the Enhancer and Promoter Elements of the Human TAF(II)55 Gene , 2002 .

[104]  A. Chenchik,et al.  Reverse transcriptase template switching: a SMART approach for full-length cDNA library construction. , 2001, BioTechniques.

[105]  D. Botstein,et al.  Genomic expression programs in the response of yeast cells to environmental changes. , 2000, Molecular biology of the cell.

[106]  S. Lillard,et al.  In-situ sampling and separation of RNA from individual mammalian cells. , 2000, Analytical chemistry.

[107]  W. Schmidt,et al.  CapSelect: a highly sensitive method for 5' CAP-dependent enrichment of full-length cDNA in PCR-mediated analysis of mRNAs. , 1999, Nucleic acids research.

[108]  A. Hinnebusch,et al.  Translational Regulation of Yeast GCN4 , 1997, The Journal of Biological Chemistry.

[109]  G. Gerard,et al.  Reverse Transcriptase , 1997, Molecular biotechnology.

[110]  G. Barsh,et al.  Differences in dorsal and ventral pigmentation result from regional expression of the mouse agouti gene. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[111]  A. Hinnebusch,et al.  Multiple upstream AUG codons mediate translational control of GCN4 , 1986, Cell.

[112]  A. Forrest Quality control. , 1978, British medical journal.