RNA‐Seq methods for transcriptome analysis

Deep sequencing has been revolutionizing biology and medicine in recent years, providing single base‐level precision for our understanding of nucleic acid sequences in high throughput fashion. Sequencing of RNA, or RNA‐Seq, is now a common method to analyze gene expression and to uncover novel RNA species. Aspects of RNA biogenesis and metabolism can be interrogated with specialized methods for cDNA library preparation. In this study, we review current RNA‐Seq methods for general analysis of gene expression and several specific applications, including isoform and gene fusion detection, digital gene expression profiling, targeted sequencing and single‐cell analysis. In addition, we discuss approaches to examine aspects of RNA in the cell, technical challenges of existing RNA‐Seq methods, and future directions. WIREs RNA 2017, 8:e1364. doi: 10.1002/wrna.1364

[1]  M. S. Ko,et al.  An 'equalized cDNA library' by the reassociation of short double-stranded cDNAs. , 1990, Nucleic acids research.

[2]  J. Craig Venter,et al.  Sequence identification of 2,375 human brain genes , 1992, Nature.

[3]  Ronald W. Davis,et al.  Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray , 1995, Science.

[4]  D. Lockhart,et al.  Expression monitoring by hybridization to high-density oligonucleotide arrays , 1996, Nature Biotechnology.

[5]  H. Seuánez,et al.  Detection of BCR-ABL Transcripts by Multiplex and Nested PCR in Different Haematological Disorders , 2000, Leukemia & lymphoma.

[6]  J. Rowley,et al.  Oligo(dT) primer generates a high frequency of truncated cDNAs through internal poly(A) priming during reverse transcription , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Ji Huang,et al.  [Serial analysis of gene expression]. , 2002, Yi chuan = Hereditas.

[8]  Guey-Shin Wang,et al.  Splicing in disease: disruption of the splicing code and the decoding machinery , 2007, Nature Reviews Genetics.

[9]  Tyson A. Clark,et al.  HITS-CLIP yields genome-wide insights into brain alternative RNA processing , 2008, Nature.

[10]  Eric T. Wang,et al.  Alternative Isoform Regulation in Human Tissue Transcriptomes , 2008, Nature.

[11]  Leighton J. Core,et al.  Nascent RNA Sequencing Reveals Widespread Pausing and Divergent Initiation at Human Promoters , 2008, Science.

[12]  Thomas Tuschl,et al.  Identification of microRNAs and other small regulatory RNAs using cDNA library sequencing. , 2008, Methods.

[13]  Ronghua Chen,et al.  Digital transcriptome profiling using selective hexamer priming for cDNA synthesis , 2009, Nature Methods.

[14]  Ryan D. Morin,et al.  Next-generation tag sequencing for cancer gene expression profiling. , 2009, Genome research.

[15]  Nicholas T. Ingolia,et al.  Genome-Wide Analysis in Vivo of Translation with Nucleotide Resolution Using Ribosome Profiling , 2009, Science.

[16]  Catalin C. Barbacioru,et al.  mRNA-Seq whole-transcriptome analysis of a single cell , 2009, Nature Methods.

[17]  Jeffrey G. Reifenberger,et al.  Direct RNA sequencing , 2009, Nature.

[18]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[19]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[20]  David I. Smith,et al.  3' tag digital gene expression profiling of human brain and universal reference RNA using Illumina Genome Analyzer , 2009, BMC Genomics.

[21]  Matthew Mort,et al.  Splicing factor SFRS1 recognizes a functionally diverse landscape of RNA transcripts. , 2009, Genome research.

[22]  T. Fennell,et al.  Targeted next-generation sequencing of a cancer transcriptome enhances detection of sequence variants and novel fusion transcripts , 2009, Genome Biology.

[23]  Gene W. Yeo,et al.  An RNA code for the FOX2 splicing regulator revealed by mapping RNA-protein interactions in stem cells , 2009, Nature Structural &Molecular Biology.

[24]  S. Tringe,et al.  Validation of two ribosomal RNA removal methods for microbial metatranscriptomics , 2010, Nature Methods.

[25]  Kristin Reiche,et al.  The primary transcriptome of the major human pathogen Helicobacter pylori , 2010, Nature.

[26]  E. Wang,et al.  Analysis and design of RNA sequencing experiments for identifying isoform regulation , 2010, Nature Methods.

[27]  Mohsen Khorshid,et al.  PAR-CliP - A Method to Identify Transcriptome-wide the Binding Sites of RNA Binding Proteins , 2010, Journal of visualized experiments : JoVE.

[28]  T. Clark,et al.  Multi-targeted priming for genome-wide gene expression assays , 2010, BMC Genomics.

[29]  N. Friedman,et al.  Comprehensive comparative analysis of strand-specific RNA sequencing methods , 2010, Nature Methods.

[30]  D. Haussler,et al.  FragSeq: transcriptome-wide RNA structure probing using high-throughput sequencing , 2010, Nature Methods.

[31]  Howard Y. Chang,et al.  Genome-wide measurement of RNA secondary structure in yeast , 2010, Nature.

[32]  J. Ule,et al.  iCLIP reveals the function of hnRNP particles in splicing at individual nucleotide resolution , 2010, Nature Structural &Molecular Biology.

[33]  Cole Trapnell,et al.  Multiplexed RNA structure characterization with selective 2′-hydroxyl acylation analyzed by primer extension sequencing (SHAPE-Seq) , 2011, Proceedings of the National Academy of Sciences.

[34]  R. Sachidanandam,et al.  Identification and remediation of biases in the activity of RNA ligases in small-RNA deep sequencing , 2011, Nucleic acids research.

[35]  Guihua Sun,et al.  A bias-reducing strategy in profiling small RNAs using Solexa. , 2011, RNA.

[36]  Peter J. Shepard,et al.  Complex and dynamic landscape of RNA polyadenylation revealed by PAS-Seq. , 2011, RNA.

[37]  Achim Tresch,et al.  Dynamic transcriptome analysis measures rates of mRNA synthesis and decay in yeast , 2011, Molecular systems biology.

[38]  D. Bartel,et al.  Formation, Regulation and Evolution of Caenorhabditis elegans 3′UTRs , 2010, Nature.

[39]  James A. Casbon,et al.  A method for counting PCR template molecules with application to next-generation sequencing , 2011, Nucleic acids research.

[40]  Eytan Domany,et al.  Coupled pre-mRNA and mRNA dynamics unveil operational strategies underlying transcriptional responses to stimuli , 2013 .

[41]  T. Borodina,et al.  A strand-specific library preparation protocol for RNA sequencing. , 2011, Methods in enzymology.

[42]  N. Friedman,et al.  Metabolic labeling of RNA uncovers principles of RNA production and degradation dynamics in mammalian cells , 2011, Nature Biotechnology.

[43]  J. Weissman,et al.  Nascent transcript sequencing visualizes transcription at nucleotide resolution , 2011, Nature.

[44]  Cassandra B. Jabara,et al.  Accurate sampling and deep sequencing of the HIV-1 protease gene using a Primer ID , 2011, Proceedings of the National Academy of Sciences.

[45]  Xiaobo Zhou,et al.  FusionQ: a novel approach for gene fusion detection and quantification from paired-end RNA-Seq , 2013, BMC Bioinformatics.

[46]  R. Sandberg,et al.  Full-Length mRNA-Seq from single cell levels of RNA and individual circulating tumor cells , 2012, Nature Biotechnology.

[47]  T. Babak,et al.  A quantitative atlas of polyadenylation in five mammals , 2012, Genome research.

[48]  S. Linnarsson,et al.  Counting absolute numbers of molecules using unique molecular identifiers , 2011, Nature Methods.

[49]  Tony Z. Jia,et al.  Digital RNA sequencing minimizes sequence-dependent bias and amplification noise with optimized single-molecule barcodes , 2012, Proceedings of the National Academy of Sciences.

[50]  T. Hashimshony,et al.  CEL-Seq: single-cell RNA-Seq by multiplexed linear amplification. , 2012, Cell reports.

[51]  I. Nookaew,et al.  A comprehensive comparison of RNA-Seq-based transcriptome analysis from reads to differential gene expression and cross-comparison with microarrays: a case study in Saccharomyces cerevisiae , 2012, Nucleic acids research.

[52]  H. Ueda,et al.  Erratum to: Quartz-Seq: a highly reproducible and sensitive single-cell RNA sequencing method, reveals non-genetic gene-expression heterogeneity , 2017, Genome Biology.

[53]  Jinsong Qiu,et al.  RASL‐seq for Massively Parallel and Quantitative Analysis of Gene Expression , 2012, Current protocols in molecular biology.

[54]  Charles Gawad,et al.  Circular RNAs Are the Predominant Transcript Isoform from Hundreds of Human Genes in Diverse Cell Types , 2012, PloS one.

[55]  W. Wong,et al.  Improving PacBio Long Read Accuracy by Short Read Alignment , 2012, PloS one.

[56]  Zachary W. Bent,et al.  Peregrine , 2013, RNA biology.

[57]  E. Shapiro,et al.  Single-cell sequencing-based technologies will revolutionize whole-organism science , 2013, Nature Reviews Genetics.

[58]  Wing Hung Wong,et al.  Characterization of the human ESC transcriptome by hybrid sequencing , 2013, Proceedings of the National Academy of Sciences.

[59]  Siu-Ming Yiu,et al.  SOAPfusion: a robust and effective computational fusion discovery tool for RNA-seq reads , 2013, Bioinform..

[60]  Krishna R. Kalari,et al.  Impact of Library Preparation on Downstream Analysis and Interpretation of RNA-Seq Data: Comparison between Illumina PolyA and NuGEN Ovation Protocol , 2013, PloS one.

[61]  Vicent Pelechano,et al.  An efficient method for genome-wide polyadenylation site mapping and RNA quantification , 2013, Nucleic acids research.

[62]  Aviv Regev,et al.  Corrigendum: Comparative analysis of RNA sequencing methods for degraded or low-input samples , 2013, Nature Methods.

[63]  B. Tian,et al.  Alternative cleavage and polyadenylation: the long and short of it. , 2013, Trends in biochemical sciences.

[64]  Haixu Tang,et al.  A new method for stranded whole transcriptome RNA-seq. , 2013, Methods.

[65]  S. Donatelli,et al.  State-of-the-Art Fusion-Finder Algorithms Sensitivity and Specificity , 2013, BioMed research international.

[66]  G. Yehia,et al.  Analysis of alterative cleavage and polyadenylation by 3′ region extraction and deep sequencing , 2012, Nature Methods.

[67]  Erik Willems,et al.  Quantitative Transcriptomics using Designed Primer-based Amplification , 2013, Scientific Reports.

[68]  Aviv Regev,et al.  Comprehensive comparative analysis of RNA sequencing methods for degraded or low input samples , 2013, Nature Methods.

[69]  Olga V. Britanova,et al.  Preparing Unbiased T-Cell Receptor and Antibody cDNA Libraries for the Deep Next Generation Sequencing Profiling , 2013, Front. Immunol..

[70]  A. Hartmann,et al.  Sensitive and specific detection of EML4-ALK rearrangements in non-small cell lung cancer (NSCLC) specimens by multiplex amplicon RNA massive parallel sequencing. , 2014, Lung cancer.

[71]  Gioele La Manno,et al.  Quantitative single-cell RNA-seq with unique molecular identifiers , 2013, Nature Methods.

[72]  Marcel E Dinger,et al.  Targeted sequencing for gene discovery and quantification using RNA CaptureSeq , 2014, Nature Protocols.

[73]  George M. Church,et al.  Highly Multiplexed Subcellular RNA Sequencing in Situ , 2014, Science.

[74]  V. Kim,et al.  TAIL-seq: genome-wide determination of poly(A) tail length and 3' end modifications. , 2014, Molecular cell.

[75]  Sharon R Grossman,et al.  RNA-RNA Interactions Enable Specific Targeting of Noncoding RNAs to Nascent Pre-mRNAs and Chromatin Sites , 2014, Cell.

[76]  Åsa K. Björklund,et al.  Tn5 transposase and tagmentation procedures for massively scaled sequencing projects , 2014, Genome research.

[77]  C. Perou,et al.  Comparison of RNA-Seq by poly (A) capture, ribosomal RNA depletion, and DNA microarray for expression profiling , 2014, BMC Genomics.

[78]  Manolis Kellis,et al.  Genome-wide probing of RNA structure reveals active unfolding of mRNA structures in vivo , 2013, Nature.

[79]  Sheng Li,et al.  Multi-platform assessment of transcriptome profiling using RNA-seq in the ABRF next-generation sequencing study , 2014, Nature Biotechnology.

[80]  I. Amit,et al.  Massively Parallel Single-Cell RNA-Seq for Marker-Free Decomposition of Tissues into Cell Types , 2014, Science.

[81]  Itai Yanai,et al.  Seeing is believing: new methods for in situ single-cell transcriptomics , 2014, Genome Biology.

[82]  A. Oudenaarden,et al.  Validation of noise models for single-cell transcriptomics , 2014, Nature Methods.

[83]  H. Lehrach,et al.  Influence of RNA extraction methods and library selection schemes on RNA-seq data , 2014, BMC Genomics.

[84]  Yusuke Nakamura,et al.  Quantitative T cell repertoire analysis by deep cDNA sequencing of T cell receptor α and β chains using next-generation sequencing (NGS) , 2014, Oncoimmunology.

[85]  Francis Doyle,et al.  Profiling post-transcriptionally networked mRNA subsets using RIP-Chip and RIP-Seq. , 2014, Methods.

[86]  S. P. Fodor,et al.  Digital Encoding of Cellular mRNAs Enabling Precise and Absolute Gene Expression Measurement by Single-Molecule Counting , 2014, Analytical chemistry.

[87]  A. Nicholson Ribonuclease III mechanisms of double-stranded RNA cleavage , 2013, Wiley interdisciplinary reviews. RNA.

[88]  A. Saliba,et al.  Single-cell RNA-seq: advances and future challenges , 2014, Nucleic acids research.

[89]  T. Preiss,et al.  Selective and flexible depletion of problematic sequences from RNA-seq libraries at the cDNA stage , 2014, BMC Genomics.

[90]  B. Tian,et al.  RNA-binding proteins in regulation of alternative cleavage and polyadenylation. , 2014, Advances in experimental medicine and biology.

[91]  C. Thermes,et al.  Library preparation methods for next-generation sequencing: tone down the bias. , 2014, Experimental cell research.

[92]  S. P. Fodor,et al.  Molecular indexing enables quantitative targeted RNA sequencing and reveals poor efficiencies in standard library preparations , 2014, Proceedings of the National Academy of Sciences.

[93]  L. Notarangelo,et al.  Next Generation Sequencing Reveals Skewing of the T and B Cell Receptor Repertoires in Patients with Wiskott–Aldrich Syndrome , 2014, Front. Immunol..

[94]  C. Plessy,et al.  Targeted reduction of highly abundant transcripts with pseudo-random primers , 2015, bioRxiv.

[95]  Torsten Seemann,et al.  PAT-seq: a method to study the integration of 3′-UTR dynamics with gene expression in the eukaryotic transcriptome , 2015, RNA.

[96]  Alex P. Reynolds,et al.  Native Elongating Transcript Sequencing Reveals Human Transcriptional Activity at Nucleotide Resolution , 2015, Cell.

[97]  Giulio Pavesi,et al.  RIP-Seq data analysis to determine RNA-protein associations. , 2015, Methods in molecular biology.

[98]  Kin-Fan Au,et al.  PacBio Sequencing and Its Applications , 2015, Genom. Proteom. Bioinform..

[99]  Anton J. Enright,et al.  Quantitative gene profiling of long noncoding RNAs with targeted RNA sequencing , 2015, Nature Methods.

[100]  Yixing Han,et al.  Advanced Applications of RNA Sequencing and Challenges , 2015, Bioinformatics and biology insights.

[101]  Hong Tang,et al.  Employment of digital gene expression profiling to identify potential pathogenic and therapeutic targets of fulminant hepatic failure , 2015, Journal of Translational Medicine.

[102]  Howard Y. Chang,et al.  Structural imprints in vivo decode RNA regulatory mechanisms , 2015, Nature.

[103]  Haixu Tang,et al.  Stranded Whole Transcriptome RNA‐Seq for All RNA Types , 2015, Current protocols in human genetics.

[104]  M. Dimon,et al.  An Efficient Method for Identifying Gene Fusions by Targeted RNA Sequencing from Fresh Frozen and FFPE Samples , 2015, PloS one.

[105]  Li Yang Splicing noncoding RNAs from the inside out , 2015, Wiley interdisciplinary reviews. RNA.

[106]  Aleksandra A. Kolodziejczyk,et al.  The technology and biology of single-cell RNA sequencing. , 2015, Molecular cell.

[107]  Michael F. Covington,et al.  BrAD-seq: Breath Adapter Directional sequencing: a streamlined, ultra-simple and fast library preparation protocol for strand specific mRNA library construction , 2015, Front. Plant Sci..

[108]  T. Preiss,et al.  Probe‐Directed Degradation (PDD) for Flexible Removal of Unwanted cDNA Sequences from RNA‐Seq Libraries , 2015, Current protocols in human genetics.

[109]  Allon M. Klein,et al.  Droplet Barcoding for Single-Cell Transcriptomics Applied to Embryonic Stem Cells , 2015, Cell.

[110]  Evan Z. Macosko,et al.  Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets , 2015, Cell.

[111]  Songnian Hu,et al.  Full-length transcriptome sequences and splice variants obtained by a combination of sequencing platforms applied to different root tissues of Salvia miltiorrhiza and tanshinone biosynthesis. , 2015, The Plant journal : for cell and molecular biology.

[112]  H. Gronemeyer,et al.  TARDIS, a targeted RNA directional sequencing method for rare RNA discovery , 2015, Nature Protocols.

[113]  Guo-Cheng Yuan,et al.  Single-Cell Analysis in Cancer Genomics. , 2015, Trends in genetics : TIG.

[114]  S. Teichmann,et al.  Computational and analytical challenges in single-cell transcriptomics , 2015, Nature Reviews Genetics.

[115]  S. P. Fodor,et al.  Combinatorial labeling of single cells for gene expression cytometry , 2015, Science.

[116]  Sijia Lu,et al.  Single Cell Transcriptome Amplification with MALBAC , 2015, PloS one.

[117]  T. Blauwkamp,et al.  Comprehensive transcriptome analysis using synthetic long-read sequencing reveals molecular co-association of distant splicing events , 2015, Nature Biotechnology.

[118]  H. Kimura,et al.  Mammalian NET-Seq Reveals Genome-wide Nascent Transcription Coupled to RNA Processing , 2015, Cell.

[119]  Eric Samorodnitsky,et al.  Evaluation of Hybridization Capture Versus Amplicon‐Based Methods for Whole‐Exome Sequencing , 2015, Human mutation.

[120]  M. Moore,et al.  An optimized kit-free method for making strand-specific deep sequencing libraries from RNA fragments , 2014, Nucleic acids research.

[121]  C. Antonescu,et al.  Gene fusions in soft tissue tumors: Recurrent and overlapping pathogenetic themes , 2016, Genes, chromosomes & cancer.

[122]  C. Plessy,et al.  Targeted reduction of highly abundant transcripts using pseudo-random primers. , 2016, BioTechniques.

[123]  U. Ohler,et al.  Identifying RBP Targets with RIP-seq. , 2016, Methods in molecular biology.