Systematic evaluation of differential splicing tools for RNA-seq studies

Abstract: Differential splicing (DS) is a post-transcriptional biological process with critical, wide-ranging effects on many cellular activities and disease processes. A number of computational approaches have been developed to identify and quantify differentially spliced genes from RNA-seq data, but a comprehensive comparison and appraisal of these approaches is currently lacking. In this study, we systematically evaluated 10 DS analysis tools for consistency and reproducibility; precision, recall and false discovery rate; agreement on the reported differentially spliced genes; and functional enrichment. The tools were selected to represent three methodological categories: exon-based (DEXSeq, edgeR, JunctionSeq, limma), isoform-based (cuffdiff2, DiffSplice) and event-based (dSpliceType, MAJIQ, rMATS, SUPPA). All the exon-based methods and two event-based methods (MAJIQ and rMATS) scored well on the selected measures, and the exon-based methods generally performed better than the isoform-based and event-based methods. However, the performance of individual tools varied strikingly across data sets and numbers of samples.
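To make the evaluation measures concrete, the following is a minimal sketch of how precision, recall and false discovery rate could be computed for one tool's list of reported genes against a known truth set, and how agreement between two tools could be summarized. The function names, the gene identifiers and the choice of the Jaccard index as the agreement measure are illustrative assumptions, not the procedure used in the study.

```python
def precision_recall_fdr(called, truth):
    """Score a tool's reported DS genes against a known truth set.

    Both arguments are iterables of gene identifiers (hypothetical here).
    Returns (precision, recall, false discovery rate).
    """
    called, truth = set(called), set(truth)
    tp = len(called & truth)   # reported and truly differentially spliced
    fp = len(called - truth)   # reported but not in the truth set
    fn = len(truth - called)   # true DS genes the tool missed
    precision = tp / (tp + fp) if called else 0.0
    recall = tp / (tp + fn) if truth else 0.0
    fdr = fp / (tp + fp) if called else 0.0   # equals 1 - precision
    return precision, recall, fdr


def jaccard(genes_a, genes_b):
    """Agreement between two tools' gene lists (assumed measure: Jaccard index)."""
    a, b = set(genes_a), set(genes_b)
    union = a | b
    return len(a & b) / len(union) if union else 1.0


if __name__ == "__main__":
    # Made-up gene identifiers, for illustration only.
    truth = ["G1", "G2", "G5"]
    tool_x = ["G1", "G2", "G3", "G4"]
    tool_y = ["G2", "G3"]
    print(precision_recall_fdr(tool_x, truth))
    print(jaccard(tool_x, tool_y))
```

When a tool reports at least one gene, the false discovery rate is the complement of precision, so the two figures carry the same information in different forms.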
