Computational comparison of common event-based differential splicing tools: practical considerations for laboratory researchers

Background Computational tools analyzing RNA-sequencing data have boosted alternative splicing research by identifying and assessing differentially spliced genes. However, common alternative splicing analysis tools differ substantially in their statistical analyses and general performance. This report compares the computational performance (CPU utilization and RAM usage) of three event-level splicing tools; rMATS, MISO, and SUPPA2. Additionally, concordance between tool outputs was investigated. Results Log-linear relations were found between job times and dataset size in all splicing tools and all virtual machine (VM) configurations. MISO had the highest job times for all analyses, irrespective of VM size, while MISO analyses also exceeded maximum CPU utilization on all VM sizes. rMATS and SUPPA2 load averages were relatively low in both size and replicate comparisons, not nearing maximum CPU utilization in the VM simulating the lowest computational power (D2 VM). RAM usage in rMATS and SUPPA2 did not exceed 20% of maximum RAM in both size and replicate comparisons while MISO reached maximum RAM usage in D2 VM analyses for input size. Correlation coefficients of differential splicing analyses showed high correlation (β > 80%) between different tool outputs with the exception of comparisons of retained intron (RI) events between rMATS/MISO and rMATS/SUPPA2 (β < 60%). Conclusions Prior to RNA-seq analyses, users should consider job time, amount of replicates and splice event type of interest to determine the optimal alternative splicing tool. In general, rMATS is superior to both MISO and SUPPA2 in computational performance. Analysis outputs show high concordance between tools, with the exception of RI events.

[1]  Yan Wang,et al.  Mechanism of alternative splicing and its regulation. , 2015, Biomedical reports.

[2]  Y. Assaraf,et al.  Pre-mRNA splicing in cancer: the relevance in oncogenesis, treatment and drug resistance , 2015, Expert opinion on drug metabolism & toxicology.

[3]  J. Bertino,et al.  Enzymatic synthesis of folylpolyglutamates. Characterization of the reaction and its products. , 1980, The Journal of biological chemistry.

[4]  S. Salzberg,et al.  StringTie enables improved reconstruction of a transcriptome from RNA-seq reads , 2015, Nature Biotechnology.

[5]  G. Peters,et al.  The association of aberrant folylpolyglutamate synthetase splicing with ex vivo methotrexate resistance and clinical outcome in childhood acute lymphoblastic leukemia , 2016, Haematologica.

[6]  N. Lee,et al.  Aberrant RNA Splicing in Cancer and Drug Resistance , 2018, Cancers.

[7]  Y. Assaraf,et al.  Aberrant splicing of folylpolyglutamate synthetase as a novel mechanism of antifolate resistance in leukemia. , 2009, Blood.

[8]  Lan Lin,et al.  rMATS: Robust and flexible detection of differential alternative splicing from replicate RNA-Seq data , 2014, Proceedings of the National Academy of Sciences.

[9]  B. Prabhakar,et al.  Alternative splicing as a biomarker and potential target for drug discovery , 2015, Acta Pharmacologica Sinica.

[10]  G. Peters,et al.  Methotrexate resistance in relation to treatment outcome in childhood acute lymphoblastic leukemia , 2015, Journal of Hematology & Oncology.

[11]  Eric T. Wang,et al.  Alternative Isoform Regulation in Human Tissue Transcriptomes , 2008, Nature.

[12]  B. Shane,et al.  Evolution of drug resistance in CCRF-CEM human leukemia cells selected by intermittent methotrexate exposure. , 1995, Oncology research.

[13]  Björn Usadel,et al.  Trimmomatic: a flexible trimmer for Illumina sequence data , 2014, Bioinform..

[14]  B. Frey,et al.  Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing , 2008, Nature Genetics.

[15]  E. Giovannetti,et al.  Glucocorticoid Resistant Pediatric Acute Lymphoblastic Leukemia Samples Display Altered Splicing Profile and Vulnerability to Spliceosome Modulation , 2020, Cancers.

[16]  Arfa Mehmood,et al.  Systematic evaluation of differential splicing tools for RNA-seq studies , 2019, Briefings Bioinform..

[17]  E. Wang,et al.  Analysis and design of RNA sequencing experiments for identifying isoform regulation , 2010, Nature Methods.

[18]  E. Giovannetti,et al.  Using RNA-sequencing to Detect Novel Splice Variants Related to Drug Resistance in In Vitro Cancer Models. , 2016, Journal of visualized experiments : JoVE.

[19]  Miha Skalic,et al.  SUPPA2: fast, accurate, and uncertainty-aware differential splicing analysis across multiple conditions , 2016, Genome Biology.

[20]  Gael P. Alamancos,et al.  Methods to study splicing from high-throughput RNA sequencing data. , 2013, Methods in molecular biology.

[21]  Lior Pachter,et al.  Near-optimal probabilistic RNA-seq quantification , 2016, Nature Biotechnology.

[22]  F. Baralle,et al.  Alternative splicing as a regulator of development and tissue identity , 2017, Nature Reviews Molecular Cell Biology.

[23]  D. Baralle,et al.  RNA splicing in human disease and in the clinic. , 2017, Clinical science.

[24]  Yanmei Xu,et al.  Mechanism of alternative splicing and its regulation (Review) , 2015 .

[25]  G. Peters,et al.  Folylpolyglutamate synthetase splicing alterations in acute lymphoblastic leukemia are provoked by methotrexate and other chemotherapeutics and mediate chemoresistance , 2016, International journal of cancer.

[26]  E. Giovannetti,et al.  Splicing modulation as novel therapeutic strategy against diffuse malignant peritoneal mesothelioma , 2018, EBioMedicine.

[27]  R. Guigó,et al.  Modelling and simulating generic RNA-Seq experiments with the flux simulator , 2012, Nucleic acids research.

[28]  J. Bertino,et al.  Decreased folylpolyglutamate synthetase activity as a mechanism of methotrexate resistance in CCRF-CEM human leukemia sublines. , 1991, The Journal of biological chemistry.

[29]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[30]  Julie A. Dickerson,et al.  Comparisons of computational methods for differential alternative splicing detection using RNA-seq in plant systems , 2014, BMC Bioinformatics.

[31]  Donny D. Licatalosi,et al.  RNA processing and its regulation: global insights into biological networks , 2010, Nature Reviews Genetics.

[32]  W. Lems,et al.  Association of altered folylpolyglutamate synthetase pre-mRNA splicing with methotrexate unresponsiveness in early rheumatoid arthritis , 2020, Rheumatology.

[33]  Rob Patro,et al.  Salmon provides fast and bias-aware quantification of transcript expression , 2017, Nature Methods.

[34]  D. Rio,et al.  Mechanisms and Regulation of Alternative Pre-mRNA Splicing. , 2015, Annual review of biochemistry.