RJunBase: a database of RNA splice junctions in human normal and cancerous tissues

Abstract Splicing is an essential step of RNA processing for multi-exon genes, in which introns are removed from a precursor RNA, thereby producing mature RNAs containing splice junctions. Here, we develope the RJunBase (www.RJunBase.org), a web-accessible database of three types of RNA splice junctions (linear, back-splice, and fusion junctions) that are derived from RNA-seq data of non-cancerous and cancerous tissues. The RJunBase aims to integrate and characterize all RNA splice junctions of both healthy or pathological human cells and tissues. This new database facilitates the visualization of the gene-level splicing pattern and the junction-level expression profile, as well as the demonstration of unannotated and tumor-specific junctions. The first release of RJunBase contains 682 017 linear junctions, 225 949 back-splice junctions and 34 733 fusion junctions across 18 084 non-cancerous and 11 540 cancerous samples. RJunBase can aid researchers in discovering new splicing-associated targets and provide insights into the identification and assessment of potential neoepitopes for cancer treatment.

[1]  Roland Eils,et al.  circlize implements and enhances circular visualization in R , 2014, Bioinform..

[2]  Min Zhao,et al.  ONGene: A literature-based database for human oncogenes. , 2017, Journal of genetics and genomics = Yi chuan xue bao.

[3]  Jun S. Liu,et al.  The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans , 2015, Science.

[4]  Julio Delgado,et al.  The U1 spliceosomal RNA is recurrently mutated in multiple cancers , 2019, Nature.

[5]  Feng Gao,et al.  Long‐Read RNA Sequencing Identifies Alternative Splice Variants in Hepatocellular Carcinoma and Tumor‐Specific Isoforms , 2019, Hepatology.

[6]  Yang Gao,et al.  CSCD: a database for cancer-specific circular RNAs , 2017, Nucleic Acids Res..

[7]  Steven J. M. Jones,et al.  Comprehensive Analysis of Alternative Splicing Across Tumors from 8,705 Patients , 2018, Cancer cell.

[8]  Xiaobo Zhou,et al.  FusionGDB: fusion gene annotation DataBase , 2018, Nucleic Acids Res..

[9]  E. Lehnert,et al.  Improved detection of gene fusions by applying statistical methods reveals oncogenic RNA cancer drivers , 2019, Proceedings of the National Academy of Sciences.

[10]  Robert Brown,et al.  TCGASpliceSeq a compendium of alternative mRNA splicing in cancer , 2015, Nucleic Acids Res..

[11]  Qin Li,et al.  ASJA: A Program for Assembling Splice Junctions Analysis , 2019, Computational and structural biotechnology journal.

[12]  M. Warmuth,et al.  H3B-8800, an orally available small-molecule splicing modulator, induces lethality in spliceosome-mutant cancers , 2018, Nature Medicine.

[13]  Omar Abdel-Wahab,et al.  Therapeutic targeting of splicing in cancer , 2016, Nature Medicine.

[14]  Zhongming Zhao,et al.  TSGene: a web resource for tumor suppressor genes , 2012, Nucleic Acids Res..

[15]  Liran Carmel,et al.  JuncDB: an exon–exon junction database , 2015, Nucleic Acids Res..

[16]  Yan Li,et al.  A LIN28B Tumor-Specific Transcript in Cancer. , 2018, Cell reports.

[17]  B. Blencowe,et al.  Alternative Splicing Regulatory Networks: Functions, Mechanisms, and Evolution. , 2019, Molecular cell.

[18]  D. Baltimore,et al.  Alternative mRNA splicing in cancer immunotherapy , 2019, Nature Reviews Immunology.

[19]  D. Hawke,et al.  A splicing switch from ketohexokinase-C to ketohexokinase-A drives hepatocellular carcinoma formation , 2016, Nature Cell Biology.

[20]  S. Dhanasekaran,et al.  The Landscape of Circular RNA in Cancer , 2019, Cell.

[21]  S. Salzberg,et al.  StringTie enables improved reconstruction of a transcriptome from RNA-seq reads , 2015, Nature Biotechnology.

[22]  B. Langmead,et al.  Human splicing diversity and the extent of unannotated splice junctions across human RNA-seq samples on the Sequence Read Archive , 2016, Genome Biology.

[23]  William D. Duncan,et al.  An ontology for representing hematologic malignancies: the cancer cell ontology , 2019, BMC Bioinformatics.

[24]  M. Carmo-Fonseca,et al.  Targeting mRNA processing as an anticancer strategy , 2019, Nature Reviews Drug Discovery.

[25]  E. Hoffman,et al.  A long-read RNA-seq approach to identify novel transcripts of very large genes , 2020, Genome research.

[26]  Yan Li,et al.  Tumor‐Specific Transcripts Are Frequently Expressed in Hepatocellular Carcinoma With Clinical Implication and Potential Function , 2020, Hepatology.

[27]  ETV4 Is Necessary for Estrogen Signaling and Growth in Endometrial Cancer Cells. , 2020, Cancer research.

[28]  R. Slotkin,et al.  Long-Read cDNA Sequencing Enables a "Gene-Like" Transcript Annotation of Transposable Elements. , 2020, The Plant cell.

[29]  Nathan Ungerleider,et al.  SpliceV: analysis and publication quality printing of linear and circular RNA splicing, expression and regulation , 2019, BMC Bioinform..

[30]  Matthew A. Wyczalkowski,et al.  Systematic Analysis of Splice-Site-Creating Mutations in Cancer , 2018, Cell reports.

[31]  Eric T. Wang,et al.  Alternative Isoform Regulation in Human Tissue Transcriptomes , 2008, Nature.

[32]  Gunnar Rätsch,et al.  A Pan-cancer Transcriptome Analysis Reveals Pervasive Regulation through Alternative Promoters , 2019, Cell.