JAMI: fast computation of conditional mutual information for ceRNA network analysis

Abstract Motivation Genome-wide measurements of paired miRNA and gene expression data have enabled the prediction of competing endogenous RNAs (ceRNAs). It has been shown that the sponge effect mediated by protein-coding as well as non-coding ceRNAs can play an important regulatory role in the cell in health and disease. Therefore, many computational methods for the computational identification of ceRNAs have been suggested. In particular, methods based on Conditional Mutual Information (CMI) have shown promising results. However, the currently available implementation is slow and cannot be used to perform computations on a large scale. Results Here, we present JAMI, a Java tool that uses a non-parametric estimator for CMI values from gene and miRNA expression data. We show that JAMI speeds up the computation of ceRNA networks by a factor of ∼70 compared to currently available implementations. Further, JAMI supports multi-threading to make use of common multi-core architectures for further performance gain. Requirements Java 8. Availability and implementation JAMI is available as open-source software from https://github.com/SchulzLab/JAMI. Supplementary information Supplementary data are available at Bioinformatics online.

[1]  P. Pandolfi,et al.  A ceRNA Hypothesis: The Rosetta Stone of a Hidden RNA Language? , 2011, Cell.

[2]  Alexander van Oudenaarden,et al.  Genome-wide dissection of microRNA functions and cotargeting networks using gene set signatures. , 2010, Molecular cell.

[3]  Prahlad T. Ram,et al.  Cupid: simultaneous reconstruction of microRNA-target and ceRNA networks , 2015, Genome research.

[4]  Yadong Wang,et al.  miR2Disease: a manually curated database for microRNA deregulation in human disease , 2008, Nucleic Acids Res..

[5]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumors , 2012, Nature.

[6]  C. Sander,et al.  Target mRNA abundance dilutes microRNA and siRNA activity , 2010, Molecular systems biology.

[7]  Hsien-Da Huang,et al.  miRTarBase 2016: updates to the experimentally validated miRNA-target interactions database , 2015, Nucleic Acids Res..

[8]  P. Pandolfi,et al.  A coding-independent function of gene and pseudogene mRNAs regulates tumour biology , 2010, Nature.

[9]  Xuerui Yang,et al.  High-throughput validation of ceRNA regulatory networks , 2017, BMC Genomics.

[10]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[11]  P. Pandolfi,et al.  The multilayered complexity of ceRNA crosstalk and competition , 2014, Nature.

[12]  Tao Huang,et al.  Cancer-Related Triplets of mRNA-lncRNA-miRNA Revealed by Integrative Network in Uterine Corpus Endometrial Carcinoma , 2017, BioMed research international.

[13]  Junpeng Zhang,et al.  Computational methods for identifying miRNA sponge interactions , 2016, Briefings Bioinform..

[14]  Igor Vajda,et al.  Estimation of the Information by an Adaptive Partitioning of the Observation Space , 1999, IEEE Trans. Inf. Theory.

[15]  Yufei Huang,et al.  Gene Regulation, Modulation, and Their Applications in Gene Expression Data Analysis , 2013, Adv. Bioinformatics.

[16]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumours , 2013 .

[17]  Lorenzo Farina,et al.  Computational analysis identifies a sponge interaction network between long non-coding RNAs and messenger RNAs in human breast cancer , 2014, BMC Systems Biology.

[18]  C. Burge,et al.  Most mammalian mRNAs are conserved targets of microRNAs. , 2008, Genome research.

[19]  Xuerui Yang,et al.  An Extensive MicroRNA-Mediated Network of RNA-RNA Interactions Regulates Established Oncogenic Pathways in Glioblastoma , 2011, Cell.

[20]  Xia Li,et al.  Identification of lncRNA-associated competing triplets reveals global patterns and prognostic markers for cancer , 2015, Nucleic acids research.