CeModule: an integrative framework for discovering regulatory patterns from genomic data in cancer

BackgroundNon-coding RNAs (ncRNAs) are emerging as key regulators and play critical roles in a wide range of tumorigenesis. Recent studies have suggested that long non-coding RNAs (lncRNAs) could interact with microRNAs (miRNAs) and indirectly regulate miRNA targets through competing interactions. Therefore, uncovering the competing endogenous RNA (ceRNA) regulatory mechanism of lncRNAs, miRNAs and mRNAs in post-transcriptional level will aid in deciphering the underlying pathogenesis of human polygenic diseases and may unveil new diagnostic and therapeutic opportunities. However, the functional roles of vast majority of cancer specific ncRNAs and their combinational regulation patterns are still insufficiently understood.ResultsHere we develop an integrative framework called CeModule to discover lncRNA, miRNA and mRNA-associated regulatory modules. We fully utilize the matched expression profiles of lncRNAs, miRNAs and mRNAs and establish a model based on joint orthogonality non-negative matrix factorization for identifying modules. Meanwhile, we impose the experimentally verified miRNA-lncRNA interactions, the validated miRNA-mRNA interactions and the weighted gene-gene network into this framework to improve the module accuracy through the network-based penalties. The sparse regularizations are also used to help this model obtain modular sparse solutions. Finally, an iterative multiplicative updating algorithm is adopted to solve the optimization problem.ConclusionsWe applied CeModule to two cancer datasets including ovarian cancer (OV) and uterine corpus endometrial carcinoma (UCEC) obtained from TCGA. The modular analysis indicated that the identified modules involving lncRNAs, miRNAs and mRNAs are significantly associated and functionally enriched in cancer-related biological processes and pathways, which may provide new insights into the complex regulatory mechanism of human diseases at the system level.

[1]  Xing Chen,et al.  LncRNADisease: a database for long-non-coding RNA-associated diseases , 2012, Nucleic Acids Res..

[2]  Francisco Facchinei,et al.  Solving quasi-variational inequalities via their KKT conditions , 2014, Math. Program..

[3]  Chris H. Q. Ding,et al.  Orthogonal nonnegative matrix t-factorizations for clustering , 2006, KDD '06.

[4]  Hui Zhou,et al.  starBase v2.0: decoding miRNA-ceRNA, miRNA-ncRNA and protein–RNA interaction networks from large-scale CLIP-Seq data , 2013, Nucleic Acids Res..

[5]  Cheng Liang,et al.  Predicting MicroRNA-Disease Associations Using Kronecker Regularized Least Squares Based on Heterogeneous Omics Data , 2017, IEEE Access.

[6]  D. Spector,et al.  The noncoding RNA MALAT1 is a critical regulator of the metastasis phenotype of lung cancer cells. , 2013, Cancer research.

[7]  Xiao Liu,et al.  Semi-supervised community detection based on non-negative matrix factorization with node popularity , 2017, Inf. Sci..

[8]  Hyunju Lee,et al.  A Computational Approach to Identifying Gene-microRNA Modules in Cancer , 2015, PLoS Comput. Biol..

[9]  Andrew K. Godwin,et al.  Differential Expressions of Adhesive Molecules and Proteases Define Mechanisms of Ovarian Tumor Cell Matrix Penetration/Invasion , 2011, PloS one.

[10]  Hsien-Da Huang,et al.  miRTarBase update 2014: an information resource for experimentally validated miRNA-target interactions , 2013, Nucleic Acids Res..

[11]  Jiuyong Li,et al.  A novel framework for inferring condition-specific TF and miRNA co-regulation of protein-protein interactions. , 2016, Gene.

[12]  E. Marcotte,et al.  Prioritizing candidate disease genes by network-based boosting of genome-wide association data. , 2011, Genome research.

[13]  Cheng Liang,et al.  Identifying lncRNA and mRNA Co-Expression Modules from Matched Expression Data in Ovarian Cancer , 2020, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[14]  Yidong Chen,et al.  Differential correlation analysis of glioblastoma reveals immune ceRNA interactions predictive of patient survival , 2017, BMC Bioinformatics.

[15]  Jiawei Luo,et al.  A novel approach for predicting microRNA-disease associations by unbalanced bi-random walk on heterogeneous network , 2017, J. Biomed. Informatics.

[16]  Haitao Zhao,et al.  Long Noncoding RNA Plays a Key Role in Metastasis and Prognosis of Hepatocellular Carcinoma , 2014, BioMed research international.

[17]  Jing Xu,et al.  Down-regulation of focal adhesion signaling in response to cyclophilin A knockdown in human endometrial cancer cells, implicated by cDNA microarray analysis. , 2013, Gynecologic oncology.

[18]  P. Pandolfi,et al.  A coding-independent function of gene and pseudogene mRNAs regulates tumour biology , 2010, Nature.

[19]  Tongbin Li,et al.  miRecords: an integrated resource for microRNA–target interactions , 2008, Nucleic Acids Res..

[20]  Nectarios Koziris,et al.  TarBase 6.0: capturing the exponential growth of miRNA targets with experimental support , 2011, Nucleic Acids Res..

[21]  J. Uhm Comprehensive genomic characterization defines human glioblastoma genes and core pathways , 2009 .

[22]  Cheng Liang,et al.  A graph regularized non-negative matrix factorization method for identifying microRNA-disease associations , 2018, Bioinform..

[23]  M. Todesco,et al.  Target mimicry provides a new mechanism for regulation of microRNA activity , 2007, Nature Genetics.

[24]  Patrik O. Hoyer,et al.  Non-negative Matrix Factorization with Sparseness Constraints , 2004, J. Mach. Learn. Res..

[25]  Chris Sander,et al.  Comprehensive Analysis of Long Non-Coding RNAs in Ovarian Cancer Reveals Global Patterns and Targeted DNA Amplification , 2013, PloS one.

[26]  Wenwen Min,et al.  A Two-Stage Method to Identify Joint Modules From Matched MicroRNA and mRNA Expression Data , 2016, IEEE Transactions on NanoBioscience.

[27]  Xantha Karp,et al.  Encountering MicroRNAs in Cell Fate Signaling , 2005, Science.

[28]  P. Pandolfi,et al.  The multilayered complexity of ceRNA crosstalk and competition , 2014, Nature.

[29]  J. Miguel Villas-Boas,et al.  Optimal Search for Product Information , 2010, Manag. Sci..

[30]  P. Sarnow,et al.  Modulation of Hepatitis C Virus RNA Abundance by a Liver-Specific MicroRNA , 2005, Science.

[31]  Núria Queralt-Rosinach,et al.  DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants , 2016, Nucleic Acids Res..

[32]  Bing Shi,et al.  TAM: A method for enrichment and depletion analysis of a microRNA category in a list of microRNAs , 2010, BMC Bioinformatics.

[33]  Xiangxiang Zeng,et al.  Integrative approaches for predicting microRNA function and prioritizing disease-related microRNA using biological interaction networks , 2016, Briefings Bioinform..

[34]  Hyunsoo Kim,et al.  Sparse Non-negative Matrix Factorizations via Alternating Non-negativity-constrained Least Squares , 2006 .

[35]  Ling Feng,et al.  Integrated Analysis of Long Noncoding RNA and mRNA Expression Profile in Advanced Laryngeal Squamous Cell Carcinoma , 2016, PloS one.

[36]  Seungjin Choi,et al.  Nonnegative Matrix Factorization with Orthogonality Constraints , 2010, J. Comput. Sci. Eng..

[37]  Cheng Liang,et al.  A Novel Method to Detect Functional microRNA Regulatory Modules by Bicliques Merging , 2016, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[38]  Bennani Younes,et al.  Controlling orthogonality constraints for better NMF clustering , 2014, 2014 International Joint Conference on Neural Networks (IJCNN).

[39]  Shi-Hua Zhang,et al.  Integrative analysis for identifying joint modular patterns of gene-expression and drug-response data , 2016, Bioinform..

[40]  Ke Liu,et al.  Linc2GO: a human LincRNA function annotation resource based on ceRNA hypothesis , 2013, Bioinform..

[41]  Yang Li,et al.  HMDD v2.0: a database for experimentally supported human microRNA and disease associations , 2013, Nucleic Acids Res..

[42]  P. Pandolfi,et al.  A ceRNA Hypothesis: The Rosetta Stone of a Hidden RNA Language? , 2011, Cell.

[43]  Martin Reczko,et al.  DIANA-LncBase: experimentally verified and computationally predicted microRNA targets on long non-coding RNAs , 2012, Nucleic Acids Res..

[44]  Jianhua Dai,et al.  Computational Prediction of Human Disease- Associated circRNAs Based on Manifold Regularization Learning Framework , 2019, IEEE Journal of Biomedical and Health Informatics.

[45]  T. Xia,et al.  Long noncoding RNA associated-competing endogenous RNAs in gastric cancer , 2014, Scientific Reports.

[46]  Margaret S. Ebert,et al.  MicroRNA sponges: competitive inhibitors of small RNAs in mammalian cells , 2007, Nature Methods.

[47]  Jiayi Wang,et al.  CREB up-regulates long non-coding RNA, HULC expression through interaction with microRNA-372 in liver cancer , 2010, Nucleic acids research.

[48]  M. Byrom,et al.  Antisense inhibition of human miRNAs and indications for an involvement of miRNA in cell growth and apoptosis , 2005, Nucleic acids research.

[49]  Yu-Jin Zhang,et al.  Nonnegative Matrix Factorization: A Comprehensive Review , 2013, IEEE Transactions on Knowledge and Data Engineering.

[50]  A. Thor,et al.  p53 gene mutations and protein accumulation in human ovarian cancer. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[51]  Tingting Shao,et al.  The mRNA related ceRNA–ceRNA landscape and significance across 20 major cancer types , 2015, Nucleic acids research.

[52]  Jing Li,et al.  dbDEPC 2.0: updated database of differentially expressed proteins in human cancers , 2011, Nucleic Acids Res..

[53]  Cheng Liang,et al.  A novel motif-discovery algorithm to identify co-regulatory motifs in large transcription factor and microRNA co-regulatory networks in human , 2015, Bioinform..

[54]  Peng Wang,et al.  Lnc2Cancer: a manually curated database of experimentally supported lncRNAs associated with various human cancers , 2015, Nucleic Acids Res..

[55]  Yiran Huang,et al.  Downregulated MEG3 activates autophagy and increases cell proliferation in bladder cancer. , 2013, Molecular bioSystems.

[56]  Xiaolei Wang,et al.  Non-negative matrix factorization by maximizing correntropy for cancer clustering , 2013, BMC Bioinformatics.

[57]  Rui Xia,et al.  Downregulated long noncoding RNA MEG3 is associated with poor prognosis and promotes cell proliferation in gastric cancer , 2014, Tumor Biology.

[58]  Cheng Liang,et al.  Mirsynergy: detecting synergistic miRNA regulatory modules by overlapping neighbourhood expansion , 2014, Bioinform..

[59]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[60]  Jian Zhang,et al.  Construction and analysis of cardiac hypertrophy-associated lncRNA-mRNA network based on competitive endogenous RNA reveal functional lncRNAs in cardiac hypertrophy , 2016, Oncotarget.

[61]  Ana Kozomara,et al.  miRBase: annotating high confidence microRNAs using deep sequencing data , 2013, Nucleic Acids Res..

[62]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[63]  Wei Tang,et al.  dbDEMC 2.0: updated database of differentially expressed miRNAs in human cancers , 2016, Nucleic Acids Res..