A network-based algorithm for the identification of moonlighting noncoding RNAs and its application in sepsis

Moonlighting proteins provide more options for cells to execute multiple functions without increasing genome and transcriptome complexity. Although there have long been calls for computational methods for the prediction of moonlighting proteins, no method has been designed for determining moonlighting long noncoding RNAs (mlncRNAs). Previously, we developed an algorithm, MoonFinder, for the identification of mlncRNAs at the genome level based on the functional annotation and interactome data of lncRNAs and proteins. Here, we update MoonFinder to MoonFinder v2.0 by providing an extensive framework for the detection of protein modules and the establishment of RNA-module associations in humans. We also propose a novel measure, the moonlighting coefficient, to assess the confidence that an ncRNA acts in a moonlighting manner. Moreover, we explored the expression characteristics of mlncRNAs in sepsis and found that mlncRNAs tend to be upregulated and differentially expressed. Interestingly, the mlncRNAs are mutually exclusive in terms of coexpression when compared with the other lncRNAs. Overall, MoonFinder v2.0 is dedicated to the prediction of human mlncRNAs and thus bears great promise to serve as a valuable R package for research communities worldwide (https://cran.r-project.org/web/packages/MoonFinder/index.html). In addition, our analyses provide the first attempt to characterize mlncRNA expression and coexpression properties in adult sepsis patients, which will facilitate the understanding of the interaction and expression patterns of mlncRNAs.
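To make the idea of an RNA-module association concrete, the following is a minimal sketch, not the MoonFinder implementation. It assumes that an RNA is linked to a protein module when its interacting proteins are over-represented in that module under a one-sided hypergeometric test; the function names, the `alpha` cutoff, and the toy data are all hypothetical and are not taken from the R package.

```python
from math import comb


def hypergeom_pvalue(k, K, n, N):
    """Upper-tail hypergeometric probability P(X >= k).

    N: background proteins, K: proteins in the module,
    n: proteins interacting with the RNA, k: observed overlap.
    """
    upper = min(K, n)
    favourable = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return favourable / comb(N, n)


def associated_modules(rna_targets, modules, background, alpha=0.05):
    """Return {module name: p-value} for modules enriched in the RNA's targets.

    The enrichment test and the alpha threshold are illustrative assumptions.
    """
    background = set(background)
    targets = set(rna_targets) & background
    hits = {}
    for name, members in modules.items():
        members = set(members) & background
        overlap = len(targets & members)
        if overlap == 0:
            continue  # no shared proteins, nothing to test
        p = hypergeom_pvalue(overlap, len(members), len(targets), len(background))
        if p < alpha:
            hits[name] = p
    return hits


# Toy data (hypothetical): 50 background proteins and three small modules.
background = [f"P{i}" for i in range(50)]
modules = {
    "A": ["P0", "P1", "P2", "P3"],
    "B": ["P10", "P11", "P12", "P13"],
    "C": ["P20", "P21", "P22", "P23"],
}
rna_targets = ["P0", "P1", "P2", "P10", "P11", "P12"]
hits = associated_modules(rna_targets, modules, background)
```

Under this sketch, an RNA linked to two or more modules would only be a moonlighting candidate if those modules were also functionally unrelated; that second step (for example, a comparison of the modules' functional annotations) is deliberately left out here.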
