m7GDisAI: N7-methylguanosine (m7G) sites and diseases associations inference based on heterogeneous network

Background Recent studies have confirmed that N7-methylguanosine (m 7 G) modification plays an important role in regulating various biological processes and has associations with multiple diseases. Wet-lab experiments are cost and time ineffective for the identification of disease-associated m 7 G sites. To date, tens of thousands of m 7 G sites have been identified by high-throughput sequencing approaches and the information is publicly available in bioinformatics databases, which can be leveraged to predict potential disease-associated m 7 G sites using a computational perspective. Thus, computational methods for m 7 G-disease association prediction are urgently needed, but none are currently available at present. Results To fill this gap, we collected association information between m 7 G sites and diseases, genomic information of m 7 G sites, and phenotypic information of diseases from different databases to build an m 7 G-disease association dataset. To infer potential disease-associated m 7 G sites, we then proposed a heterogeneous network-based model, m 7 G Sites and Diseases Associations Inference (m 7 GDisAI) model. m 7 GDisAI predicts the potential disease-associated m 7 G sites by applying a matrix decomposition method on heterogeneous networks which integrate comprehensive similarity information of m 7 G sites and diseases. To evaluate the prediction performance, 10 runs of tenfold cross validation were first conducted, and m 7 GDisAI got the highest AUC of 0.740(± 0.0024). Then global and local leave-one-out cross validation (LOOCV) experiments were implemented to evaluate the model’s accuracy in global and local situations respectively. AUC of 0.769 was achieved in global LOOCV, while 0.635 in local LOOCV. A case study was finally conducted to identify the most promising ovarian cancer-related m 7 G sites for further functional analysis. Gene Ontology (GO) enrichment analysis was performed to explore the complex associations between host gene of m 7 G sites and GO terms. The results showed that m 7 GDisAI identified disease-associated m 7 G sites and their host genes are consistently related to the pathogenesis of ovarian cancer, which may provide some clues for pathogenesis of diseases. Conclusion The m 7 GDisAI web server can be accessed at http://180.208.58.66/m7GDisAI/ , which provides a user-friendly interface to query disease associated m 7 G. The list of top 20 m 7 G sites predicted to be associted with 177 diseases can be achieved. Furthermore, detailed information about specific m 7 G sites and diseases are also shown.

[1]  Dong-Hyun Kim,et al.  Activation of PI3K/Akt pathway by PTEN reduction and PIK3CA mRNA amplification contributes to cisplatin resistance in an ovarian cancer cell line. , 2005, Gynecologic oncology.

[2]  Deendayal Dinakarpandian,et al.  Finding disease similarity based on implicit semantic similarity , 2012, J. Biomed. Informatics.

[3]  Arash Salmaninejad,et al.  Current insights into the metastasis of epithelial ovarian cancer - hopes and hurdles , 2020, Cellular Oncology.

[4]  By Yasuhiro,et al.  Discovery of m 7 Gcap in eukaryotic mRNAs , 2015 .

[5]  Fowzan S. Alkuraya,et al.  Mutation in WDR4 impairs tRNA m7G46 methylation and causes a distinct form of microcephalic primordial dwarfism , 2015, Genome Biology.

[6]  A. J. Shatkin,et al.  5′-Terminal 7-methylguanosine in eukaryotic mRNA is required for translation , 1975, Nature.

[7]  Lulu Hu,et al.  Transcriptome-wide Mapping of Internal N7-Methylguanosine Methylome in Mammalian mRNA. , 2019, Molecular cell.

[8]  E. Phizicky,et al.  Two-subunit enzymes involved in eukaryotic post-transcriptional tRNA modification , 2014, RNA biology.

[9]  K. Entian,et al.  Tuning the ribosome: The influence of rRNA modification on eukaryotic ribosome biogenesis and function , 2016, RNA biology.

[10]  Samie R Jaffrey,et al.  An expanding universe of mRNA modifications , 2014, Nature Structural &Molecular Biology.

[11]  Junwei Shi,et al.  Promoter-bound METTL3 maintains myeloid leukaemia by m6A-dependent translation control , 2017, Nature.

[12]  Prateek Jain,et al.  Low-rank matrix completion using alternating minimization , 2012, STOC '13.

[13]  Y. Furuichi,et al.  Discovery of m7G-cap in eukaryotic mRNAs , 2015, Proceedings of the Japan Academy. Series B, Physical and biological sciences.

[14]  Jiajie Peng,et al.  SemFunSim: A New Method for Measuring Disease Similarity by Integrating Semantic and Gene Functional Association , 2014, PloS one.

[15]  Dave Zachariah,et al.  Alternating Least-Squares for Low-Rank Matrix Reconstruction , 2012, IEEE Signal Processing Letters.

[16]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[17]  R. Berkowitz,et al.  Bcl-2 and p53 protein expression, apoptosis, and p53 mutation in human epithelial ovarian cancers. , 2000, The American journal of pathology.

[18]  J. Engelman,et al.  ERBB receptors: from oncogene discovery to basic science to mechanism-based cancer therapeutics. , 2014, Cancer cell.

[19]  K. Shimotohno,et al.  Importance of 5'-terminal blocking structure to stabilize mRNA in eukaryotic protein synthesis. , 1977, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Victor S Lelyveld,et al.  Mettl1/Wdr4-Mediated m7G tRNA Methylome Is Required for Normal mRNA Translation and Embryonic Stem Cell Self-Renewal and Differentiation. , 2018, Molecular cell.

[21]  Jia Meng,et al.  m7GHub: deciphering the location, regulation and pathogenesis of internal mRNA N7-methylguanosine (m7G) sites in human , 2020, Bioinform..

[22]  V. Cowling,et al.  Regulation of mRNA cap methylation , 2009, The Biochemical journal.

[23]  Hilde van der Togt,et al.  Publisher's Note , 2003, J. Netw. Comput. Appl..

[24]  M. Koike,et al.  Germline mutation of MSH6 as the cause of hereditary nonpolyposis colorectal cancer , 1997, Nature Genetics.

[25]  Jionglong Su,et al.  WHISTLE: a high-accuracy map of the human N6-methyladenosine (m6A) epitranscriptome predicted using a machine learning approach , 2019, Nucleic acids research.

[26]  D. Alberts,et al.  Analysis of MLH1 and MSH2 expression in ovarian cancer before and after platinum drug-based chemotherapy. , 2000, Clinical cancer research : an official journal of the American Association for Cancer Research.

[27]  Philip S. Yu,et al.  A new method to measure the semantic similarity of GO terms , 2007, Bioinform..

[28]  Joe W. Gray,et al.  PIK3CA is implicated as an oncogene in ovarian cancer , 1999, Nature Genetics.

[29]  R. Wood,et al.  Dual role for mammalian DNA polymerase ζ in maintaining genome stability and proliferative responses , 2013, Proceedings of the National Academy of Sciences.

[30]  P. Moore,et al.  The crystal structure of yeast phenylalanine tRNA at 1.93 A resolution: a classic structure revisited. , 2000, RNA.

[31]  M. Olivier,et al.  Impact of mutant p53 functional properties on TP53 mutation patterns and tumor phenotype: lessons from recent developments in the IARC TP53 database , 2007, Human mutation.

[32]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[33]  Samie R Jaffrey,et al.  Reading, writing and erasing mRNA methylation , 2019, Nature Reviews Molecular Cell Biology.

[34]  R. Roskoski The ErbB/HER family of protein-tyrosine kinases and cancer. , 2014, Pharmacological research.

[35]  J. Coward,et al.  A systematic literature review assessing if genetic biomarkers are predictors for platinum-based chemotherapy response in ovarian cancer patients , 2020, European Journal of Clinical Pharmacology.

[36]  Yun-Gui Yang,et al.  Dynamic methylome of internal mRNA N7-methylguanosine and its regulatory role in translation , 2019, Cell Research.

[37]  Galit Lahav,et al.  Stimulus-dependent dynamics of p53 in single cells , 2011, Molecular systems biology.

[38]  H. Grönberg,et al.  Two Swedish founder MSH6 mutations, one nonsense and one missense, conferring high cumulative risk of Lynch syndrome , 2005, Clinical genetics.

[39]  D. Stern,et al.  Specificity within the EGF family/ErbB receptor family signaling network , 1998, BioEssays : news and reviews in molecular, cellular and developmental biology.

[40]  E. Izaurralde,et al.  The role of the cap structure in RNA processing and nuclear export. , 1997, European journal of biochemistry.

[41]  D. Ward,et al.  Mutation in the DNA mismatch repair gene homologue hMLH 1 is associated with hereditary non-polyposis colon cancer , 1994, Nature.

[42]  Yadong Wang,et al.  DisSetSim: an online system for calculating similarity between disease sets , 2016, Journal of Biomedical Semantics.

[43]  C. Che,et al.  Gold(III) complexes inhibit growth of cisplatin-resistant ovarian cancer in association with upregulation of proapoptotic PMS2 gene , 2014 .

[44]  I. Vlodavsky,et al.  Expression of heparanase, Mdm2, and erbB2 in ovarian cancer. , 2001, International journal of oncology.

[45]  J. Armstrong,et al.  The effect of capping and polyadenylation on the stability, movement and translation of synthetic messenger RNAs in Xenopus oocytes. , 1985, Nucleic acids research.

[46]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[47]  Anna Tramontano,et al.  Accurate energies of hydrogen bonded nucleic acid base pairs and triplets in tRNA tertiary interactions , 2006, Nucleic acids research.

[48]  P. Sharp,et al.  Recognition of cap structure in splicing in vitro of mRNA precursors , 1984, Cell.

[49]  Y. Hérault,et al.  A new mouse model for the trisomy of the Abcg1–U2af1 region reveals the complexity of the combinatorial genetic code of down syndrome , 2009, Human molecular genetics.

[50]  Q. Cui,et al.  SRAMP: prediction of mammalian N6-methyladenosine (m6A) sites based on sequence-derived features , 2016, Nucleic acids research.

[51]  T. Sellers,et al.  A review of the clinical relevance of mismatch‐repair deficiency in ovarian cancer , 2008, Cancer.

[52]  C. Rooryck,et al.  Further delineation of the phenotype caused by biallelic variants in the WDR4 gene , 2018, Clinical genetics.

[53]  Jeremy E. Purvis,et al.  p53 Dynamics Control Cell Fate , 2012, Science.

[54]  J. Knezetic,et al.  Microsatellite instability and expression of MLH1 and MSH2 in normal and malignant endometrial and ovarian epithelium in hereditary nonpolyposis colorectal cancer family members. , 1999, Cancer genetics and cytogenetics.

[55]  Stewart Shuman,et al.  Interactions between Fission Yeast mRNA Capping Enzymes and Elongation Factor Spt5* , 2002, The Journal of Biological Chemistry.