DiSNEP: a Disease-Specific gene Network Enhancement to improve Prioritizing candidate disease genes

Biological network-based strategies are useful in prioritizing genes associated with diseases. Several comprehensive human gene networks such as STRING, GIANT and HumanNet were developed and used in network-assisted algorithms to identify disease-associated genes. However, none of these networks are disease-specific and may not accurately reflect gene interactions for a specific disease. Aiming to improve disease gene prioritization using networks, we propose a Disease-Specific Network Enhancement Prioritization (DiSNEP) framework. DiSNEP first enhances a comprehensive gene network specifically for a disease through a diffusion process on a gene-gene similarity matrix derived from disease omics data. The enhanced disease-specific gene network thus better reflects true gene interactions for the disease and may improve prioritizing disease-associated genes subsequently. In simulations, DiSNEP that uses an enhanced disease-specific network prioritizes more true signal genes than comparison methods using a general gene network or without prioritization. Applications to prioritize cancer-associated gene expression and DNA methylation signal genes for five cancer types from The Cancer Genome Atlas (TCGA) project suggest that more prioritized candidate genes by DiSNEP are cancer-related according to the DisGeNET database than those prioritized by the comparison methods, consistently across all five cancer types considered, and for both gene expression and DNA methylation signal genes.

[1]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[2]  Russ B. Altman,et al.  Missing value estimation methods for DNA microarrays , 2001, Bioinform..

[3]  Damian Szklarczyk,et al.  STRING v11: protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets , 2018, Nucleic Acids Res..

[4]  P. Robinson,et al.  Walking the interactome for prioritization of candidate disease genes. , 2008, American journal of human genetics.

[5]  J. Franklyn,et al.  Expression of fibroblast growth factors in thyroid cancer. , 1995, The Journal of clinical endocrinology and metabolism.

[6]  T. Kipps,et al.  ATM Mutations in Cancer: Therapeutic Implications , 2016, Molecular Cancer Therapeutics.

[7]  Wei Zheng,et al.  dmGWAS: dense module searching for genome-wide association studies in protein-protein interaction networks , 2011, Bioinform..

[8]  Benno Schwikowski,et al.  Network-based analysis of omics data: the LEAN method , 2016, Bioinform..

[9]  Michelle Giglio,et al.  Human Disease Ontology 2018 update: classification, content and workflow expansion , 2018, Nucleic Acids Res..

[10]  M. Probst-Kepper,et al.  CXCR4/CXCL12 expression and signalling in kidney cancer , 2002, British Journal of Cancer.

[11]  L. Milanesi,et al.  Network diffusion-based analysis of high-throughput data for the detection of differentially enriched modules , 2016, Scientific Reports.

[12]  Huiling Li,et al.  Upregulation of Tyrosine Kinase FYN in Human Thyroid Carcinoma: Role in Modulating Tumor Cell Proliferation, Invasion, and Migration. , 2017, Cancer biotherapy & radiopharmaceuticals.

[13]  M. Xing,et al.  Clinical utility of RAS mutations in thyroid cancer: a blurred picture now emerging clearer , 2016, BMC Medicine.

[14]  F. Sanz,et al.  The DisGeNET knowledge platform for disease genomics: 2019 update , 2019, Nucleic Acids Res..

[15]  R. Koenig,et al.  Pax-8–PPAR-γ fusion protein in thyroid carcinoma , 2014, Nature Reviews Endocrinology.

[16]  Ralf Herwig,et al.  Analyzing and interpreting genome data at the network level with ConsensusPathDB , 2016, Nature Protocols.

[17]  G. Mills,et al.  The PI3K/AKT Pathway and Renal Cell Carcinoma. , 2015, Journal of genetics and genomics = Yi chuan xue bao.

[18]  Y. Moreau,et al.  Computational tools for prioritizing candidate genes: boosting disease gene discovery , 2012, Nature Reviews Genetics.

[19]  Erik L. L. Sonnhammer,et al.  MaxLink: network-based prioritization of genes tightly linked to a disease seed set , 2014, Bioinform..

[20]  Ruth Pidsley,et al.  A data-driven approach to preprocessing Illumina 450K methylation array data , 2013, BMC Genomics.

[21]  Andrew M. Gross,et al.  Network-based stratification of tumor mutations , 2013, Nature Methods.

[22]  F. Rojo,et al.  Active angiogenesis in metastatic renal cell carcinoma predicts clinical benefit to sunitinib-based therapy , 2014, British Journal of Cancer.

[23]  Xin Ma,et al.  Prognostic value of CD44 expression in renal cell carcinoma: a systematic review and meta-analysis , 2015, Scientific Reports.

[24]  Fei Wang,et al.  The expression of Cullin1 is increased in renal cell carcinoma and promotes cancer cell proliferation, migration, and invasion , 2016, Tumor Biology.

[25]  Andrew E. Teschendorff,et al.  Role of DNA Methylation and Epigenetic Silencing of HAND2 in Endometrial Cancer Development , 2013, PLoS medicine.

[26]  Sunmo Yang,et al.  HumanNet v2: human gene networks for disease research , 2018, Nucleic Acids Res..

[27]  R. Myers,et al.  Candidate-gene approaches for studying complex genetic traits: practical considerations , 2002, Nature Reviews Genetics.

[28]  F. Jasmine,et al.  Exploring genome-wide DNA methylation profiles altered in hepatocellular carcinoma using Infinium HumanMethylation 450 BeadChips , 2013, Epigenetics.

[29]  Jon C. Aster,et al.  Network analysis of gene essentiality in functional genomics experiments , 2015, Genome Biology.

[30]  E. Marcotte,et al.  Prioritizing candidate disease genes by network-based boosting of genome-wide association data. , 2011, Genome research.

[31]  Wei Zhang,et al.  Systematic Evaluation of Molecular Networks for Discovery of Disease Genes. , 2018, Cell systems.

[32]  David W. Johnson,et al.  The emerging role of nuclear factor kappa B in renal cell carcinoma. , 2011, The international journal of biochemistry & cell biology.

[33]  C. Cole,et al.  The COSMIC Cancer Gene Census: describing genetic dysfunction across all human cancers , 2018, Nature Reviews Cancer.

[34]  Jung Hun Song,et al.  Regulation of signal transducer and activator of transcription 1 (STAT1) and STAT1-dependent genes by RET/PTC (rearranged in transformation/papillary thyroid carcinoma) oncogenic tyrosine kinases. , 2004, Molecular endocrinology.

[35]  Shuigeng Zhou,et al.  NEpiC: a network-assisted algorithm for epigenetic studies using mean and variance combined signals , 2016, Nucleic acids research.

[36]  Benjamin J. Raphael,et al.  Pan-Cancer Network Analysis Identifies Combinations of Rare Somatic Mutations across Pathways and Protein Complexes , 2014, Nature Genetics.

[37]  M. Boyd,et al.  p53 and MDM2 in renal cell carcinoma , 2010, Cancer.

[38]  M. Santoro,et al.  Mutation of the PIK3CA gene in anaplastic thyroid cancer. , 2005, Cancer research.

[39]  Daniel S. Himmelstein,et al.  Understanding multicellular function and disease with human tissue-specific networks , 2015, Nature Genetics.

[40]  Xiaoyan Zhou,et al.  Expression and function of CXCL12/CXCR4/CXCR7 in thyroid cancer , 2016, International journal of oncology.

[41]  The role of matrix metalloproteinase-9 as a prognostic biomarker in papillary thyroid cancer , 2018, BMC cancer.

[42]  Hyunjung Shin,et al.  Disease gene identification based on generic and disease-specific genome networks , 2018, Bioinform..

[43]  Meng Zhang,et al.  PIK3R1 negatively regulates the epithelial-mesenchymal transition and stem-like phenotype of renal cancer cells through the AKT/GSK3β/CTNNB1 signaling pathway , 2015, Scientific Reports.

[44]  Y. Okada,et al.  Expression of TNF‐α and CD44 is implicated in poor prognosis, cancer cell invasion, metastasis and resistance to the sunitinib treatment in clear cell renal cell carcinomas , 2015, International journal of cancer.

[45]  Zhe Zhang,et al.  Polo-like kinase 1 is overexpressed in renal cancer and participates in the proliferation and invasion of renal cancer cells , 2013, Tumor Biology.

[46]  Cheng Li,et al.  Adjusting batch effects in microarray expression data using empirical Bayes methods. , 2007, Biostatistics.