An integrative network-based approach to identify novel disease genes and pathways: a case study in the context of inflammatory bowel disease

BackgroundThere are different and complicated associations between genes and diseases. Finding the causal associations between genes and specific diseases is still challenging. In this work we present a method to predict novel associations of genes and pathways with inflammatory bowel disease (IBD) by integrating information of differential gene expression, protein-protein interaction and known disease genes related to IBD.ResultsWe downloaded IBD gene expression data from NCBI’s Gene Expression Omnibus, performed statistical analysis to determine differentially expressed genes, collected known IBD genes from DisGeNet database, which were used to construct a IBD related PPI network with HIPPIE database. We adapted our graph-based clustering algorithm DPClusO to cluster the disease PPI network. We evaluated the statistical significance of the identified clusters in the context of determining the richness of IBD genes using Fisher’s exact test and predicted novel genes related to IBD. We showed 93.8% of our predictions are correct in the context of other databases and published literatures related to IBD.ConclusionsFinding disease-causing genes is necessary for developing drugs with synergistic effect targeting many genes simultaneously. Here we present an approach to identify novel disease genes and pathways and discuss our approach in the context of IBD. The approach can be generalized to find disease-associated genes for other diseases.

[1]  C. Thompson,et al.  Tumor necrosis factor receptor-associated factors (TRAFs)--a family of adapter proteins that regulates life and death. , 1998, Genes & development.

[2]  R. Fisher On the Interpretation of χ 2 from Contingency Tables , and the Calculation of P Author , 2022 .

[3]  S. Dimauro,et al.  Polyglucosan body myopathy caused by defective ubiquitin ligase RBCK1 , 2013, Annals of neurology.

[4]  K. Jeang,et al.  Isolation of full-length cDNA and chromosomal localization of human NF-kappaB modulator NEMO to Xq28. , 1999, Journal of biomedical science.

[5]  A. Proudfoot,et al.  Chemokine receptors and their antagonists in allergic lung disease , 1999, Inflammation Research.

[6]  Jun-Ming Zhang,et al.  Cytokines, Inflammation, and Pain , 2007, International anesthesiology clinics.

[7]  A. Bonato,et al.  Dominating Biological Networks , 2011, PloS one.

[8]  R. Fisher On the Interpretation of χ2 from Contingency Tables, and the Calculation of P , 2018, Journal of the Royal Statistical Society Series A (Statistics in Society).

[9]  重彦 金谷,et al.  <ファクトデータベース・フリーウェア特集号> DPClus: タンパク質相互作用解析を主目的とする密度-周辺構造に基づいたグラフ・クラスター解析ソフトウエア , 2006 .

[10]  Pankaj Agarwal,et al.  A Pathway-Based View of Human Diseases and Disease Relationships , 2009, PloS one.

[11]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[12]  Eric P. Xing,et al.  A multivariate regression approach to association analysis of a quantitative trait network , 2008, Bioinform..

[13]  P. Delvenne,et al.  Altered expression of type I insulin‐like growth factor receptor in Crohn's disease , 2005, Clinical and experimental immunology.

[14]  Shigehiko Kanaya,et al.  DPClus : A Density-periphery Based Graph Clustering Software Mainly Focused on Detection of Protein Complexes in Interaction Networks , 2006 .

[15]  M. DePamphilis,et al.  HUMAN DISEASE , 1957, The Ulster Medical Journal.

[16]  Huarong Huang,et al.  CCR5 expression in inflammatory bowel disease and its correlation with inflammatory cells and β-arrestin2 expression , 2017, Scandinavian journal of gastroenterology.

[17]  M. Campbell,et al.  PANTHER: a library of protein families and subfamilies indexed by function. , 2003, Genome research.

[18]  Michael J. O'Donovan,et al.  The transcription factor IRF8 activates integrin-mediated TGF-β signaling and promotes neuroinflammation. , 2014, Immunity.

[19]  A. Barabasi,et al.  Uncovering disease-disease relationships through the incomplete interactome , 2015, Science.

[20]  O. Rath,et al.  MAP kinase signalling pathways in cancer , 2007, Oncogene.

[21]  Timothy L. Tickle,et al.  Pediatric Crohn disease patients exhibit specific ileal transcriptome and microbiome signature. , 2014, The Journal of clinical investigation.

[22]  Tariq Ahmad,et al.  Meta-analysis identifies 29 additional ulcerative colitis risk loci, increasing the number of confirmed associations to 47 , 2011, Nature Genetics.

[23]  B. Zupan,et al.  Discovering disease-disease associations by fusing systems-level molecular data , 2013, Scientific Reports.

[24]  Jing Chen,et al.  Improved human disease candidate gene prioritization using mouse phenotype , 2007, BMC Bioinformatics.

[25]  Krin A. Kay,et al.  The implications of human metabolic network topology for disease comorbidity , 2008, Proceedings of the National Academy of Sciences.

[26]  T. Ideker,et al.  Network-based classification of breast cancer metastasis , 2007, Molecular systems biology.

[27]  Marc-Thorsten Hütt,et al.  Distinct metabolic network states manifest in the gene expression profiles of pediatric inflammatory bowel disease patients and controls , 2016, Scientific Reports.

[28]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[29]  Minoru Kanehisa,et al.  KEGG: new perspectives on genomes, pathways, diseases and drugs , 2016, Nucleic Acids Res..

[30]  E. Ruppin,et al.  Global map of physical interactions among differentially expressed genes in multiple sclerosis relapses and remissions. , 2011, Human molecular genetics.

[31]  G. Kolios,et al.  Increased expression of chemokine receptor CCR3 and its ligands in ulcerative colitis: the role of colonic epithelial cells in in vitro studies , 2010, Clinical and experimental immunology.

[32]  M. Robinson,et al.  A scaling normalization method for differential expression analysis of RNA-seq data , 2010, Genome Biology.

[33]  E. Ruppin,et al.  Common and specific signatures of gene expression and protein–protein interactions in autoimmune diseases , 2012, Genes and Immunity.

[34]  Guanghui Hu,et al.  Human Disease-Drug Network Based on Genomic Expression Profiles , 2009, PloS one.

[35]  Núria Queralt-Rosinach,et al.  DisGeNET: a discovery platform for the dynamical exploration of human diseases and their genes , 2015, Database J. Biol. Databases Curation.

[36]  Minhu Chen,et al.  E3 Ubiquitin ligase RNF183 Is a Novel Regulator in Inflammatory Bowel Disease. , 2016, Journal of Crohn's & colitis.

[37]  I. C. Allen,et al.  The Goldilocks Conundrum: NLR Inflammasome Modulation of Gastrointestinal Inflammation during Inflammatory Bowel Disease. , 2016, Critical reviews in immunology.

[38]  David J. Porteous,et al.  SUSPECTS : enabling fast and effective prioritization of positional candidates , 2005 .

[39]  C. Klein,et al.  The diagnostic approach to monogenic very early onset inflammatory bowel disease. , 2014, Gastroenterology.

[40]  M. Stephens,et al.  RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. , 2008, Genome research.

[41]  H. Rosenberg,et al.  The Chemokine Macrophage-Inflammatory Protein-1α and Its Receptor CCR1 Control Pulmonary Inflammation and Antiviral Host Defense in Paramyxovirus Infection , 2000, The Journal of Immunology.

[42]  R. Medzhitov,et al.  Toll-like receptors and cancer , 2009, Nature Reviews Cancer.

[43]  Thomas C. Wiegers,et al.  The Comparative Toxicogenomics Database: update 2017 , 2016, Nucleic Acids Res..

[44]  Zhen Liu,et al.  Identifying disease associations via genome-wide association studies , 2009, BMC Bioinformatics.

[45]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[46]  Z. Ran,et al.  Up-regulation and Pre-activation of TRAF3 and TRAF5 in Inflammatory Bowel Disease , 2013, International journal of medical sciences.

[47]  G. Clayman,et al.  Chemokines in cancer , 2001, Expert Reviews in Molecular Medicine.

[48]  Natasa Przulj,et al.  The Core Diseasome. , 2012, Molecular bioSystems.

[49]  Deendayal Dinakarpandian,et al.  Finding disease similarity based on implicit semantic similarity , 2012, J. Biomed. Informatics.

[50]  Yi-Chien Lee,et al.  Temporal self-regulation of transposition through host-independent transposase rodlet formation , 2016, Nucleic acids research.

[51]  Alan R. Powell,et al.  Integration of text- and data-mining using ontologies successfully selects disease gene candidates , 2005, Nucleic acids research.

[52]  Yan Gu,et al.  Gene expression of tumor necrosis factor receptor associated‐factor (TRAF)‐1 and TRAF‐2 in inflammatory bowel disease , 2013, Journal of digestive diseases.

[53]  C. Monaco,et al.  Nuclear factor kappaB: a potential therapeutic target in atherosclerosis and thrombosis. , 2004, Cardiovascular research.

[54]  A. Barabasi,et al.  Network medicine : a network-based approach to human disease , 2010 .

[55]  K. Jeang,et al.  Isolation of Full-Length cDNA and Chromosomal Localization of Human NF-κB Modulator NEMO to Xq28 , 1999, Journal of Biomedical Science.

[56]  E. Letouzé,et al.  Analysis of the copy number profiles of several tumor samples from the same patient reveals the successive steps in tumorigenesis , 2010, Genome Biology.

[57]  Hailin Zhao,et al.  Role of necroptosis in the pathogenesis of solid organ injury , 2015, Cell Death and Disease.

[58]  T. Mohr,et al.  Proteomic profiling identifies markers for inflammation-related tumor–fibroblast interaction , 2017, Clinical Proteomics.

[59]  A. Barabasi,et al.  The human disease network , 2007, Proceedings of the National Academy of Sciences.

[60]  Thomas Lengauer,et al.  ROCR: visualizing classifier performance in R , 2005, Bioinform..

[61]  Judy H. Cho,et al.  Association analyses identify 38 susceptibility loci for inflammatory bowel disease and highlight shared genetic risk across populations , 2015, Nature Genetics.

[62]  T. Klaenhammer,et al.  Regulation of induced colonic inflammation by Lactobacillus acidophilus deficient in lipoteichoic acid , 2011, Proceedings of the National Academy of Sciences.

[63]  M. Kendall Statistical Methods for Research Workers , 1937, Nature.

[64]  Zhongming Zhao,et al.  Pathway-based analysis of GWAS datasets: effective but caution required. , 2011, The international journal of neuropsychopharmacology.

[65]  Bassem A. Hassan,et al.  Gene prioritization through genomic data fusion , 2006, Nature Biotechnology.

[66]  M. Karin,et al.  Expanding TRAF function: TRAF3 as a tri-faced immune regulator , 2011, Nature Reviews Immunology.

[67]  C. Metz Basic principles of ROC analysis. , 1978, Seminars in nuclear medicine.

[68]  Z. Ran,et al.  Intestinal protein expression profile identifies inflammatory bowel disease and predicts relapse. , 2013, International journal of clinical and experimental pathology.

[69]  S. Brand,et al.  Analysis of IL12B Gene Variants in Inflammatory Bowel Disease , 2012, PloS one.

[70]  Gary D. Bader,et al.  HyperModules: identifying clinically and phenotypically significant network modules with disease mutations for biomarker discovery , 2014, Bioinform..

[71]  Jing Chen,et al.  ToppGene Suite for gene list enrichment analysis and candidate gene prioritization , 2009, Nucleic Acids Res..

[72]  M. Mort,et al.  Exome Analysis of Rare and Common Variants within the NOD Signaling Pathway , 2017, Scientific Reports.

[73]  S. Linnarsson,et al.  NOD-like receptor signaling and inflammasome-related pathways are highlighted in psoriatic epidermis , 2016, Scientific Reports.

[74]  R. Sharan,et al.  Protein networks in disease. , 2008, Genome research.

[75]  S. Bhattacharyya,et al.  Platelet‐activating factor‐induced NF‐&kgr;B activation and IL‐8 production in intestinal epithelial cells are Bcl10‐dependent , 2010, Inflammatory bowel diseases.

[76]  Pall I. Olason,et al.  A human phenome-interactome network of protein complexes implicated in genetic disorders , 2007, Nature Biotechnology.

[77]  P. Jung,et al.  Epithelial IL-1R2 acts as a homeostatic regulator during remission of ulcerative colitis , 2015, Mucosal Immunology.

[78]  Altaf-Ul-Amin,et al.  Partitioning a PPI Network into Overlapping Modules Constrained by High-Density and Periphery Tracking , 2012 .

[79]  Lior Pachter,et al.  Sequence Analysis , 2020, Definitions.

[80]  M. Khoury,et al.  A navigator for human genome epidemiology , 2008, Nature Genetics.

[81]  Seiya Kato,et al.  IFNγ-dependent, spontaneous development of colorectal carcinomas in SOCS1-deficient mice , 2006, The Journal of experimental medicine.

[82]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[83]  Ponnuthurai N. Suganthan,et al.  Identification of structurally conserved residues of proteins in absence of structural homologs using neural network ensemble , 2008, Bioinform..

[84]  Martin H. Schaefer,et al.  HIPPIE: Integrating Protein Interaction Networks with Experiment Based Quality Scores , 2012, PloS one.

[85]  G. Rogler,et al.  Role of Protein Tyrosine Phosphatases in Regulating the Immune System: Implications for Chronic Intestinal Inflammation , 2015, Inflammatory bowel diseases.

[86]  David C. Wilson,et al.  Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease , 2012, Nature.

[87]  Tariq Ahmad,et al.  Genome-wide meta-analysis increases to 71 the number of confirmed Crohn's disease susceptibility loci , 2010, Nature Genetics.

[88]  Judy H. Cho,et al.  Genome-wide association defines more than 30 distinct susceptibility loci for Crohn's disease , 2008, Nature Genetics.

[89]  Natasa Przulj,et al.  Predicting disease associations via biological network analysis , 2014, BMC Bioinformatics.

[90]  Elaine Nsoesie,et al.  Prediction of Disease and Phenotype Associations from Genome-Wide Association Studies , 2011, PloS one.

[91]  P. Gray,et al.  CCR11 Is a Functional Receptor for the Monocyte Chemoattractant Protein Family of Chemokines* , 2000, The Journal of Biological Chemistry.

[92]  A. Cumano,et al.  Monitoring of Blood Vessels and Tissues by a Population of Monocytes with Patrolling Behavior , 2007, Science.

[93]  Joel Dudley,et al.  Network-Based Elucidation of Human Disease Similarities Reveals Common Functional Modules Enriched for Pluripotent Drug Targets , 2010, PLoS Comput. Biol..

[94]  Y. Benjamini,et al.  THE CONTROL OF THE FALSE DISCOVERY RATE IN MULTIPLE TESTING UNDER DEPENDENCY , 2001 .

[95]  Hongyu Zhao,et al.  Genotyping and inflated type I error rate in genome-wide association case/control studies , 2009, BMC Bioinformatics.