A systems genomics approach to uncover patient-specific pathogenic pathways and proteins in a complex disease

We describe a novel precision medicine workflow, the integrated single nucleotide polymorphism network platform (iSNP), designed to identify the exact mechanisms of how SNPs affect cellular regulatory networks, and how SNP co-occurrences contribute to disease pathogenesis in ulcerative colitis (UC). Using SNP profiles of 377 UC patients, we mapped the regulatory effects of the SNPs to a human signalling network containing protein-protein, miRNA-mRNA and transcription factor binding interactions. Unsupervised clustering algorithms grouped these patient-specific networks into four distinct clusters based on two large disease hubs, NFKB1 and PKCB. Pathway analysis identified the epigenetic modification as common and the T-cell specific responses as differing signalling pathways in the clusters. By integrating individual transcriptomes in active and quiescent disease setting to the patient networks, we validated the impact of non-coding SNPs. The iSNP approach identified regulatory effects of disease-associated non-coding SNPs, and identified how pathogenesis pathways are activated via different genetic modifications.

[1]  J. Long,et al.  MicroRNA-34b inhibits pancreatic cancer metastasis through repressing Smad3. , 2013, Current molecular medicine.

[2]  Rafael A. Irizarry,et al.  A framework for oligonucleotide microarray preprocessing , 2010, Bioinform..

[3]  Malay Mandal,et al.  Targeting the NF-kappaB signaling pathway in Notch1-induced T-cell leukemia. , 2007, Nature medicine.

[4]  E. Kubista,et al.  HER2 overexpression and activation, and tamoxifen efficacy in receptor-positive early breast cancer , 2009, Journal of Cancer Research and Clinical Oncology.

[5]  Hailiang Huang,et al.  Fine-mapping inflammatory bowel disease loci to single variant resolution , 2017, Nature.

[6]  R. Sartor Mechanisms of Disease: pathogenesis of Crohn's disease and ulcerative colitis , 2006, Nature Clinical Practice Gastroenterology &Hepatology.

[7]  Wesley De Neve,et al.  SpliceRover: interpretable convolutional neural networks for improved splice site prediction , 2018, Bioinform..

[8]  J. Caamaño,et al.  NF‐κB1, NF‐κB2 and c‐Rel differentially regulate susceptibility to colitis‐associated adenoma development in C57BL/6 mice , 2015, The Journal of pathology.

[9]  Paul Shannon,et al.  CyREST: Turbocharging Cytoscape Access for External Tools via a RESTful API , 2015, F1000Research.

[10]  Elizabeth M. Smigielski,et al.  dbSNP: the NCBI database of genetic variation , 2001, Nucleic Acids Res..

[11]  Yang I Li,et al.  An Expanded View of Complex Traits: From Polygenic to Omnigenic , 2017, Cell.

[12]  G. Sonenshein,et al.  Notch1 augments NF‐κB activity by facilitating its nuclear retention , 2006 .

[13]  J. Jongstra,et al.  Leukocyte-specific protein 1 (LSP1) , 2006, Immunologic research.

[14]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[15]  Li Li,et al.  Deep Patient: An Unsupervised Representation to Predict the Future of Patients from the Electronic Health Records , 2016, Scientific Reports.

[16]  M. Neurath,et al.  NF‐κB in inflammatory bowel disease , 2008, Journal of internal medicine.

[17]  Fiona Powrie,et al.  Intestinal homeostasis and its breakdown in inflammatory bowel disease , 2011, Nature.

[18]  Malay Mandal,et al.  Targeting the NF-κB signaling pathway in Notch1-induced T-cell leukemia , 2007, Nature Medicine.

[19]  Fangqing Zhao,et al.  Prioritization and functional assessment of noncoding variants associated with complex diseases , 2018, Genome Medicine.

[20]  Qing Zhang,et al.  A heuristic model for computational prediction of human branch point sequence , 2017, BMC Bioinformatics.

[21]  Jung Eun Shim,et al.  TRRUST: a reference database of human transcriptional regulatory interactions , 2015, Scientific Reports.

[22]  Ying Li,et al.  RPITER: A Hierarchical Deep Learning Framework for ncRNA–Protein Interaction Prediction , 2019, International journal of molecular sciences.

[23]  Sha He,et al.  Pipelines for cross-species and genome-wide prediction of long noncoding RNA binding , 2019, Nature Protocols.

[24]  F. Noubissi,et al.  CRD-BP protects the coding region of betaTrCP1 mRNA from miR-183-mediated degradation. , 2009, Molecular cell.

[25]  C. Peschel,et al.  Recruitment of PKC-βII to lipid rafts mediates apoptosis-resistance in chronic lymphocytic leukemia expressing ZAP-70 , 2010, Leukemia.

[26]  S Askari,et al.  A novel role for the beta 2 integrin CD11b/CD18 in neutrophil apoptosis: a homeostatic mechanism in inflammation. , 1996, Immunity.

[27]  Wang Fei,et al.  Measuring Patient Similarities via a Deep Architecture with Medical Concept Embedding , 2016 .

[28]  Z. Šidák Rectangular Confidence Regions for the Means of Multivariate Normal Distributions , 1967 .

[29]  M. Hentze,et al.  Mechanism of translational regulation by miR-2 from sites in the 5' untranslated region or the open reading frame. , 2010, RNA.

[30]  Reuven Agami,et al.  miR-148 targets human DNMT3b protein coding region. , 2008, RNA.

[31]  B. Heit,et al.  Endothelial LSP1 Modulates Extravascular Neutrophil Chemotaxis by Regulating Nonhematopoietic Vascular PECAM-1 Expression , 2015, The Journal of Immunology.

[32]  C. Peschel,et al.  Recruitment of PKC-Beta to Lipid Rafts Mediates Apoptosis-Resistance in Chronic Lymphocytic Leukemia Expressing Zap-70. , 2009 .

[33]  Wei Zhang,et al.  Systematic Evaluation of Molecular Networks for Discovery of Disease Genes. , 2018, Cell systems.

[34]  Anton J. Enright,et al.  MicroRNA targets in Drosophila , 2003, Genome Biology.

[35]  T. Pawson,et al.  Leukocyte‐specific protein 1 links TNF receptor‐associated factor 1 to survival signaling downstream of 4‐1BB in T cells , 2013, Journal of leukocyte biology.

[36]  Judy H. Cho,et al.  Functional annotation of a novel NFKB1 promoter polymorphism that increases risk for ulcerative colitis. , 2004, Human molecular genetics.

[37]  Yao Yao,et al.  Monogenic diseases in respiratory medicine: Clinical perspectives , 2017, Pediatric investigation.

[38]  J. Hampe,et al.  Activation of nuclear factor κB in inflammatory bowel disease , 1998, Gut.

[39]  Patrick Flick,et al.  GOATOOLS: A Python library for Gene Ontology analyses , 2018, Scientific Reports.

[40]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[41]  I. C. Allen,et al.  Emerging Roles for Noncanonical NF-&kgr;B Signaling in the Modulation of Inflammatory Bowel Disease Pathobiology , 2016, Inflammatory Bowel Diseases.

[42]  Prabina Kumar Meher,et al.  A statistical approach for 5′ splice site prediction using short sequence motifs and without encoding sequence data , 2014, BMC Bioinformatics.

[43]  M. Cybulsky,et al.  Modulation of Mac-1 (CD11b/CD18)-Mediated Adhesion by the Leukocyte-Specific Protein 1 Is Key to Its Role in Neutrophil Polarization and Chemotaxis1 , 2002, The Journal of Immunology.

[44]  Lars Schmidt-Thieme,et al.  Data Analysis, Machine Learning and Applications - Proceedings of the 31st Annual Conference of the Gesellschaft für Klassifikation e.V., Albert-Ludwigs-Universität Freiburg, March 7-9, 2007 , 2008, GfKl.

[45]  N. Wray,et al.  Underestimated Effect Sizes in GWAS: Fundamental Limitations of Single SNP Analysis for Dichotomous Phenotypes , 2011, PloS one.

[46]  Babajan Banaganapalli,et al.  In-Silico Analysis of Inflammatory Bowel Disease (IBD) GWAS Loci to Novel Connections , 2015, PloS one.

[47]  R. Donn,et al.  Network analysis and juvenile idiopathic arthritis (JIA): a new horizon for the understanding of disease pathogenesis and therapeutic target identification , 2016, Pediatric Rheumatology.

[48]  Shraddha Pai,et al.  netDx: interpretable patient classification using integrated patient similarity networks , 2019, Molecular systems biology.

[49]  Preeti Kumari,et al.  ERROR DETECTING AND ERROR CORRECTING CODES , 2014 .

[50]  J. Varkey,et al.  Construction of Parkinson's disease marker-based weighted protein-protein interaction network for prioritization of co-expressed genes. , 2019, Gene.

[51]  Fei Guo,et al.  Multivariate Information Fusion With Fast Kernel Learning to Kernel Ridge Regression in Predicting LncRNA-Protein Interactions , 2019, Front. Genet..

[52]  Shaokang Li,et al.  Pathway-based gene-gene interaction network modelling to predict potential biomarkers of essential hypertension , 2018, Biosyst..

[53]  Gary D. Bader,et al.  clusterMaker: a multi-algorithm clustering plugin for Cytoscape , 2011, BMC Bioinformatics.

[54]  Yang Xu,et al.  PyPathway: Python Package for Biological Network Analysis and Visualization , 2018, J. Comput. Biol..

[55]  Qian Zhou,et al.  Computational analyses of obesity associated loci generated by genome-wide association studies , 2018, PloS one.

[56]  A. Barabasi,et al.  The human disease network , 2007, Proceedings of the National Academy of Sciences.

[57]  M. Downes,et al.  A Dynamic Role for HDAC7 in MEF2-mediated Muscle Differentiation* , 2001, The Journal of Biological Chemistry.

[58]  G. Sonenshein,et al.  Notch1 augments NF-kappaB activity by facilitating its nuclear retention. , 2006, The EMBO journal.

[59]  Bart De Moor,et al.  BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis , 2005, Bioinform..

[60]  J. Buettner,et al.  Genes involved in the regulation of intestinal permeability and their role in ulcerative colitis , 2015, Journal of digestive diseases.

[61]  Ana Kozomara,et al.  miRBase: integrating microRNA annotation and deep-sequencing data , 2010, Nucleic Acids Res..

[62]  Gary D Bader,et al.  Biological Network Exploration with Cytoscape 3 , 2014, Current protocols in bioinformatics.

[63]  J. Schauber,et al.  Influence of protein kinase C on transcription of the tight junction elements ZO‐1 and occludin , 2005, Journal of cellular physiology.

[64]  M. Osanai,et al.  Protein Kinase C Enhances Tight Junction Barrier Function of Human Nasal Epithelial Cells in Primary Culture by Transcriptional Regulation , 2008, Molecular Pharmacology.

[65]  T. Golde,et al.  Off the beaten pathway: the complex cross talk between Notch and NF-kappaB. , 2008, Laboratory investigation; a journal of technical methods and pathology.

[66]  D. Haber,et al.  DNA Methyltransferases Dnmt3a and Dnmt3b Are Essential for De Novo Methylation and Mammalian Development , 1999, Cell.

[67]  Shane J. Neph,et al.  Systematic Localization of Common Disease-Associated Variation in Regulatory DNA , 2012, Science.

[68]  R. Arasaradnam,et al.  A systematic review of the role of DNA methylation on inflammatory genes in ulcerative colitis. , 2016, Epigenomics.

[69]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[70]  David J. Arenillas,et al.  JASPAR 2016: a major expansion and update of the open-access database of transcription factor binding profiles , 2015, Nucleic Acids Res..

[71]  T. Golde,et al.  Off the beaten pathway: the complex cross talk between Notch and NF-κB , 2008, Laboratory Investigation.

[72]  J. Helden,et al.  Using RSAT to scan genome sequences for transcription factor binding sites and cis-regulatory modules , 2008, Nature Protocols.

[73]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[74]  S. Bray Notch signalling: a simple pathway becomes complex , 2006, Nature Reviews Molecular Cell Biology.

[75]  T. Speed,et al.  Summaries of Affymetrix GeneChip probe level data. , 2003, Nucleic acids research.

[76]  D. Iliopoulos,et al.  The IBD interactome: an integrated view of aetiology, pathogenesis and therapy , 2017, Nature Reviews Gastroenterology & Hepatology.

[77]  Ting-Ting Zeng,et al.  Association between DNA Methyltransferases 3B Gene Polymorphisms and the Susceptibility to Acute Myeloid Leukemia in Chinese Han Population , 2013, PloS one.

[78]  K. Hao,et al.  A functional genomics predictive network model identifies regulators of inflammatory bowel disease , 2017, Nature Genetics.

[79]  Thomas Lengauer,et al.  Dissection of the Inflammatory Bowel Disease Transcriptome Using Genome-Wide cDNA Microarrays , 2005, PLoS medicine.

[80]  S. Heath,et al.  Integrative epigenome-wide analysis demonstrates that DNA methylation may mediate genetic risk in inflammatory bowel disease , 2016, Nature communications.

[81]  G. Feng,et al.  Polymorphisms in microRNA target sites influence susceptibility to schizophrenia by altering the binding of miRNAs to their targets , 2013, European Neuropsychopharmacology.

[82]  J. Sutcliffe,et al.  A Bayesian framework that integrates multi-omics data and gene networks predicts risk genes from schizophrenia GWAS data , 2019, Nature Neuroscience.

[83]  A. Barabasi,et al.  A disease module in the interactome explains disease heterogeneity, drug response and captures novel pathways and genes in asthma. , 2015, Human molecular genetics.

[84]  Julio Saez-Rodriguez,et al.  OmniPath: guidelines and gateway for literature-curated signaling pathway resources , 2016, Nature Methods.

[85]  David C. Wilson,et al.  Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease , 2012, Nature.

[86]  J. Kruskal Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis , 1964 .

[87]  E. Birney,et al.  Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt , 2009, Nature Protocols.

[88]  Shraddha Pai,et al.  Patient Similarity Networks for Precision Medicine. , 2018, Journal of molecular biology.

[89]  Sankar Ghosh,et al.  Signaling to NF-kappaB. , 2004, Genes & development.

[90]  M. Daly,et al.  Genetic and Epigenetic Fine-Mapping of Causal Autoimmune Disease Variants , 2014, Nature.

[91]  J. Payton,et al.  PLAIDOH: a novel method for functional prediction of long non-coding RNAs identifies cancer-specific LncRNA activities , 2019, BMC Genomics.

[92]  Ivan V. Kulakovskiy,et al.  Multiple single nucleotide polymorphisms in the first intron of the IL2RA gene affect transcription factor binding and enhancer activity. , 2017, Gene.

[93]  Allan Kuchinsky,et al.  GLay: community structure analysis of biological networks , 2010, Bioinform..

[94]  Yang Du,et al.  rSNPBase: a database for curated regulatory SNPs , 2013, Nucleic Acids Res..

[95]  Thorsten Meinl,et al.  KNIME: The Konstanz Information Miner , 2007, GfKl.

[96]  William S. Bush,et al.  Large-scale association analysis identifies new lung cancer susceptibility loci and heterogeneity in genetic susceptibility across histological subtypes , 2017, Nature Genetics.

[97]  R. Bamezai,et al.  Understanding Genetic Heterogeneity in Type 2 Diabetes by Delineating Physiological Phenotypes: SIRT1 and its Gene Network in Impaired Insulin Secretion. , 2016, The review of diabetic studies : RDS.

[98]  Li-Na Wei,et al.  MicroRNA mir-346 targets the 5'-untranslated region of receptor-interacting protein 140 (RIP140) mRNA and up-regulates its protein expression. , 2009, The Biochemical journal.

[99]  Fei Wang,et al.  Measuring Patient Similarities via a Deep Architecture with Medical Concept Embedding , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[100]  Annick Harel-Bellan,et al.  A synonymous variant in IRGM alters a binding site for miR-196 and causes deregulation of IRGM-dependent xenophagy in Crohn's disease , 2011, Nature Genetics.

[101]  David C. Wilson,et al.  Genome-wide association study implicates immune activation of multiple integrin genes in inflammatory bowel disease , 2016, Nature Genetics.

[102]  J. Shelhamer,et al.  Platelet-activating factor induces airway mucin release via activation of protein kinase C: evidence for translocation of protein kinase C to membranes. , 1994, American journal of respiratory cell and molecular biology.

[103]  Andreas Bender,et al.  Neighbours of cancer-related proteins have key influence on pathogenesis and could increase the drug target space for anticancer therapies , 2017, npj Systems Biology and Applications.

[104]  Richard W. Hamming,et al.  Error detecting and error correcting codes , 1950 .