A Literature-Based Knowledge Graph Embedding Method for Identifying Drug Repurposing Opportunities in Rare Diseases.

Millions of Americans are affected by rare diseases, many of which have poor survival rates. However, the small market size of individual rare diseases, combined with the time and capital requirements of pharmaceutical R&D, have hindered the development of new drugs for these cases. A promising alternative is drug repurposing, whereby existing FDA-approved drugs might be used to treat diseases different from their original indications. In order to generate drug repurposing hypotheses in a systematic and comprehensive fashion, it is essential to integrate information from across the literature of pharmacology, genetics, and pathology. To this end, we leverage a newly developed knowledge graph, the Global Network of Biomedical Relationships (GNBR). GNBR is a large, heterogeneous knowledge graph comprising drug, disease, and gene (or protein) entities linked by a small set of semantic themes derived from the abstracts of biomedical literature. We apply a knowledge graph embedding method that explicitly models the uncertainty associated with literature-derived relationships and uses link prediction to generate drug repurposing hypotheses. This approach achieves high performance on a gold-standard test set of known drug indications (AUROC = 0.89) and is capable of generating novel repurposing hypotheses, which we independently validate using external literature sources and protein interaction networks. Finally, we demonstrate the ability of our model to produce explanations of its predictions.

[1]  H. Ghofrani,et al.  Sildenafil: from angina to erectile dysfunction to pulmonary hypertension and beyond , 2006, Nature Reviews Drug Discovery.

[2]  Alan F. Scott,et al.  Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders , 2004, Nucleic Acids Res..

[3]  Alex H. Wagner,et al.  DGIdb 3.0: a redesign and expansion of the drug–gene interaction database , 2017, bioRxiv.

[4]  Alexander A. Morgan,et al.  Discovery and Preclinical Validation of Drug Indications Using Compendia of Public Gene Expression Data , 2011, Science Translational Medicine.

[5]  A. Paller,et al.  Methotrexate: new uses for an old drug. , 2014, The Journal of pediatrics.

[6]  Joshua C. Denny,et al.  Validation and Enhancement of a Computable Medication Indication Resource (MEDI) Using a Large Practice-based Dataset , 2013, AMIA.

[7]  D. Haber,et al.  Wilms tumor and the WT1 gene. , 2001, Experimental cell research.

[8]  Justin K. Huang,et al.  Typing tumors using pathways selected by somatic evolution , 2018, Nature Communications.

[9]  Chi-Ying F. Huang,et al.  Trifluoperazine, an antipsychotic agent, inhibits cancer stem cell growth and overcomes drug resistance of lung cancer. , 2012, American journal of respiratory and critical care medicine.

[10]  Jure Leskovec,et al.  Modeling polypharmacy side effects with graph convolutional networks , 2018, bioRxiv.

[11]  Albert-László Barabási,et al.  Network-based prediction of drug combinations , 2019, Nature Communications.

[12]  Daniel A Culver,et al.  A concise review of pulmonary sarcoidosis. , 2011, American journal of respiratory and critical care medicine.

[13]  Peer Bork,et al.  The SIDER database of drugs and side effects , 2015, Nucleic Acids Res..

[14]  Davide Heller,et al.  STRING v10: protein–protein interaction networks, integrated over the tree of life , 2014, Nucleic Acids Res..

[15]  Jie Li,et al.  Review of Drug Repositioning Approaches and Resources , 2018, International journal of biological sciences.

[16]  Stuart J. Nelson,et al.  Normalized names for clinical drugs: RxNorm at 6 years , 2011, J. Am. Medical Informatics Assoc..

[17]  Wolfram Weckwerth,et al.  Chronic signaling via the metabolic checkpoint kinase mTORC1 induces macrophage granuloma formation and marks sarcoidosis progression , 2016, Nature Immunology.

[18]  S M Hewitt,et al.  Regulation of the proto-oncogenes bcl-2 and c-myc by the Wilms' tumor suppressor gene WT1. , 1995, Cancer research.

[19]  P. Sanseau,et al.  Drug repurposing: progress, challenges and recommendations , 2018, Nature Reviews Drug Discovery.

[20]  Russ B. Altman,et al.  A global network of biomedical relationships derived from text , 2018, Bioinform..

[21]  C E Lipscomb,et al.  Medical Subject Headings (MeSH). , 2000, Bulletin of the Medical Library Association.

[22]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..