Graph Theory Enables Drug Repurposing – How a Mathematical Model Can Drive the Discovery of Hidden Mechanisms of Action

We introduce a methodology to efficiently exploit natural-language expressed biomedical knowledge for repurposing existing drugs towards diseases for which they were not initially intended. Leveraging on developments in Computational Linguistics and Graph Theory, a methodology is defined to build a graph representation of knowledge, which is automatically analysed to discover hidden relations between any drug and any disease: these relations are specific paths among the biomedical entities of the graph, representing possible Modes of Action for any given pharmacological compound. We propose a measure for the likeliness of these paths based on a stochastic process on the graph. This measure depends on the abundance of indirect paths between a peptide and a disease, rather than solely on the strength of the shortest path connecting them. We provide real-world examples, showing how the method successfully retrieves known pathophysiological Mode of Action and finds new ones by meaningfully selecting and aggregating contributions from known bio-molecular interactions. Applications of this methodology are presented, and prove the efficacy of the method for selecting drugs as treatment options for rare diseases.

[1]  B. Luo,et al.  Correlation of Epstein-Barr virus and its encoded proteins with Helicobacter pylori and expression of c-met and c-myc in gastric carcinoma. , 2006, World journal of gastroenterology.

[2]  Zhiyong Lu,et al.  OpenDMAP: An open source, ontology-driven concept analysis engine, with applications to capturing knowledge regarding protein transport, protein interactions and cell-type-specific gene expression , 2008, BMC Bioinformatics.

[3]  K. E. Ravikumar,et al.  An online literature mining tool for protein phosphorylation , 2006, Bioinform..

[4]  Mark E. J. Newman,et al.  Structure and Dynamics of Networks , 2009 .

[5]  Don R. Swanson,et al.  Intervening in the Life Cycles of Scientific Knowledge Patrick Wilson, The Value of Currency , 1993, Libr. Trends.

[6]  N. Inestrosa,et al.  STI571 prevents apoptosis, tau phosphorylation and behavioural impairments induced by Alzheimer's beta-amyloid deposits. , 2008, Brain : a journal of neurology.

[7]  K. E. Ravikumar,et al.  Beyond the clause: extraction of phosphorylation information from medline abstracts , 2005, ISMB.

[8]  Haijun Zhou Network landscape from a Brownian particle's perspective. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[9]  Joyce A. Mitchell,et al.  Using literature-based discovery to identify disease candidate genes , 2005, Int. J. Medical Informatics.

[10]  Xing Chen,et al.  Drug-target interaction prediction by random walk on the heterogeneous network. , 2012, Molecular bioSystems.

[11]  J. Müller‐Quernheim Sarcoidosis: immunopathogenetic concepts and their clinical application. , 1998, The European respiratory journal.

[12]  D. Swanson Fish Oil, Raynaud's Syndrome, and Undiscovered Public Knowledge , 2015, Perspectives in biology and medicine.

[13]  Michael R. Seringhaus,et al.  Seeking a New Biology through Text Mining , 2008, Cell.

[14]  Richard F. Betzel,et al.  Exploring the Morphospace of Communication Efficiency in Complex Networks , 2013, PloS one.

[15]  Amin Sadeghi,et al.  Prion diseases - current theories and potential therapies: a brief review. , 2012, Folia neuropathologica.

[16]  Miguel A. Andrade-Navarro,et al.  Automatic Extraction of Biological Information from Scientific Text: Protein-Protein Interactions , 1999, ISMB.

[17]  P. Liberski,et al.  Apoptosis in relation to neuronal loss in experimental Creutzfeldt-Jakob disease in mice. , 2001, Acta neurobiologiae experimentalis.

[18]  Jacob de Vlieg,et al.  Literature Mining for the Discovery of Hidden Connections between Drugs, Genes and Diseases , 2010, PLoS Comput. Biol..

[19]  K. Bretonnel Cohen,et al.  Frontiers of biomedical text mining: current progress , 2007, Briefings Bioinform..

[20]  Thomas R. Gruber,et al.  Toward principles for the design of ontologies used for knowledge sharing? , 1995, Int. J. Hum. Comput. Stud..

[21]  A. Prasse,et al.  Inhaled vasoactive intestinal peptide exerts immunoregulatory effects in sarcoidosis. , 2010, American journal of respiratory and critical care medicine.

[22]  Virginia Teller Review of Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition by Daniel Jurafsky and James H. Martin. Prentice Hall 2000. , 2000 .

[23]  Jagdish Chandra Patra,et al.  Genome-wide inferring gene-phenotype relationship by walking on the heterogeneous network , 2010, Bioinform..

[24]  S. Aurora Beata Stawarska, Saussure’s Philosophy of Language as Phenomenology. Undoing the Doctrine of the Course in General Linguistics, New York, Oxford University Press, 304 pp. , 2015 .

[25]  Vassilis Virvilis,et al.  Literature mining, ontologies and information visualization for drug repurposing , 2011, Briefings Bioinform..

[26]  F. Saussure,et al.  Course in General Linguistics , 1960 .

[27]  Arthur M. Lesk,et al.  Introduction to bioinformatics , 2002 .

[28]  P. Davies,et al.  c-Abl in Neurodegenerative Disease , 2011, Journal of Molecular Neuroscience.

[29]  Magnus Sahlgren,et al.  The Distributional Hypothesis , 2008 .

[30]  Seong-Wook Yun,et al.  The Tyrosine Kinase Inhibitor STI571 Induces Cellular Clearance of PrPSc in Prion-infected Cells* , 2004, Journal of Biological Chemistry.

[31]  Michael S Lajiness,et al.  Systems chemical biology and the Semantic Web: what they mean for the future of drug discovery research. , 2012, Drug discovery today.

[32]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[33]  Sophia Ananiadou,et al.  Text Mining for Biology And Biomedicine , 2005 .

[34]  P. Seeburg,et al.  Structural mechanism for STI-571 inhibition of abelson tyrosine kinase. , 2000, Science.

[35]  Steffen Staab,et al.  Ontology Learning for the Semantic Web , 2002, IEEE Intell. Syst..

[36]  Shoji Kudoh,et al.  Quantitative Analysis of Mycobacterial and Propionibacterial DNA in Lymph Nodes of Japanese and European Patients with Sarcoidosis , 2002, Journal of Clinical Microbiology.

[37]  D. Rebholz-Schuhmann,et al.  Text-mining solutions for biomedical research: enabling integrative biology , 2012, Nature Reviews Genetics.

[38]  A. Persidis,et al.  Literature analysis for systematic drug repurposing: a case study from Biovista , 2011 .

[39]  Saso Dzeroski,et al.  Computational Discovery of Scientific Knowledge , 2007, Computational Discovery of Scientific Knowledge.

[40]  Tiziana di Matteo,et al.  Hierarchical Information Clustering by Means of Topologically Embedded Graphs , 2011, PloS one.

[41]  J. Firth,et al.  Papers in linguistics, 1934-1951 , 1957 .

[42]  Michel Klein,et al.  Combining and relating ontologies: an analysis of problems and solutions , 2001, OIS@IJCAI.

[43]  J. Fromm The Emergence of Complexity , 2004 .

[44]  Laurent Beuret,et al.  Up-regulation of MET Expression by α-Melanocyte-stimulating Hormone and MITF Allows Hepatocyte Growth Factor to Protect Melanocytes and Melanoma Cells from Apoptosis* , 2007, Journal of Biological Chemistry.

[45]  Neil R. Smalheiser,et al.  Artificial Intelligence An interactive system for finding complementary literatures : a stimulus to scientific discovery , 1995 .

[46]  P. Bork,et al.  Literature mining for the biologist: from information retrieval to biological discovery , 2006, Nature Reviews Genetics.

[47]  Mario Delgado,et al.  Regulation of immune tolerance by anti-inflammatory neuropeptides , 2007, Nature Reviews Immunology.

[48]  Christian Blaschke,et al.  Status of text-mining techniques applied to biomedical text. , 2006, Drug discovery today.

[49]  Lars Juhl Jensen,et al.  Large-scale extraction of gene regulation for model organisms in an ontological context , 2004, Silico Biol..

[50]  T. Nakayama The genetic contribution of the natriuretic peptide system to cardiovascular diseases. , 2005, Endocrine journal.

[51]  Wei Jin,et al.  HCAMiner: Mining Concept Associations for Knowledge Discovery through Concept Chain Queries , 2007, COLING.

[52]  Jon Hill,et al.  Cheminformatic/bioinformatic analysis of large corporate databases: Application to drug repurposing , 2011 .

[53]  T. Meyer,et al.  Inhibition of the Abl protein-tyrosine kinase in vitro and in vivo by a 2-phenylaminopyrimidine derivative. , 1996, Cancer research.

[54]  Renaud Lambiotte,et al.  Uncovering space-independent communities in spatial networks , 2010, Proceedings of the National Academy of Sciences.

[55]  Wanda Pratt,et al.  Using statistical and knowledge-based approaches for literature-based discovery , 2006, J. Biomed. Informatics.

[56]  Jun'ichi Tsujii,et al.  New challenges for text mining: mapping between text and manually curated pathways , 2008, BMC Bioinformatics.

[57]  Roberto Navigli,et al.  Word sense disambiguation: A survey , 2009, CSUR.

[58]  James H. Martin,et al.  Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition , 2000 .

[59]  Guido Caldarelli,et al.  Scale-Free Networks , 2007 .

[60]  G. Caldarelli,et al.  Using Networks To Understand Medical Data: The Case of Class III Malocclusions , 2012, PloS one.

[61]  Padmini Srinivasan,et al.  Mining MEDLINE for implicit links between dietary substances and diseases , 2004, ISMB/ECCB.

[62]  K. Bretonnel Cohen,et al.  Getting Started in Text Mining , 2008, PLoS Comput. Biol..