PREDICT: a method for inferring novel drug indications with application to personalized medicine

Inferring potential drug indications, for either novel or approved drugs, is a key step in drug development. Previous computational methods in this domain have focused on either drug repositioning or matching drug and disease gene expression profiles. Here, we present a novel method for the large‐scale prediction of drug indications (PREDICT) that can handle both approved drugs and novel molecules. Our method is based on the observation that similar drugs are indicated for similar diseases, and utilizes multiple drug–drug and disease–disease similarity measures for the prediction task. On cross‐validation, it obtains high specificity and sensitivity (AUC=0.9) in predicting drug indications, surpassing existing methods. We validate our predictions by their overlap with drug indications that are currently under clinical trials, and by their agreement with tissue‐specific expression information on the drug targets. We further show that disease‐specific genetic signatures can be used to accurately predict drug indications for new diseases (AUC=0.92). This lays the computational foundation for future personalized drug treatments, where gene expression signatures from individual patients would replace the disease‐specific signatures.

[1]  Temple F. Smith,et al.  The statistical distribution of nucleic acid similarities. , 1985, Nucleic acids research.

[2]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[3]  M. Rogawski,et al.  Cycloleucine blocks NMDA responses in cultured hippocampal neurones under voltage clamp: antagonism at the strychnine‐insensitive glycine receptor , 1989, British journal of pharmacology.

[4]  L. Oreland,et al.  Regulation of methionine adenosyltransferase catalytic activity and messenger RNA in SH-SY5Y human neuroblastoma cells. , 1998, Biochemical pharmacology.

[5]  J. Kemp,et al.  NMDA RECEPTOR ANTAGONISTS , 1998 .

[6]  Philip Resnik,et al.  Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language , 1999, J. Artif. Intell. Res..

[7]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[8]  C. Raftopoulos,et al.  Cabergoline in the treatment of hyperprolactinemia: a study in 455 patients. , 1999, The Journal of clinical endocrinology and metabolism.

[9]  C E Lipscomb,et al.  Medical Subject Headings (MeSH). , 2000, Bulletin of the Medical Library Association.

[10]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[11]  Alan R. Aronson,et al.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program , 2001, AMIA.

[12]  G. Terstappen,et al.  In silico research in drug discovery. , 2001, Trends in pharmacological sciences.

[13]  Alex E. Lash,et al.  Gene Expression Omnibus: NCBI gene expression and hybridization array data repository , 2002, Nucleic Acids Res..

[14]  Ioannis Xenarios,et al.  DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactions , 2002, Nucleic Acids Res..

[15]  Y. Martin,et al.  Do structurally similar molecules have similar biological activity? , 2002, Journal of medicinal chemistry.

[16]  Alan F. Scott,et al.  Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders , 2002, Nucleic Acids Res..

[17]  R. W. Hansen,et al.  The price of innovation: new estimates of drug development costs. , 2003, Journal of health economics.

[18]  Wanda Pratt,et al.  A Study of Biomedical Concept Identification: MetaMap vs. People , 2003, AMIA.

[19]  A. Skrbo,et al.  [Classification of drugs using the ATC system (Anatomic, Therapeutic, Chemical Classification) and the latest changes]. , 2004, Medicinski arhiv.

[20]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[21]  M. Cantor,et al.  Mining OMIM for insight into complex diseases. , 2004, Studies in health technology and informatics.

[22]  M. Farlow NMDA receptor antagonists. A new therapeutic approach for Alzheimer's disease. , 2004, Geriatrics.

[23]  K. Alagiakrishnan,et al.  Orthostatic hypotension: a primary care primer for assessment and treatment.(Cardiovascular disorders) , 2004 .

[24]  S. Batalov,et al.  A gene atlas of the mouse and human protein-encoding transcriptomes. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[25]  H. Lehrach,et al.  A Human Protein-Protein Interaction Network: A Resource for Annotating the Proteome , 2005, Cell.

[26]  Peer Bork,et al.  Systematic Association of Genes to Phenotypes by Genome and Literature Mining , 2005, PLoS biology.

[27]  S. L. Wong,et al.  Towards a proteome-scale map of the human protein–protein interaction network , 2005, Nature.

[28]  Y. Soyoral,et al.  Successful treatment of nephrotic syndrome due to FMF amyloidosis with azathioprine: report of three Turkish cases , 2006, Rheumatology International.

[29]  Paul A Clemons,et al.  The Connectivity Map: Using Gene-Expression Signatures to Connect Small Molecules, Genes, and Disease , 2006, Science.

[30]  O. Kelber,et al.  Effects of St. John's wort extract and single constituents on stress-induced hyperthermia in mice. , 2006, Planta medica.

[31]  C. Steinbeck,et al.  Recent developments of the chemistry development kit (CDK) - an open-source java library for chemo- and bioinformatics. , 2006, Current pharmaceutical design.

[32]  G. Vriend,et al.  A text-mining analysis of the human phenome , 2006, European Journal of Human Genetics.

[33]  A. Rosatello,et al.  High prolactin levels as a worsening factor for migraine , 2006, The Journal of Headache and Pain.

[34]  Luhua Lai,et al.  Prediction of potential drug targets based on simple sequence properties , 2007, BMC Bioinformatics.

[35]  M. Moran,et al.  Large-scale mapping of human protein–protein interactions by mass spectrometry , 2007, Molecular systems biology.

[36]  [A case of recurrent renal cell carcinoma which recurred after fourth surgical resection and survived for about 2 years by medroxyporgesterone acetate administration]. , 2007, Hinyokika kiyo. Acta urologica Japonica.

[37]  Justin Lamb,et al.  The Connectivity Map: a new tool for biomedical research , 2007, Nature Reviews Cancer.

[38]  K. Dolinski,et al.  The BioGRID Interaction Database: 2008 update , 2007, Nucleic Acids Res..

[39]  Sampsa Hautaniemi,et al.  Fast Gene Ontology based clustering for microarray experiments , 2008, BioData Mining.

[40]  Robert B. Russell,et al.  SuperTarget and Matador: resources for exploring drug-target relationships , 2007, Nucleic Acids Res..

[41]  Francisco S. Roque,et al.  A large-scale analysis of tissue-specific pathology and gene expression of human disease genes and complexes , 2008, Proceedings of the National Academy of Sciences.

[42]  David S. Wishart,et al.  DrugBank: a knowledgebase for drugs, drug actions and drug targets , 2007, Nucleic Acids Res..

[43]  P. Robinson,et al.  The Human Phenotype Ontology: a tool for annotating and analyzing human hereditary disease. , 2008, American journal of human genetics.

[44]  S. Brunak,et al.  Generating Genome‐Scale Candidate Gene Lists for Pharmacogenomics , 2009, Clinical pharmacology and therapeutics.

[45]  Guanghui Hu,et al.  Human Disease-Drug Network Based on Genomic Expression Profiles , 2009, PloS one.

[46]  Yoshihiro Yamanishi,et al.  Supervised prediction of drug–target interactions using bipartite local models , 2009, Bioinform..

[47]  Xiaoyan Zhu,et al.  Building Disease-Specific Drug-Protein Connectivity Maps from Molecular Interaction Networks and PubMed Abstracts , 2009, PLoS Comput. Biol..

[48]  W. Kibbe,et al.  Annotating the human genome with Disease Ontology , 2009, BMC Genomics.

[49]  Ibrahim Emam,et al.  ArrayExpress update—from an archive of functional genomics experiments to the atlas of gene expression , 2008, Nucleic Acids Res..

[50]  Philip E. Bourne,et al.  Drug Discovery Using Chemical Systems Biology: Repositioning the Safe Medicine Comtan to Treat Multi-Drug and Extensively Drug Resistant Tuberculosis , 2009, PLoS Comput. Biol..

[51]  A. Chiang,et al.  Systematic Evaluation of Drug–Disease Relationships to Identify Leads for Novel Drug Uses , 2009, Clinical pharmacology and therapeutics.

[52]  Peter B. McGarvey,et al.  Infrastructure for the life sciences: design and implementation of the UniProt website , 2009, BMC Bioinformatics.

[53]  Xin Chen,et al.  DCDB: Drug combination database , 2010, Bioinform..

[54]  S. Mundlos,et al.  The Human Phenotype Ontology , 2010, Clinical genetics.

[55]  Roded Sharan,et al.  An Algorithmic Framework for Predicting Side-Effects of Drugs , 2010, RECOMB.

[56]  P. Bork,et al.  A side effect resource to capture phenotypic effects of drugs , 2010, Molecular systems biology.

[57]  Anton Yuryev,et al.  Computational Approaches for Drug Repositioning and Combination Therapy Design , 2010, J. Bioinform. Comput. Biol..

[58]  Roded Sharan,et al.  An Algorithmic Framework for Predicting Side Effects of Drugs , 2011, J. Comput. Biol..

[59]  Roded Sharan,et al.  Combining Drug and Gene Similarity Measures for Drug-Target Elucidation , 2011, J. Comput. Biol..

[60]  Christie S. Chang,et al.  The BioGRID interaction database: 2013 update , 2012, Nucleic Acids Res..