Predicting drug-disease interactions by semi-supervised graph cut algorithm and three-layer data integration

BackgroundPrediction of drug-disease interactions is promising for either drug repositioning or disease treatment fields. The discovery of novel drug-disease interactions, on one hand can help to find novel indictions for the approved drugs; on the other hand can provide new therapeutic approaches for the diseases. Recently, computational methods for finding drug-disease interactions have attracted lots of attention because of their far more higher efficiency and lower cost than the traditional wet experiment methods. However, they still face several challenges, such as the organization of the heterogeneous data, the performance of the model, and so on.MethodsIn this work, we present to hierarchically integrate the heterogeneous data into three layers. The drug-drug and disease-disease similarities are first calculated separately in each layer, and then the similarities from three layers are linearly fused into comprehensive drug similarities and disease similarities, which can then be used to measure the similarities between two drug-disease pairs. We construct a novel weighted drug-disease pair network, where a node is a drug-disease pair with known or unknown treatment relation, an edge represents the node-node relation which is weighted with the similarity score between two pairs. Now that similar drug-disease pairs are supposed to show similar treatment patterns, we can find the optimal graph cut of the network. The drug-disease pair with unknown relation can then be considered to have similar treatment relation with that within the same cut. Therefore, we develop a semi-supervised graph cut algorithm, SSGC, to find the optimal graph cut, based on which we can identify the potential drug-disease treatment interactions.ResultsBy comparing with three representative network-based methods, SSGC achieves the highest performances, in terms of both AUC score and the identification rates of true drug-disease pairs. The experiments with different integration strategies also demonstrate that considering several sources of data can improve the performances of the predictors. Further case studies on four diseases, the top-ranked drug-disease associations have been confirmed by KEGG, CTD database and the literature, illustrating the usefulness of SSGC.ConclusionsThe proposed comprehensive similarity scores from multi-views and multiple layers and the graph-cut based algorithm can greatly improve the prediction performances of drug-disease associations.

[1]  A. Chiang,et al.  Systematic Evaluation of Drug–Disease Relationships to Identify Leads for Novel Drug Uses , 2009, Clinical pharmacology and therapeutics.

[2]  Simon M Lin,et al.  Opportunities for drug repositioning from phenome-wide association studies , 2015, Nature Biotechnology.

[3]  Alexander E. Ivliev,et al.  Drug Target Prediction and Repositioning Using an Integrated Network-Based Approach , 2013, PloS one.

[4]  Natalia Novac,et al.  Challenges and opportunities of drug repositioning. , 2013, Trends in pharmacological sciences.

[5]  C E Lipscomb,et al.  Medical Subject Headings (MeSH). , 2000, Bulletin of the Medical Library Association.

[6]  G. Giaccone,et al.  Temozolomide in patients with advanced non-small cell lung cancer with and without brain metastases. a phase II study of the EORTC Lung Cancer Group (08965). , 2003, European journal of cancer.

[7]  Bernhard Schölkopf,et al.  Learning with Local and Global Consistency , 2003, NIPS.

[8]  C. Harrison Drug repositioning: Genetic signatures uncover new uses , 2011, Nature Reviews Drug Discovery.

[9]  Francisco S. Roque,et al.  A large-scale analysis of tissue-specific pathology and gene expression of human disease genes and complexes , 2008, Proceedings of the National Academy of Sciences.

[10]  Nir Giladi,et al.  Olanzapine in Huntington's disease , 2002, Acta neurologica Scandinavica.

[11]  Zhiyong Lu,et al.  A survey of current trends in computational drug repositioning , 2016, Briefings Bioinform..

[12]  T. Ashburn,et al.  Drug repositioning: identifying and developing new uses for existing drugs , 2004, Nature Reviews Drug Discovery.

[13]  Zuping Zhang,et al.  Network-Based Inference Methods for Drug Repositioning , 2015, Comput. Math. Methods Medicine.

[14]  Jing Li,et al.  Drug Target Predictions Based on Heterogeneous Graph Inference , 2012, Pacific Symposium on Biocomputing.

[15]  Yoshihiro Yamanishi,et al.  Prediction of drug–target interaction networks from the integration of chemical and genomic spaces , 2008, ISMB.

[16]  Alan F. Scott,et al.  Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders , 2002, Nucleic Acids Res..

[17]  David S. Wishart,et al.  DrugBank 3.0: a comprehensive resource for ‘Omics’ research on drugs , 2010, Nucleic Acids Res..

[18]  W. Koroshetz,et al.  Quetiapine in the treatment of behavioral disturbances in patients with Huntington's disease. , 2006, Psychosomatics.

[19]  Xiang Zhang,et al.  Drug repositioning by integrating target information through a heterogeneous network model , 2014, Bioinform..

[20]  G. Vriend,et al.  A text-mining analysis of the human phenome , 2006, European Journal of Human Genetics.

[21]  Chuang Liu,et al.  Prediction of Drug-Target Interactions and Drug Repositioning via Network-Based Inference , 2012, PLoS Comput. Biol..

[22]  Juan Liu,et al.  Semi-supervised graph cut algorithm for drug repositioning by integrating drug, disease and genomic associations , 2016, 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[23]  Vassilis Virvilis,et al.  Literature mining, ontologies and information visualization for drug repurposing , 2011, Briefings Bioinform..

[24]  R. Sharan,et al.  PREDICT: a method for inferring novel drug indications with application to personalized medicine , 2011, Molecular systems biology.

[25]  David Weininger,et al.  SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules , 1988, J. Chem. Inf. Comput. Sci..

[26]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[27]  R. Roos,et al.  Clozapine versus placebo in Huntington’s disease: a double blind randomised comparative study , 1997, Journal of neurology, neurosurgery, and psychiatry.

[28]  A. Butte,et al.  A drug repositioning approach identifies tricyclic antidepressants as inhibitors of small cell lung cancer and other neuroendocrine tumors. , 2013, Cancer discovery.

[29]  Roded Sharan,et al.  Combining Drug and Gene Similarity Measures for Drug-Target Elucidation , 2011, J. Comput. Biol..

[30]  J. Segura-Aguilar,et al.  Aminochrome as New Preclinical Model to Find New Pharmacological Treatment that Stop the Development of Parkinson's Disease. , 2016, Current medicinal chemistry.

[31]  Alexander A. Morgan,et al.  Discovery and Preclinical Validation of Drug Indications Using Compendia of Public Gene Expression Data , 2011, Science Translational Medicine.

[32]  W. Bergfeld,et al.  Androgenetic Alopecia: An Evidence-Based Treatment Update , 2014, American Journal of Clinical Dermatology.

[33]  Masahiro Fukuoka,et al.  Post-marketing surveillance (PMS) of all patients treated with irinotecan in Japan: clinical experience and ADR profile of 13,935 patients. , 2011, Japanese journal of clinical oncology.

[34]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[35]  J. Yamashita,et al.  Plasma endothelin‐1 As a marker for doxorubicin cardiotoxicity , 1995, International journal of cancer.

[36]  Livia Perfetto,et al.  MINT, the molecular interaction database: 2012 update , 2011, Nucleic Acids Res..

[37]  S. Batalov,et al.  A gene atlas of the mouse and human protein-encoding transcriptomes. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[38]  Cathy H. Wu,et al.  UniProt: the Universal Protein knowledgebase , 2004, Nucleic Acids Res..

[39]  T. Ganesan,et al.  A Phase Ib trial of CA4P (combretastatin A-4 phosphate), carboplatin, and paclitaxel in patients with advanced cancer , 2010, British Journal of Cancer.

[40]  Chris Morley,et al.  Open Babel: An open chemical toolbox , 2011, J. Cheminformatics.

[41]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[42]  F. Iorio,et al.  Transcriptional data: a new gateway to drug repositioning? , 2013, Drug discovery today.

[43]  Laura Amato,et al.  Antidepressants for the treatment of depression in alcohol dependent individuals , 2010 .

[44]  Jane S. Paulsen,et al.  Risperidone and the treatment of psychiatric, motor, and cognitive symptoms in Huntington's disease. , 2008, Annals of clinical psychiatry : official journal of the American Academy of Clinical Psychiatrists.

[45]  Barend Mons,et al.  Online tools to support literature-based discovery in the life sciences , 2005, Briefings Bioinform..

[46]  F. Pannuti,et al.  Activity of high-dose epirubicin in advanced non-small cell lung cancer. , 1991, European journal of cancer.

[47]  Halil Kilicoglu,et al.  Using the Literature-Based Discovery Paradigm to Investigate Drug Mechanisms , 2007, AMIA.

[48]  Joshua F. McMichael,et al.  DGIdb - Mining the druggable genome , 2013, Nature Methods.

[49]  Ying Fu,et al.  LRSSL: predict and interpret drug‐disease associations based on data integration using sparse subspace learning , 2017, Bioinform..

[50]  A. Ardizzoni,et al.  Cisplatin- versus carboplatin-based chemotherapy in first-line treatment of advanced non-small-cell lung cancer: an individual patient data meta-analysis. , 2007, Journal of the National Cancer Institute.

[51]  Thomas C. Wiegers,et al.  The Comparative Toxicogenomics Database: update 2017 , 2016, Nucleic Acids Res..

[52]  A. Rossi,et al.  Carboplatin plus epirubicin plus VP-16, concurrent 'split course' radiotherapy and adjuvant surgery for limited small cell lung cancer. Gruppo Oncologico Centro-Sud-Isole (GOCSI). , 1994, Lung cancer.

[53]  R. Bonelli,et al.  Ziprasidone in Huntington’s Disease: The First Case Reports , 2003, Journal of psychopharmacology.

[54]  Nicola Biagio Mercuri,et al.  Treatment of the symptoms of Huntington's disease: Preliminary results comparing aripiprazole and tetrabenazine , 2009, Movement disorders : official journal of the Movement Disorder Society.

[55]  Feng Liu,et al.  Predicting potential drug-drug interactions by integrating chemical, biological, phenotypic and network data , 2017, BMC Bioinformatics.