Predicting effective drug combinations using gradient tree boosting based on features extracted from drug-protein heterogeneous network

BackgroundAlthough targeted drugs have contributed to impressive advances in the treatment of cancer patients, their clinical benefits on tumor therapies are greatly limited due to intrinsic and acquired resistance of cancer cells against such drugs. Drug combinations synergistically interfere with protein networks to inhibit the activity level of carcinogenic genes more effectively, and therefore play an increasingly important role in the treatment of complex disease.ResultsIn this paper, we combined the drug similarity network, protein similarity network and known drug-protein associations into a drug-protein heterogenous network. Next, we ran random walk with restart (RWR) on the heterogenous network using the combinatorial drug targets as the initial probability, and obtained the converged probability distribution as the feature vector of each drug combination. Taking these feature vectors as input, we trained a gradient tree boosting (GTB) classifier to predict new drug combinations. We conducted performance evaluation on the widely used drug combination data set derived from the DCDB database. The experimental results show that our method outperforms seven typical classifiers and traditional boosting algorithms.ConclusionsThe heterogeneous network-derived features introduced in our method are more informative and enriching compared to the primary ontology features, which results in better performance. In addition, from the perspective of network pharmacology, our method effectively exploits the topological attributes and interactions of drug targets in the overall biological network, which proves to be a systematic and reliable approach for drug discovery.

[1]  Armin Varmaz,et al.  RMatlab-app2web: Web Deployment of R/MATLAB Applications , 2013 .

[2]  S. Pongor,et al.  A Network-Based Target Overlap Score for Characterizing Drug Combinations: High Correlation with Cancer Clinical Trial Results , 2015, PloS one.

[3]  Mourad Tighiouart,et al.  Combinatorial Effects of Lapatinib and Rapamycin in Triple-Negative Breast Cancer Cells , 2011, Molecular Cancer Therapeutics.

[4]  Marvin Johnson,et al.  Concepts and applications of molecular similarity , 1990 .

[5]  Karin J Jensen,et al.  Synthetic Lethal Screening with Small-Molecule Inhibitors Provides a Pathway to Rational Combination Therapies for Melanoma , 2012, Molecular Cancer Therapeutics.

[6]  Xiaohua Ma,et al.  Mechanisms of drug combinations: interaction and network perspectives , 2009, Nature Reviews Drug Discovery.

[7]  Xin Chen,et al.  DCDB 2.0: a major update of the drug combination database , 2014, Database J. Biol. Databases Curation.

[8]  Bart De Moor,et al.  BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis , 2005, Bioinform..

[9]  Péter Csermely,et al.  The efficiency of multi-target drugs: the network approach might help drug design. , 2004, Trends in pharmacological sciences.

[10]  Hui Liu,et al.  SynLethDB: synthetic lethality database toward discovery of selective and sensitive anticancer drug targets , 2015, Nucleic Acids Res..

[11]  B. Greenwood,et al.  A randomised, double‐blind, placebo‐controlled trial of atovaquone–proguanil vs. sulphadoxine–pyrimethamine in the treatment of malarial anaemia in Zambian children , 2006, Tropical medicine & international health : TM & IH.

[12]  Jinhong Jung,et al.  Supervised and extended restart in random walks for ranking and link prediction in networks , 2017, PloS one.

[13]  Lawrence A. Donehower,et al.  Combinatorial therapy discovery using mixed integer linear programming , 2014, Bioinform..

[14]  Yiling Lu,et al.  Identification of optimal drug combinations targeting cellular networks: integrating phospho-proteomics and computational network analysis. , 2010, Cancer research.

[15]  Astrid Gall,et al.  Ensembl 2018 , 2017, Nucleic Acids Res..

[16]  Jingpu Zhang,et al.  Integrating Multiple Interaction Networks for Gene Function Inference , 2018, Molecules.

[17]  Emanuel F Petricoin,et al.  Compensatory Pathways Induced by MEK Inhibition Are Effective Drug Targets for Combination Therapy against Castration-Resistant Prostate Cancer , 2011, Molecular Cancer Therapeutics.

[18]  Yi-Cheng Zhang,et al.  Bipartite network projection and personal recommendation. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[19]  S. Vilar,et al.  High-Throughput Methods for Combinatorial Drug Discovery , 2013, Science Translational Medicine.

[20]  Zhigang Chen,et al.  PredRSA: a gradient boosted regression trees approach for predicting protein solvent accessibility , 2016, BMC Bioinformatics.

[21]  Keith T Flaherty,et al.  Multiple signaling pathways must be targeted to overcome drug resistance in cell lines derived from melanoma metastases , 2006, Molecular Cancer Therapeutics.

[22]  Damian Szklarczyk,et al.  STITCH 5: augmenting protein–chemical interaction networks with tissue and affinity data , 2015, Nucleic Acids Res..

[23]  Yi Pan,et al.  Drug repositioning based on comprehensive similarity measures and Bi-Random walk algorithm , 2016, Bioinform..

[24]  Dong-Sheng Cao,et al.  protr/ProtrWeb: R package and web server for generating various numerical representation schemes of protein sequences , 2015, Bioinform..

[25]  Kwok-Kin Wong,et al.  Exploiting cancer cell vulnerabilities to develop a combination therapy for ras-driven tumors. , 2011, Cancer cell.

[26]  E. Birney,et al.  Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt , 2009, Nature Protocols.

[27]  Ranadip Pal,et al.  Combination therapy design for maximizing sensitivity and minimizing toxicity , 2016, BMC Bioinformatics.

[28]  Lei Deng,et al.  Probing the functions of long non-coding RNAs by exploiting the topology of global association and interaction network , 2018, Comput. Biol. Chem..

[29]  Hui Liu,et al.  Accurate prediction of protein-lncRNA interactions by diffusion and HeteSim features across heterogeneous network , 2018, BMC Bioinformatics.

[30]  Alireza Mashaghi,et al.  Atorvastatin treatment softens human red blood cells: an optical tweezers study. , 2018, Biomedical optics express.

[31]  B. Al-Lazikani,et al.  Combinatorial drug therapy for cancer in the post-genomic era , 2012, Nature Biotechnology.

[32]  P. Sorger,et al.  Sequential Application of Anticancer Drugs Enhances Cell Death by Rewiring Apoptotic Signaling Networks , 2012, Cell.

[33]  Hui Liu,et al.  Improving compound–protein interaction prediction by building up highly credible negative samples , 2015, Bioinform..

[34]  Kurt Hornik,et al.  Misc Functions of the Department of Statistics, ProbabilityTheory Group (Formerly: E1071), TU Wien , 2015 .

[35]  Tao Xu,et al.  Target Inhibition Networks: Predicting Selective Combinations of Druggable Targets to Block Cancer Survival Pathways , 2013, PLoS Comput. Biol..

[36]  Yoshihiro Yamanishi,et al.  Prediction of drug–target interaction networks from the integration of chemical and genomic spaces , 2008, ISMB.

[37]  Hui Liu,et al.  Inferring new indications for approved drugs via random walk on drug-disease heterogenous networks , 2016, BMC Bioinformatics.

[38]  C. Sander,et al.  Models from experiments: combinatorial drug perturbations of cancer cells , 2008, Molecular systems biology.

[39]  Klaus Hechenbichler,et al.  Weighted k-Nearest-Neighbor Techniques and Ordinal Classification , 2004 .

[40]  Rich Caruana,et al.  An empirical comparison of supervised learning algorithms , 2006, ICML.

[41]  Karen A. Ryall,et al.  Systems biology approaches for advancing the discovery of effective drug combinations , 2015, Journal of Cheminformatics.

[42]  J. Friedman Stochastic gradient boosting , 2002 .

[43]  Hilde van der Togt,et al.  Publisher's Note , 2003, J. Netw. Comput. Appl..

[44]  Jingpu Zhang,et al.  Integrating Multiple Heterogeneous Networks for Novel LncRNA-Disease Association Inference , 2019, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[45]  Jianxin Chen,et al.  Large-scale exploration and analysis of drug combinations , 2015, Bioinform..

[46]  Laura M. Heiser,et al.  A community effort to assess and improve drug sensitivity prediction algorithms , 2014, Nature Biotechnology.

[47]  Matías Gámez,et al.  adabag: An R Package for Classification with Boosting and Bagging , 2013 .

[48]  Lei Huang,et al.  DrugComboRanker: drug combination discovery based on target network analysis , 2014, Bioinform..

[49]  Jingpu Zhang,et al.  KATZLGO: Large-Scale Prediction of LncRNA Functions by Using the KATZ Measure Based on Multiple Networks , 2019, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[50]  Yibo Wu,et al.  GOSemSim: an R package for measuring semantic similarity among GO terms and gene products , 2010, Bioinform..

[51]  Peter Dalgaard,et al.  R Development Core Team (2010): R: A language and environment for statistical computing , 2010 .

[52]  Andreas Bender,et al.  DeepSynergy: predicting anti-cancer drug synergy with Deep Learning , 2017, Bioinform..

[53]  Rich Caruana,et al.  An empirical evaluation of supervised learning in high dimensions , 2008, ICML '08.

[54]  Lei Deng,et al.  Accurate prediction of functional effects for variants by combining gradient tree boosting with optimal neighborhood properties , 2017, PloS one.

[55]  Nci Dream Community A community effort to assess and improve drug sensitivity prediction algorithms , 2014 .

[56]  Xing-Ming Zhao,et al.  Prediction of Drug Combinations by Integrating Molecular and Pharmacological Data , 2011, PLoS Comput. Biol..

[57]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[58]  CHUN WEI YAP,et al.  PaDEL‐descriptor: An open source software to calculate molecular descriptors and fingerprints , 2011, J. Comput. Chem..