Predicting combinative drug pairs towards realistic screening via integrating heterogeneous features

Background: Drug combination is an effective approach for treating complex diseases. However, determining combinative drug pairs in clinical trials is still costly, so computational approaches are used to identify potential drug pairs in advance. Existing computational approaches have two shortcomings: (i) the lack of an effective integration of heterogeneous features leads to time-consuming training and can even result in an over-fitted classifier; and (ii) they predict potential combinations only among known drugs that already have known combinations, which cannot meet the demand of realistic screening, where the main interest is in potential combinative pairs among new drugs that have no approved combination with any other drug.

Results: To tackle these two problems, we propose a novel drug-driven approach for predicting potential combinative pairs on a large scale. We define four new features based on heterogeneous data and design an efficient fusion scheme to integrate these features. More importantly, we design cross-validations that match realistic screening scenarios of drug combinations involving both known drugs and new drugs. In addition, we perform an extra investigation to show how each kind of heterogeneous feature is related to combinative drug pairs; this investigation inspires the design of our approach. Experiments on real data demonstrate the effectiveness of our fusion scheme for integrating heterogeneous features and its predictive power in three scenarios of realistic screening. The prediction among known drugs achieves an AUC of 0.954 and an AUPR of 0.821, that between known drugs and new drugs achieves 0.909 and 0.635, and that among new drugs achieves 0.809 and 0.592, respectively.

Conclusions: Our approach provides not only an effective tool to integrate heterogeneous features but also the first tool to predict potential combinative pairs among new drugs.
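The three screening scenarios named in the abstract (pairs among known drugs, pairs between a known and a new drug, and pairs among new drugs) can be illustrated with a small sketch. The abstract does not specify the cross-validation protocol, so the following is only an assumption about how candidate pairs might be partitioned once a subset of drugs is held out as "new"; the function name `split_pairs_by_scenario` and the drug identifiers are hypothetical.

```python
from itertools import combinations


def split_pairs_by_scenario(drugs, new_drugs):
    """Partition all unordered drug pairs into three screening scenarios.

    Assumption: a drug is "new" if it is held out, i.e. none of its
    combinations are visible during training.
    """
    new = set(new_drugs)
    labels = ("known-known", "known-new", "new-new")
    scenarios = {label: [] for label in labels}
    for a, b in combinations(sorted(drugs), 2):
        # The number of held-out drugs in the pair (0, 1, or 2) picks the scenario.
        n_new = (a in new) + (b in new)
        scenarios[labels[n_new]].append((a, b))
    return scenarios


# Hypothetical example: five drugs, two of them treated as new.
drugs = ["d1", "d2", "d3", "d4", "d5"]
held_out = ["d4", "d5"]
parts = split_pairs_by_scenario(drugs, held_out)
for name, pairs in parts.items():
    print(name, len(pairs))
```

Under this assumed split, a classifier would be trained only on known-known pairs from the training folds, and each of the three groups would be scored separately, which is consistent with the abstract reporting a separate AUC and AUPR per scenario.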
