A hybrid ensemble‐based technique for predicting drug–target interactions

Drug–target interaction is the intercommunication between chemical drugs and target proteins to produce any kind of change in the human body. The laboratory experiments conducted to recognize these potential intercommunications are costly and tedious. Various computational methods have been established recently, including the chemogenomic approach for identifying drug–target intercommunication, that integrates the chemical data of the drugs and the genomic properties of the proteins for identifying the interactions between them. This paper proposes a novel technique based on hybrid ensemble to predict drug target interactions. Hybrid ensembles introduce diversity in classification that helps to improve the performance of prediction. The proposed method has been evaluated by comparing the technique with state–of‐the‐art methods on two different databases under three cross‐validation settings. The comparison clearly shows that the technique produced significant improvement in the interaction prediction.

[1]  Qingsong Xu,et al.  Rcpi: R/Bioconductor package to generate various descriptors of proteins, compounds and their interactions , 2015, Bioinform..

[2]  David A. Landgrebe,et al.  A survey of decision tree classifier methodology , 1991, IEEE Trans. Syst. Man Cybern..

[3]  Chee-Keong Kwoh,et al.  Computational prediction of drug-target interactions using chemogenomic approaches: an empirical survey , 2019, Briefings Bioinform..

[4]  Gerard Pujadas,et al.  Protein-ligand Docking: A Review of Recent Advances and Future Perspectives , 2008 .

[5]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[6]  Andreas Christmann,et al.  Support vector machines , 2008, Data Mining and Knowledge Discovery Handbook.

[7]  Yasuo Tabei,et al.  Identification of chemogenomic features from drug–target interaction networks using interpretable classifiers , 2012, Bioinform..

[8]  Yoshihiro Yamanishi,et al.  Prediction of drug–target interaction networks from the integration of chemical and genomic spaces , 2008, ISMB.

[9]  Marvin Johnson,et al.  Concepts and applications of molecular similarity , 1990 .

[10]  Chee Keong Kwoh,et al.  Drug-target interaction prediction via class imbalance-aware ensemble learning , 2016, BMC Bioinformatics.

[11]  M. Barker,et al.  Partial least squares for discrimination , 2003 .

[12]  E. Marchiori,et al.  Predicting Drug-Target Interactions for New Drug Compounds Using a Weighted Nearest Neighbor Profile , 2013, PloS one.

[13]  Giles M. Foody,et al.  Increasing soft classification accuracy through the use of an ensemble of classifiers , 2005, Proceedings. 2005 IEEE International Geoscience and Remote Sensing Symposium, 2005. IGARSS '05..

[14]  Luis Mateus Rocha,et al.  Singular value decomposition and principal component analysis , 2003 .

[15]  Chuang Liu,et al.  Prediction of Drug-Target Interactions and Drug Repositioning via Network-Based Inference , 2012, PLoS Comput. Biol..

[16]  Manoj Kumar Gupta,et al.  A comprehensive review of feature based methods for drug target interaction prediction , 2019, J. Biomed. Informatics.

[17]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[18]  Mikhail Belkin,et al.  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[19]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[20]  Jessica J. Fridrich,et al.  Ensemble Classifiers for Steganalysis of Digital Media , 2012, IEEE Transactions on Information Forensics and Security.

[21]  Dirk Van,et al.  Ensemble Methods: Foundations and Algorithms , 2012 .

[22]  R. Cramer Partial Least Squares (PLS): Its strengths and limitations , 1993 .

[23]  Min Wu,et al.  Drug-target interaction prediction using ensemble learning and dimensionality reduction. , 2017, Methods.

[24]  David S. Wishart,et al.  DrugBank 4.0: shedding new light on drug metabolism , 2013, Nucleic Acids Res..

[25]  L. Jiang,et al.  PROFEAT: a web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence , 2006, Nucleic Acids Res..

[26]  E. Birney,et al.  Pfam: the protein families database , 2013, Nucleic Acids Res..

[27]  Hua Yu,et al.  A Systematic Prediction of Multiple Drug-Target Interactions from Chemical, Genomic, and Pharmacological Data , 2012, PloS one.

[28]  Xiaomin Luo,et al.  TarFisDock: a web server for identifying drug targets with docking approach , 2006, Nucleic Acids Res..

[29]  Michael J. Keiser,et al.  Large Scale Prediction and Testing of Drug Activity on Side-Effect Targets , 2012, Nature.

[30]  K. Chou,et al.  Predicting Drug-Target Interaction Networks Based on Functional Groups and Biological Features , 2010, PloS one.

[31]  Senén Barro,et al.  Do we need hundreds of classifiers to solve real world classification problems? , 2014, J. Mach. Learn. Res..

[32]  Yongdong Zhang,et al.  Drug-target interaction prediction: databases, web servers and computational models , 2016, Briefings Bioinform..

[33]  Elena Marchiori,et al.  Gaussian interaction profile kernels for predicting drug-target interaction , 2011, Bioinform..

[34]  Joel Dudley,et al.  Exploiting drug-disease relationships for computational drug repositioning , 2011, Briefings Bioinform..

[35]  Anne Mai Wassermann,et al.  Ligand Prediction for Orphan Targets Using Support Vector Machines and Various Target-Ligand Kernels Is Dominated by Nearest Neighbor Effects , 2009, J. Chem. Inf. Model..