Predicting drug-target interactions using multi-label learning with community detection method (DTI-MLCD)

Identifying drug-target interactions (DTIs) is an important step for drug discovery and drug repositioning. To reduce heavily experiment cost, booming machine learning has been applied to this field and developed many computational methods, especially binary classification methods. However, there is still much room for improvement in the performance of current methods. Multi-label learning can reduce difficulties faced by binary classification learning with high predictive performance, and has not been explored extensively. The key challenge it faces is the exponential-sized output space, and considering label correlations can help it. Thus, we facilitate the multi-label classification by introducing community detection methods for DTIs prediction, named DTI-MLCD. On the other hand, we updated the gold standard data set proposed in 2008 and still in use today. The proposed DTI-MLCD is performed on the gold standard data set before and after the update, and shows the superiority than other classical machine learning methods and other benchmark proposed methods, which confirms the efficiency of it. The data and code for this study can be found at https://github.com/a96123155/DTI-MLCD.

[1]  Tao Zhang,et al.  Classification Models for Predicting Cytochrome P450 Enzyme‐Substrate Selectivity , 2012, Molecular informatics.

[2]  Bin Chen,et al.  Predicting drug target interactions using meta-path-based semantic network analysis , 2016, BMC Bioinformatics.

[3]  Shuigeng Zhou,et al.  Boosting compound-protein interaction prediction by deep learning , 2015, 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[4]  Dingfang Li,et al.  Drug-Target Interaction Prediction through Label Propagation with Linear Neighborhood Information , 2017, Molecules.

[5]  Seongok Ryu,et al.  Predicting Drug-Target Interaction Using a Novel Graph Neural Network with 3D Structure-Embedded Graph Representation , 2019, J. Chem. Inf. Model..

[6]  Yasuo Tabei,et al.  Identification of chemogenomic features from drug–target interaction networks using interpretable classifiers , 2012, Bioinform..

[7]  Arezou Koohi Prediction of drug-target interactions using popular Collaborative Filtering methods , 2013, 2013 IEEE International Workshop on Genomic Signal Processing and Statistics.

[8]  Chee Keong Kwoh,et al.  Drug-target interaction prediction via class imbalance-aware ensemble learning , 2016, BMC Bioinformatics.

[9]  Yoshihiro Yamanishi,et al.  Drug-target interaction prediction from chemical, genomic and pharmacological data in an integrated framework , 2010, Bioinform..

[10]  Chuang Liu,et al.  Prediction of Drug-Target Interactions and Drug Repositioning via Network-Based Inference , 2012, PLoS Comput. Biol..

[11]  Mehmet Gönen,et al.  Predicting drug-target interactions from chemical and genomic kernels using Bayesian matrix factorization , 2012, Bioinform..

[12]  M. Friedman The Use of Ranks to Avoid the Assumption of Normality Implicit in the Analysis of Variance , 1937 .

[13]  Wei Chen,et al.  Reconstruction and analysis of a genome-scale metabolic model of the oleaginous fungus Mortierella alpina , 2015, BMC Systems Biology.

[14]  Kristian Kersting,et al.  How is a data-driven approach better than random choice in label space division for multi-label classification? , 2016, Entropy.

[15]  Matthieu Latapy,et al.  Computing Communities in Large Networks Using Random Walks , 2004, J. Graph Algorithms Appl..

[16]  Egon L. Willighagen,et al.  The Chemistry Development Kit (CDK) v2.0: atom typing, depiction, molecular formulas, and substructure searching , 2017, Journal of Cheminformatics.

[17]  Grigorios Tsoumakas,et al.  On the Stratification of Multi-label Data , 2011, ECML/PKDD.

[18]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[19]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[20]  Min-Ling Zhang,et al.  A Review on Multi-Label Learning Algorithms , 2014, IEEE Transactions on Knowledge and Data Engineering.

[21]  Hanbi Lee,et al.  Comparison of Target Features for Predicting Drug-Target Interactions by Deep Neural Network Based on Large-Scale Drug-Induced Transcriptome Data , 2019, Pharmaceutics.

[22]  Keith C. C. Chan,et al.  Large-scale prediction of drug-target interactions from deep representations , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[23]  Mikhail Pachkov,et al.  MotEvo: integrated Bayesian probabilistic methods for inferring regulatory sites and motifs on multiple alignments of DNA sequences , 2012, Bioinform..

[24]  Sereina Riniker,et al.  Open-source platform to benchmark fingerprints for ligand-based virtual screening , 2013, Journal of Cheminformatics.

[25]  Song He,et al.  Deep learning-based transcriptome data classification for drug-target interaction prediction , 2018, BMC Genomics.

[26]  Chris Morley,et al.  Open Babel: An open chemical toolbox , 2011, J. Cheminformatics.

[27]  Salvatore Alaimo,et al.  Drug–target interaction prediction through domain-tuned network-based inference , 2013, Bioinform..

[28]  Ming Wen,et al.  Deep-Learning-Based Drug-Target Interaction Prediction. , 2017, Journal of proteome research.

[29]  Rinkle Rani,et al.  BE-DTI': Ensemble framework for drug target interaction prediction using dimensionality reduction and active learning , 2018, Comput. Methods Programs Biomed..

[30]  Yong Zhou,et al.  Prediction of Drug–Target Interaction Networks from the Integration of Protein Sequences and Drug Chemical Structures , 2017, Molecules.

[31]  Hojung Nam,et al.  SELF-BLM: Prediction of drug-target interactions via self-training SVM , 2017, PloS one.

[32]  Shi-Hua Zhang,et al.  DrugE-Rank: improving drug–target interaction prediction of new candidate drugs or targets by ensemble learning to rank , 2016, Bioinform..

[33]  Grigorios Tsoumakas,et al.  Random K-labelsets for Multilabel Classification , 2022 .

[34]  Jian Peng,et al.  A Network Integration Approach for Drug-Target Interaction Prediction and Computational Drug Repositioning from Heterogeneous Information , 2017, RECOMB 2017.

[35]  Peter Antal,et al.  VB-MK-LMF: fusion of drugs, targets and interactions using variational Bayesian multiple kernel logistic matrix factorization , 2017, BMC Bioinformatics.

[36]  Xiaobo Zhou,et al.  Semi-supervised drug-protein interaction prediction from heterogeneous biological spaces , 2010, BMC Systems Biology.

[37]  Sajid Ahmed,et al.  FRnet-DTI: Deep Convolutional Neural Networks with Evolutionary and Structural Features for Drug-Target Interaction , 2018, 1806.07174.

[38]  Yongsheng Liu,et al.  iDTi-CSsmoteB: Identification of Drug–Target Interaction Based on Drug Chemical Structure and Protein Sequence Using XGBoost With Over-Sampling Technique SMOTE , 2019, IEEE Access.

[39]  Robert B. Russell,et al.  SuperTarget and Matador: resources for exploring drug-target relationships , 2007, Nucleic Acids Res..

[40]  Mark H Johnson,et al.  The development of spatial frequency biases in face recognition. , 2010, Journal of experimental child psychology.

[41]  Sahand Khakabimamaghani,et al.  Drug-target interaction prediction from PSSM based evolutionary information. , 2016, Journal of pharmacological and toxicological methods.

[42]  Yoshihiro Yamanishi,et al.  Prediction of drug–target interaction networks from the integration of chemical and genomic spaces , 2008, ISMB.

[43]  Bin Yu,et al.  Predicting drug-target interactions using Lasso with random forest based on evolutionary information and chemical structure. , 2019, Genomics.

[44]  Peng Chen,et al.  DrugRPE: Random projection ensemble approach to drug-target interaction prediction , 2017, Neurocomputing.

[45]  Xiao-Ying Yan,et al.  Prediction of drug-target interaction by label propagation with mutual interaction information derived from heterogeneous network. , 2016, Molecular bioSystems.

[46]  Zhi-Hua Zhou,et al.  ML-KNN: A lazy learning approach to multi-label learning , 2007, Pattern Recognit..

[47]  Xing Chen,et al.  Drug-target interaction prediction by random walk on the heterogeneous network. , 2012, Molecular bioSystems.

[48]  Carl T. Bergstrom,et al.  The map equation , 2009, 0906.1405.

[49]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[50]  Bo Liao,et al.  Screening drug-target interactions with positive-unlabeled learning , 2017, Scientific Reports.

[51]  Grigorios Tsoumakas,et al.  Predicting Drug-Target Interactions With Multi-Label Classification and Label Partitioning , 2019, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[52]  M. Friedman A Comparison of Alternative Tests of Significance for the Problem of $m$ Rankings , 1940 .

[53]  Ivan G. Costa,et al.  A multiple kernel learning algorithm for drug-target interaction prediction , 2016, BMC Bioinformatics.

[54]  Yoshihiro Yamanishi,et al.  KEGG for linking genomes to life and the environment , 2007, Nucleic Acids Res..

[55]  Chunyan Miao,et al.  Neighborhood Regularized Logistic Matrix Factorization for Drug-Target Interaction Prediction , 2016, PLoS Comput. Biol..

[56]  Chris Morley,et al.  Pybel: a Python wrapper for the OpenBabel cheminformatics toolkit , 2008, Chemistry Central journal.

[57]  Réka Albert,et al.  Near linear time algorithm to detect community structures in large-scale networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[58]  Anna Korhonen,et al.  Link prediction in drug-target interactions network using similarity indices , 2017, BMC Bioinformatics.

[59]  Xiangrong Liu,et al.  Machine Learning for Drug-Target Interaction Prediction , 2018, Molecules.

[60]  Abdollah Dehzangi,et al.  iDTI-ESBoost: Identification of Drug Target Interaction Using Evolutionary and Structural Features with Boosting , 2017, Scientific Reports.

[61]  Qing Liu,et al.  Predicting drug-target interaction networks of human diseases based on multiple feature information. , 2013, Pharmacogenomics.

[62]  Geoff Holmes,et al.  Classifier chains for multi-label classification , 2009, Machine Learning.

[63]  Yongdong Zhang,et al.  Drug-target interaction prediction: databases, web servers and computational models , 2016, Briefings Bioinform..

[64]  Yoshihiro Yamanishi,et al.  Supervised prediction of drug–target interactions using bipartite local models , 2009, Bioinform..

[65]  Yong Zhou,et al.  A Computational-Based Method for Predicting Drug-Target Interactions by Using Stacked Autoencoder Deep Neural Network , 2017, J. Comput. Biol..

[66]  Bin Chen,et al.  PubChem as a Source of Polypharmacology , 2009, J. Chem. Inf. Model..

[67]  Shuqin Zhang,et al.  Drug-target interaction prediction by integrating multiview network data , 2017, Comput. Biol. Chem..

[68]  Raymond J. Mooney,et al.  Using Both Latent and Supervised Shared Topics for Multitask Learning , 2013, ECML/PKDD.

[69]  Licheng Jiao,et al.  Integrating Spectral Kernel Learning and Constraints in Semi-Supervised Classification , 2012, Neural Processing Letters.

[70]  Yun Xie,et al.  Identification of drug-target interaction from interactome network with 'guilt-by-association' principle and topology features , 2016, Bioinform..

[71]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[72]  Yanqing Niu,et al.  Recent Advances in the Machine Learning-Based Drug-Target Interaction Prediction. , 2019, Current drug metabolism.

[73]  Alexander E. Ivliev,et al.  Drug Target Prediction and Repositioning Using an Integrated Network-Based Approach , 2013, PloS one.

[74]  L. Jiang,et al.  PROFEAT: a web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence , 2006, Nucleic Acids Res..

[75]  Hojung Nam,et al.  DeepConv-DTI: Prediction of drug-target interactions via deep learning with convolution on protein sequences , 2018, PLoS Comput. Biol..

[76]  Jean-Philippe Vert,et al.  Protein-ligand interaction prediction: an improved chemogenomics approach , 2008, Bioinform..

[77]  Min Wu,et al.  Drug-target interaction prediction using ensemble learning and dimensionality reduction. , 2017, Methods.

[78]  Yuhao Wang,et al.  Predicting drug-target interactions using restricted Boltzmann machines , 2013, Bioinform..

[79]  Elisa Michelini,et al.  Protein ligand interaction prediction , 2012 .

[80]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[81]  Silvio C. E. Tosatto,et al.  The Pfam protein families database in 2019 , 2018, Nucleic Acids Res..

[82]  Ping Zhang,et al.  Interpretable Drug Target Prediction Using Deep Neural Representation , 2018, IJCAI.

[83]  Qi Zhao,et al.  Computational Model Development of Drug-Target Interaction Prediction: A Review. , 2019, Current protein & peptide science.

[84]  Chang Liu,et al.  Predicting Drug–Target Interactions Using Probabilistic Matrix Factorization , 2013, J. Chem. Inf. Model..

[85]  Chee Keong Kwoh,et al.  Drug-target interaction prediction by learning from local information and neighbors , 2013, Bioinform..

[86]  Boleslaw K. Szymanski,et al.  Community Detection via Maximization of Modularity and Its Variants , 2014, IEEE Transactions on Computational Social Systems.

[87]  K. Mealey,et al.  Clinical pharmacology and therapeutics. , 2013, The Veterinary clinics of North America. Small animal practice.

[88]  Vladimir B. Bajic,et al.  DDR: efficient computational method to predict drug–target interactions using graph mining and machine learning approaches , 2017, Bioinform..

[89]  Hui Liu,et al.  Improving compound–protein interaction prediction by building up highly credible negative samples , 2015, Bioinform..

[90]  Hao Ding,et al.  Collaborative matrix factorization with multiple similarities for predicting drug-target interactions , 2013, KDD.

[91]  Guozheng Li,et al.  Modelling of inquiry diagnosis for coronary heart disease in traditional Chinese medicine by using multi-label learning , 2010, BMC complementary and alternative medicine.

[92]  Elena Marchiori,et al.  Gaussian interaction profile kernels for predicting drug-target interaction , 2011, Bioinform..

[93]  Giorgio Valentini,et al.  Network-Based Drug Ranking and Repositioning with Respect to DrugBank Therapeutic Categories , 2013, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[94]  David S. Wishart,et al.  DrugBank: a knowledgebase for drugs, drug actions and drug targets , 2007, Nucleic Acids Res..

[95]  Dong-Qing Wei,et al.  SPVec: A Word2vec-Inspired Feature Representation Method for Drug-Target Interaction Prediction , 2020, Frontiers in Chemistry.

[96]  Antje Chang,et al.  BRENDA , the enzyme database : updates and major new developments , 2003 .

[97]  Zhu-Hong You,et al.  RFDT: A Rotation Forest-based Predictor for Predicting Drug-Target Interactions Using Drug Structure and Protein Sequence Information. , 2016, Current protein & peptide science.

[98]  Salvatore Alaimo,et al.  DT-Web: a web-based application for drug-target interaction and drug combination prediction through domain-tuned network-based inference , 2015, BMC Systems Biology.

[99]  Xin Geng,et al.  Binary relevance for multi-label learning: an overview , 2018, Frontiers of Computer Science.

[100]  Z. R. Li,et al.  Update of PROFEAT: a web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence , 2006, Nucleic Acids Res..

[101]  James G. Nourse,et al.  Reoptimization of MDL Keys for Use in Drug Discovery , 2002, J. Chem. Inf. Comput. Sci..

[102]  Jijun Tang,et al.  Identification of drug-target interactions via multiple information integration , 2017, Inf. Sci..

[103]  Chee Keong Kwoh,et al.  Drug-Target Interaction Prediction with Graph Regularized Matrix Factorization , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[104]  Yong-Yeol Ahn,et al.  Optimizing drug–target interaction prediction based on random walk on heterogeneous networks , 2015, Journal of Cheminformatics.

[105]  Yuye He,et al.  PaDEL‐DDPredictor: Open‐source software for PD‐PK‐T prediction , 2013, J. Comput. Chem..

[106]  Jean-Loup Guillaume,et al.  Fast unfolding of community hierarchies in large networks , 2008, ArXiv.

[107]  Xing Chen,et al.  In silico prediction of drug-target interaction networks based on drug chemical structure and protein sequences , 2017, Scientific Reports.

[108]  Ulrik Brandes,et al.  On Modularity Clustering , 2008, IEEE Transactions on Knowledge and Data Engineering.

[109]  M Michael Gromiha,et al.  Drug-Target Interactions: Prediction Methods and Applications. , 2018, Current protein & peptide science.

[110]  Yanli Wang,et al.  Predicting drug-target interactions by dual-network integrated logistic matrix factorization , 2017, Scientific Reports.

[111]  Jie Li,et al.  SDTNBI: an integrated network and chemoinformatics tool for systematic prediction of drug–target interactions and drug repositioning , 2016, Briefings Bioinform..

[112]  Ladislav Peska,et al.  Drug-target interaction prediction: A Bayesian ranking approach , 2017, Comput. Methods Programs Biomed..

[113]  Pingzhao Hu,et al.  Predicting drug-target interaction network using deep learning model , 2019, Comput. Biol. Chem..