Prediction of Drug-Target Interactions by Ensemble Learning Method From Protein Sequence and Drug Fingerprint

Predicting the target-drug interactions (DITs) is of great important for screening new drug candidate and understanding biological processes. However, identifying the drug-target interactions through traditional experiments is still costly, laborious and complicated. Thus, there is a great need for developing reliable computational methods to effectively predict DTIs. In this study, we report a novel computational method combining local optimal oriented pattern (LOOP), Position Specific Scoring Matrix (PSSM) and Rotation Forest (RF) for predicting DTI. Specifically, the target protein sequence is firstly transformed as the PSSM, in which the evolutionary information of protein is retained. Then, the LOOP is used to extract the feature vectors from PSSM, and the sub-structure information of drug molecule is represented as fingerprint features. Finally, RF classifier is adopted to infer the potential drug-target interactions. When the experiment is carried out on four benchmark datasets including enzyme, ion channel, $G$ protein-coupled receptors (GPCRs), and nuclear receptor, we achieved the high average prediction accuracies of 89.09%, 87.53%, 82.05%, and 73.33% respectively. For further evaluating the proposed method, we compare the prediction performance of the proposed method with the state-of-the-art support vector machine (SVM) and K-Nearest Neighbor (KNN). The comprehensive experimental results illustrate that the proposed method is reliable and efficiency for predicting DTIs. It is anticipated that the proposed method can become a useful tool for predicting a large-scale potential DTIs.

[1]  Hailin Chen,et al.  A Semi-Supervised Method for Drug-Target Interaction Prediction with Consistency in Networks , 2013, PloS one.

[2]  Yong Zhou,et al.  Prediction of Drug–Target Interaction Networks from the Integration of Protein Sequences and Drug Chemical Structures , 2017, Molecules.

[3]  Jijun Tang,et al.  Identification of drug-target interactions via multiple information integration , 2017, Inf. Sci..

[4]  Xing Chen,et al.  A Systematic Prediction of Drug-Target Interactions Using Molecular Fingerprints and Protein Sequences. , 2018, Current protein & peptide science.

[5]  Narjes Rohani,et al.  Drug-Drug Interaction Predicting by Neural Network Using Integrated Similarity , 2019, Scientific Reports.

[6]  Min Wu,et al.  Drug-target interaction prediction using ensemble learning and dimensionality reduction. , 2017, Methods.

[7]  Juan José Rodríguez Diez,et al.  Rotation Forest: A New Classifier Ensemble Method , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Chee Keong Kwoh,et al.  Drug-target interaction prediction by learning from local information and neighbors , 2013, Bioinform..

[9]  Yoshihiro Yamanishi,et al.  Drug-target interaction prediction from chemical, genomic and pharmacological data in an integrated framework , 2010, Bioinform..

[10]  Umapada Pal,et al.  LOOP Descriptor: Local Optimal-Oriented Pattern , 2017, IEEE Signal Processing Letters.

[11]  Oksam Chae,et al.  Gender Classification Using Local Directional Pattern (LDP) , 2010, 2010 20th International Conference on Pattern Recognition.

[12]  S F Altschul,et al.  Iterated profile searches with PSI-BLAST--a tool for discovery in protein databases. , 1998, Trends in biochemical sciences.

[13]  Mehmet Gönen,et al.  Predicting drug-target interactions from chemical and genomic kernels using Bayesian matrix factorization , 2012, Bioinform..

[14]  D. E. Clark,et al.  Outstanding challenges in protein–ligand docking and structure‐based virtual screening , 2011 .

[15]  David S. Wishart,et al.  DrugBank: a knowledgebase for drugs, drug actions and drug targets , 2007, Nucleic Acids Res..

[16]  Antje Chang,et al.  BRENDA , the enzyme database : updates and major new developments , 2003 .

[17]  Matti Pietikäinen,et al.  Performance evaluation of texture measures with classification based on Kullback discrimination of distributions , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[18]  Zhu-Hong You,et al.  RFDT: A Rotation Forest-based Predictor for Predicting Drug-Target Interactions Using Drug Structure and Protein Sequence Information. , 2016, Current protein & peptide science.

[19]  Chunyan Miao,et al.  Neighborhood Regularized Logistic Matrix Factorization for Drug-Target Interaction Prediction , 2016, PLoS Comput. Biol..

[20]  Abdollah Dehzangi,et al.  Improving protein fold recognition and structural class prediction accuracies using physicochemical properties of amino acids. , 2016, Journal of theoretical biology.

[21]  A. D. McLachlan,et al.  Profile analysis: detection of distantly related proteins. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Zhu-Hong You,et al.  Incorporating chemical sub-structures and protein evolutionary information for inferring drug-target interactions , 2020, Scientific Reports.

[23]  Ville Ojansivu,et al.  Blur Insensitive Texture Classification Using Local Phase Quantization , 2008, ICISP.

[24]  Michael J. Keiser,et al.  Relating protein pharmacology by ligand chemistry , 2007, Nature Biotechnology.

[25]  Jie Li,et al.  SDTNBI: an integrated network and chemoinformatics tool for systematic prediction of drug–target interactions and drug repositioning , 2016, Briefings Bioinform..

[26]  Xiaobo Zhou,et al.  Semi-supervised drug-protein interaction prediction from heterogeneous biological spaces , 2010, BMC Systems Biology.

[27]  Charles C. Persinger,et al.  How to improve R&D productivity: the pharmaceutical industry's grand challenge , 2010, Nature Reviews Drug Discovery.

[28]  M. Murcko,et al.  Chemogenomic approaches to drug discovery. , 2001, Current opinion in chemical biology.

[29]  Yoshihiro Yamanishi,et al.  Drug Side-Effect Prediction Based on the Integration of Chemical and Biological Spaces , 2012, J. Chem. Inf. Model..

[30]  Robert B. Russell,et al.  SuperTarget and Matador: resources for exploring drug-target relationships , 2007, Nucleic Acids Res..

[31]  Sahand Khakabimamaghani,et al.  Drug-target interaction prediction from PSSM based evolutionary information. , 2016, Journal of pharmacological and toxicological methods.

[32]  Feng Zhu,et al.  Therapeutic target database 2020: enriched resource for facilitating research and early development of targeted therapeutics , 2019, Nucleic Acids Res..

[33]  Siewert J Marrink,et al.  Structural determinants of the supramolecular organization of G protein-coupled receptors in bilayers. , 2012, Journal of the American Chemical Society.

[34]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[35]  David S. Wishart,et al.  DrugBank: a comprehensive resource for in silico drug discovery and exploration , 2005, Nucleic Acids Res..

[36]  Xing-Ming Zhao,et al.  A Survey on the Computational Approaches to Identify Drug Targets in the Postgenomic Era , 2015, BioMed research international.

[37]  Meir Glick,et al.  Prediction of Biological Targets for Compounds Using Multiple-Category Bayesian Models Trained on Chemogenomics Databases , 2006, J. Chem. Inf. Model..

[38]  Dik-Lung Ma,et al.  Drug repositioning by structure-based virtual screening. , 2013, Chemical Society reviews.

[39]  P. Bork,et al.  Systematic identification of proteins that elicit drug side effects , 2013, Molecular systems biology.

[40]  Yong Wang,et al.  Computationally Probing Drug-Protein Interactions Via Support Vector Machine , 2010 .

[41]  Grigorios Tsoumakas,et al.  Predicting Drug-Target Interactions With Multi-Label Classification and Label Partitioning , 2019, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[42]  Lai Wei,et al.  Analysis and prediction of drug–drug interaction by minimum redundancy maximum relevance and incremental feature selection , 2017, Journal of biomolecular structure & dynamics.

[43]  Selma Dere,et al.  Prediction of Drug–Drug Interactions by Using Profile Fingerprint Vectors and Protein Similarities , 2020, Healthcare informatics research.

[44]  Reza Safdari,et al.  Computational prediction of drug-drug interactions based on drugs functional similarities , 2017, J. Biomed. Informatics.

[45]  Kiyoko F. Aoki-Kinoshita,et al.  From genomics to chemical genomics: new developments in KEGG , 2005, Nucleic Acids Res..

[46]  Yanlin Chen,et al.  Manifold regularized matrix factorization for drug-drug interaction prediction , 2018, J. Biomed. Informatics.

[47]  Lu Huang,et al.  Update of TTD: Therapeutic Target Database , 2009, Nucleic Acids Res..

[48]  Andrew L. Hopkins,et al.  Predicting promiscuity , 2009 .

[49]  Chee Keong Kwoh,et al.  Drug-target interaction prediction via class imbalance-aware ensemble learning , 2016, BMC Bioinformatics.

[50]  Michael T. M. Emmerich,et al.  A novel chemogenomics analysis of G protein-coupled receptors (GPCRs) and their ligands: a potential strategy for receptor de-orphanization , 2010, BMC Bioinformatics.

[51]  Lise Getoor,et al.  A probabilistic approach for collective similarity-based drug-drug interaction prediction , 2016, Bioinform..

[52]  Arzucan Özgür,et al.  A comparative study of SMILES-based compound similarity functions for drug-target interaction prediction , 2016, BMC Bioinformatics.

[53]  Umapada Pal,et al.  LOOP Descriptor: Encoding Repeated Local Patterns for Fine-grained Visual Identification of Lepidoptera , 2017, ArXiv.

[54]  Dingfang Li,et al.  Drug-Target Interaction Prediction through Label Propagation with Linear Neighborhood Information , 2017, Molecules.