Integrating multi-scale neighbouring topologies and cross-modal similarities for drug-protein interaction prediction

MOTIVATION Identifying the proteins that interact with drugs can reduce the cost and time of drug development. Existing computerized methods focus on integrating drug-related and protein-related data from multiple sources to predict candidate drug-target interactions (DTIs). However, multi-scale neighboring node sequences and various kinds of drug and protein similarities are neither fully explored nor considered in decision making. RESULTS We propose a drug-target interaction prediction method, DTIP, to encode and integrate multi-scale neighbouring topologies, multiple kinds of similarities, associations, interactions related to drugs and proteins. We firstly construct a three-layer heterogeneous network to represent interactions and associations across drug, protein, and disease nodes. Then a learning framework based on fully-connected autoencoder is proposed to learn the nodes' low-dimensional feature representations within the heterogeneous network. Secondly, multi-scale neighbouring sequences of drug and protein nodes are formulated by random walks. A module based on bidirectional gated recurrent unit is designed to learn the neighbouring sequential information and integrate the low-dimensional features of nodes. Finally, we propose attention mechanisms at feature level, neighbouring topological level and similarity level to learn more informative features, topologies and similarities. The prediction results are obtained by integrating neighbouring topologies, similarities and feature attributes using a multiple layer CNN. Comprehensive experimental results over public dataset demonstrated the effectiveness of our innovative features and modules. Comparison with other state-of-the-art methods and case studies of five drugs further validated DTIP's ability in discovering the potential candidate drug-related proteins.

[1]  Russ B Altman,et al.  Graph Convolutional Neural Networks for Predicting Drug-Target Interactions , 2019, J. Chem. Inf. Model..

[2]  Chang Sun,et al.  Gradient Boosting Decision Tree-Based Method for Predicting Interactions Between Target Genes and Drugs , 2019, Front. Genet..

[3]  Xiaoping Zheng,et al.  DTI-RCNN: New Efficient Hybrid Neural Network Model to Predict Drug-Target Interactions , 2018, ICANN.

[4]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[5]  Yoshihiro Yamanishi,et al.  Supervised prediction of drug–target interactions using bipartite local models , 2009, Bioinform..

[6]  Jijun Tang,et al.  Identification of drug-target interactions via multiple information integration , 2017, Inf. Sci..

[7]  D. di Bernardo,et al.  Identification of small molecules enhancing autophagic function from drug network analysis. , 2010, Autophagy.

[8]  Takaya Saito,et al.  The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets , 2015, PloS one.

[9]  Chee Keong Kwoh,et al.  Drug-Target Interaction Prediction with Graph Regularized Matrix Factorization , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[10]  Ping Xuan,et al.  Prediction of Drug–Target Interactions Based on Network Representation Learning and Ensemble Learning , 2020, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[11]  Tapio Pahikkala,et al.  Toward more realistic drug^target interaction predictions , 2014 .

[12]  Jijun Tang,et al.  Identification of Protein-Ligand Binding Sites by Sequence Information and Ensemble Classifier , 2017, J. Chem. Inf. Model..

[13]  Yun Xie,et al.  Identification of drug-target interaction from interactome network with 'guilt-by-association' principle and topology features , 2016, Bioinform..

[14]  Jian Peng,et al.  A network integration approach for drug-target interaction prediction and computational drug repositioning from heterogeneous information , 2017, Nature Communications.

[15]  Kayvan Najarian,et al.  Machine learning approaches and databases for prediction of drug–target interaction: a survey paper , 2020, Briefings Bioinform..

[16]  Jin Zhao,et al.  Drug repositioning based on triangularly balanced structure for tissue-specific diseases in incomplete interactome , 2017, Artif. Intell. Medicine.

[17]  Hai-Cheng Yi,et al.  Prediction of Drug–Target Interactions From Multi-Molecular Network Based on Deep Walk Embedding Model , 2020, Frontiers in Bioengineering and Biotechnology.

[18]  The UniProt Consortium,et al.  UniProt: a worldwide hub of protein knowledge , 2018, Nucleic Acids Res..

[19]  K. Hajian‐Tilaki,et al.  Receiver Operating Characteristic (ROC) Curve Analysis for Medical Diagnostic Test Evaluation. , 2013, Caspian journal of internal medicine.

[20]  Vladimir B. Bajic,et al.  DDR: efficient computational method to predict drug–target interactions using graph mining and machine learning approaches , 2017, Bioinform..

[21]  Xiao-Ying Yan,et al.  Prediction of drug-target interaction by integrating diverse heterogeneous information source with multiple kernel learning and clustering methods , 2019, Comput. Biol. Chem..

[22]  Huiyuan Chen,et al.  iDrug: Integration of drug repositioning and drug-target prediction via cross-network embedding , 2020, PLoS Comput. Biol..

[23]  Tudor I. Oprea,et al.  DrugCentral: online drug compendium , 2016, Nucleic Acids Res..

[24]  Yongdong Zhang,et al.  Drug-target interaction prediction: databases, web servers and computational models , 2016, Briefings Bioinform..

[25]  Xiangxiang Zeng,et al.  Probability-based collaborative filtering model for predicting gene–disease associations , 2017, BMC Medical Genomics.

[26]  Xing Chen,et al.  Drug-target interaction prediction by random walk on the heterogeneous network. , 2012, Molecular bioSystems.

[27]  Mehrdad Nourani,et al.  Graph Convolutional Networks for Predicting Drug-Protein Interactions , 2019, 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[28]  David S. Wishart,et al.  DrugBank 5.0: a major update to the DrugBank database for 2018 , 2017, Nucleic Acids Res..

[29]  David S. Goodsell,et al.  AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility , 2009, J. Comput. Chem..

[30]  Thomas C. Wiegers,et al.  The Comparative Toxicogenomics Database: update 2013 , 2012, Nucleic Acids Res..

[31]  Sandhya Rani,et al.  Human Protein Reference Database—2009 update , 2008, Nucleic Acids Res..

[32]  Hojung Nam,et al.  SELF-BLM: Prediction of drug-target interactions via self-training SVM , 2017, PloS one.

[33]  Xiang Zhang,et al.  Drug repositioning by integrating target information through a heterogeneous network model , 2014, Bioinform..

[34]  Ping Xuan,et al.  Graph Convolutional Autoencoder and Generative Adversarial Network-Based Method for Predicting Drug-Target Interactions , 2020, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[35]  Hojung Nam,et al.  Identification of drug-target interaction by a random walk with restart method on an interactome network , 2018, BMC Bioinformatics.

[36]  Jijun Tang,et al.  Identification of drug-side effect association via multiple information integration with centered kernel alignment , 2019, Neurocomputing.

[37]  Xinying Xu,et al.  An Ameliorated Prediction of Drug–Target Interactions Based on Multi-Scale Discrete Wavelet Transform and Network Features , 2017, International journal of molecular sciences.

[38]  Andrew R. Leach,et al.  Large scale comparison of QSAR and conformal prediction methods and their applications in drug discovery , 2019, Journal of Cheminformatics.

[39]  Lin Gao,et al.  Inferring drug-disease associations based on known protein complexes , 2015, BMC Medical Genomics.

[40]  Zheng Wang,et al.  Drug-Target Interaction Prediction Based on Drug Fingerprint Information and Protein Sequence , 2019, Molecules.

[41]  Q. Zou,et al.  Similarity computation strategies in the microRNA-disease network: a survey. , 2015, Briefings in functional genomics.