DTI-HeNE: a novel method for drug-target interaction prediction based on heterogeneous network embedding

Background Prediction of the drug-target interaction (DTI) is a critical step in the drug repurposing process, which can effectively reduce the following workload for experimental verification of potential drugs’ properties. In recent studies, many machine-learning-based methods have been proposed to discover unknown interactions between drugs and protein targets. A recent trend is to use graph-based machine learning, e.g., graph embedding to extract features from drug-target networks and then predict new drug-target interactions. However, most of the graph embedding methods are not specifically designed for DTI predictions; thus, it is difficult for these methods to fully utilize the heterogeneous information of drugs and targets (e.g., the respective vertex features of drugs and targets and path-based interactive features between drugs and targets). Results We propose a DTI prediction method DTI-HeNE (DTI based on Heterogeneous Network Embedding), which is specifically designed to cope with the bipartite DTI relations for generating high-quality embeddings of drug-target pairs. This method splits a heterogeneous DTI network into a bipartite DTI network, multiple drug homogeneous networks and target homogeneous networks, and extracts features from these sub-networks separately to better utilize the characteristics of bipartite DTI relations as well as the auxiliary similarity information related to drugs and targets. The features extracted from each sub-network are integrated using pathway information between these sub-networks to acquire new features, i.e., embedding vectors of drug-target pairs. Finally, these features are fed into a random forest (RF) model to predict novel DTIs. Conclusions Our experimental results show that, the proposed DTI network embedding method can learn higher-quality features of heterogeneous drug-target interaction networks for novel DTIs discovery.

[1]  Vladimir B. Bajic,et al.  DDR: efficient computational method to predict drug–target interactions using graph mining and machine learning approaches , 2017, Bioinform..

[2]  M. Simmaco,et al.  The future of pharmacogenetics in the treatment of migraine. , 2019, Pharmacogenomics.

[3]  Chee Keong Kwoh,et al.  Drug-target interaction prediction by learning from local information and neighbors , 2013, Bioinform..

[4]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[5]  Bruce Randall Donald,et al.  Algorithms in Structural Molecular Biology , 2011 .

[6]  A. Bhardwaj,et al.  In situ click chemistry generation of cyclooxygenase-2 inhibitors , 2017, Nature Communications.

[7]  David S. Wishart,et al.  DrugBank: a knowledgebase for drugs, drug actions and drug targets , 2007, Nucleic Acids Res..

[8]  Tapio Pahikkala,et al.  Toward more realistic drug^target interaction predictions , 2014 .

[9]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[10]  Robert B. Russell,et al.  SuperTarget and Matador: resources for exploring drug-target relationships , 2007, Nucleic Acids Res..

[11]  Fang-Xiang Wu,et al.  MDIPA: a microRNA-drug interaction prediction approach based on non-negative matrix factorization , 2020, Bioinform..

[12]  Xuequn Shang,et al.  A learning-based method for drug-target interaction prediction based on feature representation learning and deep neural network , 2020, BMC Bioinformatics.

[13]  Ming Gao,et al.  BiNE: Bipartite Network Embedding , 2018, SIGIR.

[14]  Fang-Xiang Wu,et al.  Predicting drug-target interaction based on sequence and structure information , 2015 .

[15]  Yutaka Akiyama,et al.  NRLMFβ: Beta-distribution-rescored neighborhood regularized logistic matrix factorization for improving the performance of drug–target interaction prediction , 2019, Biochemistry and biophysics reports.

[16]  David S. Wishart,et al.  T3DB: the toxic exposome database , 2014, Nucleic Acids Res..

[17]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[18]  G. Schneider,et al.  New use of an old drug: inhibition of breast cancer stem cells by benztropine mesylate , 2016, Oncotarget.

[19]  Saturnino Luz,et al.  A Network-Based Embedding Method for Drug-Target Interaction Prediction , 2020, 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC).

[20]  Ivan G. Costa,et al.  A multiple kernel learning algorithm for drug-target interaction prediction , 2016, BMC Bioinformatics.

[21]  Jian Peng,et al.  A Network Integration Approach for Drug-Target Interaction Prediction and Computational Drug Repositioning from Heterogeneous Information , 2017, RECOMB 2017.

[22]  Mehrdad Nourani,et al.  Drug-target interaction prediction using semi-bipartite graph model and deep learning , 2020, BMC Bioinformatics.

[23]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[24]  Chao Li,et al.  HetNERec: Heterogeneous network embedding based recommendation , 2020, Knowl. Based Syst..

[25]  Zhiyong Lu,et al.  A survey of current trends in computational drug repositioning , 2016, Briefings Bioinform..

[26]  J. Reynolds Martindale : the extra pharmacopoeia , 1972 .

[27]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[28]  Zhuowen Tu,et al.  Similarity network fusion for aggregating data types on a genomic scale , 2014, Nature Methods.

[29]  Xing Chen miREFRWR: a novel disease-related microRNA-environmental factor interactions prediction method. , 2016, Molecular bioSystems.

[30]  Hilde van der Togt,et al.  Publisher's Note , 2003, J. Netw. Comput. Appl..

[31]  Xing Chen,et al.  MicroRNA-small molecule association identification: from experimental results to computational models , 2018, Briefings Bioinform..

[32]  Qiao Zhu,et al.  GRTR: Drug-Disease Association Prediction Based on Graph Regularized Transductive Regression on Heterogeneous Network , 2018, ISBRA.

[33]  Jia Qu,et al.  RFSMMA: A New Computational Model to Identify and Prioritize Potential Small Molecule-MiRNA Associations , 2019, J. Chem. Inf. Model..

[34]  Alexander Gammerman,et al.  Learning by Transduction , 1998, UAI.

[35]  M. Bruce MacIver,et al.  Carbachol-induced EEG ‘theta’ activity in hippocampal brain slices , 1987, Brain Research.

[36]  Jianye Hao,et al.  An end-to-end heterogeneous graph representation learning-based framework for drug-target interaction prediction , 2021, Briefings Bioinform..

[37]  Min Li,et al.  HNEDTI: Prediction of drug-target interaction based on heterogeneous network embedding , 2019, 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[38]  Michael R. Lyu,et al.  A generalized Co-HITS algorithm and its application to bipartite graphs , 2009, KDD.

[39]  John P. Overington,et al.  ChEMBL: a large-scale bioactivity database for drug discovery , 2011, Nucleic Acids Res..

[40]  Xiaobo Zhou,et al.  Semi-supervised drug-protein interaction prediction from heterogeneous biological spaces , 2010, BMC Systems Biology.

[41]  MeiJian-Ping,et al.  Drug–target interaction prediction by learning from local information and neighbors , 2013 .

[42]  Hao Ding,et al.  Collaborative matrix factorization with multiple similarities for predicting drug-target interactions , 2013, KDD.

[43]  Xing Chen,et al.  Drug-pathway association prediction: from experimental results to computational models , 2020, Briefings Bioinform..

[44]  Thorsten Joachims,et al.  Transductive Learning via Spectral Graph Partitioning , 2003, ICML.

[45]  Yongdong Zhang,et al.  Drug-target interaction prediction: databases, web servers and computational models , 2016, Briefings Bioinform..

[46]  Xiangliang Zhang,et al.  ActiveHNE: Active Heterogeneous Network Embedding , 2019, IJCAI.

[47]  Michael J. Keiser,et al.  Relating protein pharmacology by ligand chemistry , 2007, Nature Biotechnology.

[48]  Xin Gao,et al.  DTiGEMS+: drug–target interaction prediction using graph embedding, graph mining, and similarity-based techniques , 2020, Journal of Cheminformatics.

[49]  Elena Marchiori,et al.  Gaussian interaction profile kernels for predicting drug-target interaction , 2011, Bioinform..

[50]  Yoshihiro Yamanishi,et al.  Supervised prediction of drug–target interactions using bipartite local models , 2009, Bioinform..

[51]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[52]  Konstantinos Pliakos,et al.  Drug-target interaction prediction with tree-ensemble learning and output space reconstruction , 2020, BMC Bioinformatics.

[53]  Zi Huang,et al.  Joint Event-Partner Recommendation in Event-Based Social Networks , 2018, 2018 IEEE 34th International Conference on Data Engineering (ICDE).

[54]  Thomas C. Wiegers,et al.  The Comparative Toxicogenomics Database: update 2017 , 2016, Nucleic Acids Res..

[55]  Min Li,et al.  NEDD: a network embedding based method for predicting drug-disease associations , 2020, BMC Bioinformatics.

[56]  Sun-Yuan Kung,et al.  Transductive Learning for Multi-Label Protein Subchloroplast Localization Prediction , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[57]  Yoshihiro Yamanishi,et al.  Prediction of drug–target interaction networks from the integration of chemical and genomic spaces , 2008, ISMB.