Scalable and Accurate Drug–target Prediction Based on Heterogeneous Bio-linked Network Mining

Motivation Despite the existing classification- and inference-based machine learning methods that show promising results in drug-target prediction, these methods possess inevitable limitations, where: 1) results are often biased as it lacks negative samples in the classification-based methods, and 2) novel drug-target associations with new (or isolated) drugs/targets cannot be explored by inference-based methods. As big data continues to boom, there is a need to study a scalable, robust, and accurate solution that can process large heterogeneous datasets and yield valuable predictions. Results We introduce a drug-target prediction method that improved our previously proposed method from the three aspects: 1) we constructed a heterogeneous network which incorporates 12 repositories and includes 7 types of biomedical entities (#20,119 entities, # 194,296 associations), 2) we enhanced the feature learning method with Node2Vec, a scalable state-of-art feature learning method, 3) we integrate the originally proposed inference-based model with a classification model, which is further fine-tuned by a negative sample selection algorithm. The proposed method shows a better result for drug–target association prediction: 95.3% AUC ROC score compared to the existing methods in the 10-fold cross-validation tests. We studied the biased learning/testing in the network-based pairwise prediction, and conclude a best training strategy. Finally, we conducted a disease specific prediction task based on 20 diseases. New drug-target associations were successfully predicted with AUC ROC in average, 97.2% (validated based on the DrugBank 5.1.0). The experiments showed the reliability of the proposed method in predicting novel drug-target associations for the disease treatment.

[1]  J. Mestres,et al.  Drug‐Target Networks , 2010, Molecular informatics.

[2]  Jitender Madan,et al.  Forskolin convalesces memory in high fat diet-induced dementia in wistar rats—Plausible role of pregnane x receptors , 2018, Pharmacological reports : PR.

[3]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[4]  Zhongming Zhao,et al.  A network-based drug repositioning infrastructure for precision cancer medicine through targeting significantly mutated genes in the human cancer genomes , 2016, J. Am. Medical Informatics Assoc..

[5]  Jean-Philippe Vert,et al.  Protein-ligand interaction prediction: an improved chemogenomics approach , 2008, Bioinform..

[6]  Hao Ding,et al.  Similarity-based machine learning methods for predicting drug-target interactions: a brief review , 2014, Briefings Bioinform..

[7]  David S. Wishart,et al.  DrugBank 5.0: a major update to the DrugBank database for 2018 , 2017, Nucleic Acids Res..

[8]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[9]  Daniel R. Caffrey,et al.  Structure-based maximal affinity model predicts small-molecule druggability , 2007, Nature Biotechnology.

[10]  Chuang Liu,et al.  Prediction of Drug-Target Interactions and Drug Repositioning via Network-Based Inference , 2012, PLoS Comput. Biol..

[11]  Charu C. Aggarwal,et al.  Multi-dimensional Graph Convolutional Networks , 2018, SDM.

[12]  Salvatore Alaimo,et al.  Drug–target interaction prediction through domain-tuned network-based inference , 2013, Bioinform..

[13]  Hyeon-Eui Kim,et al.  Deep mining heterogeneous networks of biomedical linked data to predict novel drug‐target associations , 2017, Bioinform..

[14]  Nancy L. Kanagy,et al.  α2-Adrenergic receptor signalling in hypertension , 2005 .

[15]  Ian H. Witten,et al.  WEKA: a machine learning workbench , 1994, Proceedings of ANZIIS '94 - Australian New Zealnd Intelligent Information Systems Conference.

[16]  Ryan A. Rossi,et al.  Deep Inductive Network Representation Learning , 2018, WWW.

[17]  Mingzhe Wang,et al.  LINE: Large-scale Information Network Embedding , 2015, WWW.

[18]  Tudor I. Oprea,et al.  A comprehensive map of molecular drug targets , 2016, Nature Reviews Drug Discovery.

[19]  Kunpeng Zhang,et al.  vec2Link: Unifying Heterogeneous Data for Social Link Prediction , 2018, CIKM.

[20]  Hiroshi Mamitsuka,et al.  A probabilistic model for mining implicit 'chemical compound-gene' relations from literature , 2005, ECCB/JBI.

[21]  Yong-Yeol Ahn,et al.  Optimizing drug–target interaction prediction based on random walk on heterogeneous networks , 2015, Journal of Cheminformatics.

[22]  Ming Wen,et al.  Deep-Learning-Based Drug-Target Interaction Prediction. , 2017, Journal of proteome research.

[23]  Bo Liao,et al.  Screening drug-target interactions with positive-unlabeled learning , 2017, Scientific Reports.

[24]  Xiaochao Ma,et al.  The pregnane X receptor: from bench to bedside. , 2008, Expert opinion on drug metabolism & toxicology.

[25]  A. Barabasi,et al.  Drug—target network , 2007, Nature Biotechnology.

[26]  YamanishiYoshihiro,et al.  Prediction of drug–target interaction networks from the integration of chemical and genomic spaces , 2008 .

[27]  Alexander E. Ivliev,et al.  Drug Target Prediction and Repositioning Using an Integrated Network-Based Approach , 2013, PloS one.

[28]  Jing Li,et al.  Drug Target Predictions Based on Heterogeneous Graph Inference , 2012, Pacific Symposium on Biocomputing.

[29]  A. Fryer,et al.  Muscarinic acetylcholine receptors and airway diseases. , 2003, Pharmacology & therapeutics.

[30]  Ling-Yun Wu,et al.  Semi-supervised Drug-Protein Interaction Prediction from Heterogeneous Spaces , 2009 .

[31]  Vladimir B. Bajic,et al.  DDR: efficient computational method to predict drug–target interactions using graph mining and machine learning approaches , 2017, Bioinform..

[32]  S. Bodine,et al.  World Lung Day 2020 at the Journal of Applied Physiology and the American Journal of Physiology - Lung Cellular and Molecular Physiology. , 2020, American journal of physiology. Lung cellular and molecular physiology.

[33]  Artem Cherkasov,et al.  PADME: A Deep Learning-based Framework for Drug-Target Interaction Prediction , 2018, ArXiv.

[34]  S T Holgate,et al.  Leukotriene antagonists and synthesis inhibitors: new directions in asthma therapy. , 1996, The Journal of allergy and clinical immunology.

[35]  T L Croxton,et al.  Role of M2 muscarinic receptors in airway smooth muscle contraction. , 1999, Life sciences.

[36]  Hui Liu,et al.  Improving compound–protein interaction prediction by building up highly credible negative samples , 2015, Bioinform..

[37]  K. Jellinger,et al.  Prevalence of dementia disorders in the oldest-old: an autopsy study , 2010, Acta Neuropathologica.

[38]  Shi-Hua Zhang,et al.  DrugE-Rank: improving drug–target interaction prediction of new candidate drugs or targets by ensemble learning to rank , 2016, Bioinform..

[39]  Siddharth N. Shah,et al.  Overview of Alpha-blockers in Hypertension: Reappraisal of Perspectives. , 2014, The Journal of the Association of Physicians of India.

[40]  Zhaochun Ren,et al.  Multi-Dimensional Network Embedding with Hierarchical Structure , 2018, WSDM.

[41]  Yoshihiro Yamanishi,et al.  Supervised prediction of drug–target interactions using bipartite local models , 2009, Bioinform..

[42]  P. Barnes,et al.  Characterization of muscarinic receptor subtypes in pig airways: radioligand binding and northern blotting studies. , 1994, The American journal of physiology.

[43]  Philip S. Yu,et al.  Partially Supervised Classification of Text Documents , 2002, ICML.

[44]  Jian Peng,et al.  A Network Integration Approach for Drug-Target Interaction Prediction and Computational Drug Repositioning from Heterogeneous Information , 2017, RECOMB 2017.

[45]  Yongdong Zhang,et al.  Drug-target interaction prediction: databases, web servers and computational models , 2016, Briefings Bioinform..

[46]  Kurt Driessens,et al.  Using Weighted Nearest Neighbor to Benefit from Unlabeled Data , 2006, PAKDD.

[47]  Edda Klipp,et al.  Biochemical network-based drug-target prediction. , 2010, Current opinion in biotechnology.

[48]  Frank J. Gonzalez,et al.  The pregnane X receptor: from bench to bedside , 2008, Expert opinion on drug metabolism & toxicology.

[49]  Yoshihiro Yamanishi,et al.  Prediction of drug–target interaction networks from the integration of chemical and genomic spaces , 2008, ISMB.

[50]  Miia Kivipelto,et al.  Lifestyle interventions to prevent cognitive impairment, dementia and Alzheimer disease , 2018, Nature Reviews Neurology.