DeepDock: Enhancing Ligand-protein Interaction Prediction by a Combination of Ligand and Structure Information

The prediction of precise protein-ligand binding activities can accelerate drug discovery by virtual screening—a computational technique that predicts whether a small molecule ligand is able to bind to a specific target. Thus, it is crucial to improve the performance of virtual screening. However, previous models for solving this problem are either ligand-based or structure-based. In this paper, we propose a universal deep neural network model called DeepDock that predicts protein-ligand interaction by using both ligand and structure information. Using the combination of two types of information, our model consists of embedding, convolution, max pooling, and fully-connected layers. In particular, different types of inputs are concatenated before being fed into the fully-connected layers. In the experiments, we compare our approach to the competing methods against two benchmark datasets under different settings. The experiment results have demonstrated that DeepDock can improve predictive performance by more than 4% on both DUD-E and MUV datasets in terms of AUPR

[1]  Dan Li,et al.  ADMET Evaluation in Drug Discovery. 16. Predicting hERG Blockers by Combining Multiple Pharmacophores and Machine Learning Approaches. , 2016, Molecular pharmaceutics.

[2]  Antonio Lavecchia,et al.  Machine-learning approaches in drug discovery: methods and applications. , 2015, Drug discovery today.

[3]  Gang Fu,et al.  PubChem Substance and Compound databases , 2015, Nucleic Acids Res..

[4]  Jakub M. Tomczak,et al.  Interaction prediction in structure-based virtual screening using deep learning , 2018, Comput. Biol. Medicine.

[5]  F. Lombardo,et al.  Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. , 2001, Advanced drug delivery reviews.

[6]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[7]  Matthew D. Zeiler ADADELTA: An Adaptive Learning Rate Method , 2012, ArXiv.

[8]  Gianni De Fabritiis,et al.  PlayMolecule BindScope: large scale CNN-based virtual screening on the web , 2018, Bioinform..

[9]  Gary B. Fogel,et al.  Computational Intelligence Methods for Docking Scores , 2009 .

[10]  Arthur J. Olson,et al.  AutoDock Vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading , 2009, J. Comput. Chem..

[11]  Hongyi Zhou,et al.  FINDSITEcomb2.0: A New Approach for Virtual Ligand Screening of Proteins and Virtual Target Screening of Biomolecules , 2018, J. Chem. Inf. Model..

[12]  Sebastian G. Rohrer,et al.  Maximum Unbiased Validation (MUV) Data Sets for Virtual Screening Based on PubChem Bioactivity Data , 2009, J. Chem. Inf. Model..

[13]  Izhar Wallach,et al.  AtomNet: A Deep Convolutional Neural Network for Bioactivity Prediction in Structure-based Drug Discovery , 2015, ArXiv.

[14]  David Ryan Koes,et al.  Lessons Learned in Empirical Scoring with smina from the CSAR 2011 Benchmarking Exercise , 2013, J. Chem. Inf. Model..

[15]  Thomas Fox,et al.  Machine learning techniques for in silico modeling of drug metabolism. , 2006, Current topics in medicinal chemistry.

[16]  Brian K. Shoichet,et al.  Virtual screening of chemical libraries , 2004, Nature.

[17]  Cícero Nogueira dos Santos,et al.  Boosting Docking-Based Virtual Screening with Deep Learning , 2016, J. Chem. Inf. Model..

[18]  Hao Ding,et al.  Similarity-based machine learning methods for predicting drug-target interactions: a brief review , 2014, Briefings Bioinform..

[19]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[20]  George Papadatos,et al.  The ChEMBL database in 2017 , 2016, Nucleic Acids Res..

[21]  Alán Aspuru-Guzik,et al.  Convolutional Networks on Graphs for Learning Molecular Fingerprints , 2015, NIPS.

[22]  David S. Wishart,et al.  DrugBank 5.0: a major update to the DrugBank database for 2018 , 2017, Nucleic Acids Res..

[23]  Hao Ding,et al.  Collaborative matrix factorization with multiple similarities for predicting drug-target interactions , 2013, KDD.

[24]  Yoshua Bengio,et al.  Deep Learning of Representations for Unsupervised and Transfer Learning , 2011, ICML Unsupervised and Transfer Learning.

[25]  Ignacio Ponzoni,et al.  Multi‐Objective Feature Selection in QSAR Using a Machine Learning Approach , 2009 .

[26]  David Ryan Koes,et al.  Protein-Ligand Scoring with Convolutional Neural Networks , 2016, Journal of chemical information and modeling.

[27]  Michael M. Mysinger,et al.  Directory of Useful Decoys, Enhanced (DUD-E): Better Ligands and Decoys for Better Benchmarking , 2012, Journal of medicinal chemistry.