Machine Learning Classification Models to Improve the Docking‐based Screening: A Case of PI3K‐Tankyrase Inhibitors

One of the major challenges in the current drug discovery is the improvement of the docking‐based virtual screening performance. It is especially important in the rational design of compounds with desired polypharmacology or selectivity profiles. To address this problem, we present a methodology for the development of target‐specific scoring functions possessing high screening power. These scoring functions were built using the machine learning methods for the dual target inhibitors of PI3Kα and tankyrase, promising targets for colorectal cancer therapy. The Deep Neural Network models achieve the external test AUC ROC values of 0.96 and 0.93 for the random split and 0.90 and 0.84 for the time‐based split of the PI3Kα and tankyrase inhibitors, respectively. In addition, the impact of the training set size and the actives/decoys ratio on the model quality was assessed. The study demonstrates that the optimized scoring functions could significantly improve the docking screening power for each individual target. This is very useful in the design of multitarget or selective drugs wherein the screening filters are applied in sequence.

[1]  Huikun Zhang,et al.  Machine Learning Consensus Scoring Improves Performance Across Targets in Structure-Based Virtual Screening , 2017, J. Chem. Inf. Model..

[2]  Lin-Li Li,et al.  TS‐Chemscore, a Target‐Specific Scoring Function, Significantly Improves the Performance of Scoring in Virtual Screening , 2015, Chemical biology & drug design.

[3]  Christoph A. Sotriffer,et al.  SFCscoreRF: A Random Forest-Based Scoring Function for Improved Affinity Prediction of Protein-Ligand Complexes , 2013, J. Chem. Inf. Model..

[4]  Kai-Cheng Hsu,et al.  Novel Class IIa-Selective Histone Deacetylase Inhibitors Discovered Using an in Silico Virtual Screening Approach , 2017, Scientific Reports.

[5]  W. Yung,et al.  MSK1-Mediated β-Catenin Phosphorylation Confers Resistance to PI3K/mTOR Inhibitors in Glioblastoma , 2016, Molecular Cancer Therapeutics.

[6]  George Papadatos,et al.  Beyond the hype: deep neural networks outperform established methods using a ChEMBL bioactivity benchmark set , 2017, bioRxiv.

[7]  Ashutosh Kumar,et al.  A cross docking pipeline for improving pose prediction and virtual screening performance , 2017, Journal of Computer-Aided Molecular Design.

[8]  J. Skolnick,et al.  Comprehensive prediction of drug-protein interactions and side effects for the human proteome , 2015, Scientific Reports.

[9]  Yu Wang,et al.  A comparative study of family-specific protein–ligand complex affinity prediction based on random forest approach , 2015, Journal of Computer-Aided Molecular Design.

[10]  Max Kuhn,et al.  Building Predictive Models in R Using the caret Package , 2008 .

[11]  Gustavo Henrique Goulart Trossini,et al.  Use of machine learning approaches for novel drug discovery , 2016, Expert opinion on drug discovery.

[12]  S. Krauss,et al.  TANKYRASE Inhibition Enhances the Antiproliferative Effect of PI3K and EGFR Inhibition, Mutually Affecting β-CATENIN and AKT Signaling in Colorectal Cancer , 2017, Molecular Cancer Research.

[13]  Weilin Zhang,et al.  Computational Multitarget Drug Design , 2017, J. Chem. Inf. Model..

[14]  A. Cleton-Jansen,et al.  IWR-1, a tankyrase inhibitor, attenuates Wnt/β-catenin signaling in cancer stem-like cells and inhibits in vivo the growth of a subcutaneous human osteosarcoma xenograft. , 2018, Cancer letters.

[15]  David Ryan Koes,et al.  Lessons Learned in Empirical Scoring with smina from the CSAR 2011 Benchmarking Exercise , 2013, J. Chem. Inf. Model..

[16]  Patrick Aloy,et al.  Structural systems pharmacology: the role of 3D structures in next-generation drug development. , 2013, Chemistry & biology.

[17]  Alexander Tropsha,et al.  Target-Specific Native/Decoy Pose Classifier Improves the Accuracy of Ligand Ranking in the CSAR 2013 Benchmark , 2015, J. Chem. Inf. Model..

[18]  Rodrigo Dienstmann,et al.  Tankyrase Inhibition Blocks Wnt/β-Catenin Pathway and Reverts Resistance to PI3K and AKT Inhibitors in the Treatment of Colorectal Cancer , 2015, Clinical Cancer Research.

[19]  Esben J. Bjerrum,et al.  Machine learning optimization of cross docking accuracy , 2016, Comput. Biol. Chem..

[20]  S. Chiou,et al.  Mutations in PI3K/AKT pathway genes and amplifications of PIK3CA are associated with patterns of recurrence in gastric cancers , 2015, Oncotarget.

[21]  John P. Overington,et al.  Can we rationally design promiscuous drugs? , 2006, Current opinion in structural biology.

[22]  Jie Liu,et al.  Classification of Current Scoring Functions , 2015, J. Chem. Inf. Model..

[23]  Sourav Das,et al.  Binding Affinity Prediction with Property-Encoded Shape Distribution Signatures , 2010, J. Chem. Inf. Model..

[24]  Rodrigo Quiroga,et al.  Vinardo: A Scoring Function Based on Autodock Vina Improves Scoring, Docking, and Virtual Screening , 2016, PloS one.

[25]  Laura Bonati,et al.  Modeling Binding with Large Conformational Changes: Key Points in Ensemble-Docking Approaches , 2017, J. Chem. Inf. Model..

[26]  Robert P. Sheridan,et al.  Time-Split Cross-Validation as a Method for Estimating the Goodness of Prospective Prediction , 2013, J. Chem. Inf. Model..

[27]  Hao Jiang,et al.  Discovery of Novel Disruptor of Silencing Telomeric 1-Like (DOT1L) Inhibitors using a Target-Specific Scoring Function for the (S)-Adenosyl-l-methionine (SAM)-Dependent Methyltransferase Family. , 2017, Journal of medicinal chemistry.

[28]  M. Katoh,et al.  Molecular genetics and targeted therapy of WNT-related human diseases (Review) , 2017, International journal of molecular medicine.

[29]  A. Hopkins Network pharmacology: the next paradigm in drug discovery. , 2008, Nature chemical biology.

[30]  Jacob D. Durrant,et al.  NNScore: A Neural-Network-Based Scoring Function for the Characterization of Protein−Ligand Complexes , 2010, J. Chem. Inf. Model..

[31]  Darren Vs Green,et al.  Virtual screening of chemical libraries for drug discovery , 2008, Expert opinion on drug discovery.

[32]  Arthur J. Olson,et al.  AutoDock Vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading , 2009, J. Comput. Chem..

[33]  Jean Y. J. Wang,et al.  Exploiting the promiscuity of imatinib , 2009, Journal of biology.

[34]  David Ryan Koes,et al.  Protein-Ligand Scoring with Convolutional Neural Networks , 2016, Journal of chemical information and modeling.

[35]  Cheng Wang,et al.  Improving scoring‐docking‐screening powers of protein–ligand scoring functions using random forest , 2017, J. Comput. Chem..

[36]  Cícero Nogueira dos Santos,et al.  Boosting Docking-Based Virtual Screening with Deep Learning , 2016, J. Chem. Inf. Model..

[37]  George Papadatos,et al.  The ChEMBL database in 2017 , 2016, Nucleic Acids Res..

[38]  K. Y. Tsang,et al.  Synthesis and biological evaluation of sulfonamide analogues of the phosphatidylinositol 3-kinase inhibitor ZSTK474. , 2017, Bioorganic & medicinal chemistry.

[39]  Michael M. Mysinger,et al.  Directory of Useful Decoys, Enhanced (DUD-E): Better Ligands and Decoys for Better Benchmarking , 2012, Journal of medicinal chemistry.

[40]  C. Porta,et al.  Targeting PI3K/Akt/mTOR Signaling in Cancer , 2014, Front. Oncol..

[41]  Miao Yu,et al.  Discovering new PI3Kα inhibitors with a strategy of combining ligand-based and structure-based virtual screening , 2018, Journal of Computer-Aided Molecular Design.

[42]  M L Bolognesi,et al.  Polypharmacology in a single drug: multitarget drugs. , 2013, Current medicinal chemistry.

[43]  Pedro J. Ballester,et al.  Performance of machine-learning scoring functions in structure-based virtual screening , 2017, Scientific Reports.

[44]  J B Brown,et al.  Classifiers and their Metrics Quantified , 2018, Molecular informatics.

[45]  A. Barabasi,et al.  Drug—target network , 2007, Nature Biotechnology.

[46]  W. Weis,et al.  The β-catenin destruction complex. , 2013, Cold Spring Harbor perspectives in biology.

[47]  Conrad C. Huang,et al.  UCSF Chimera—A visualization system for exploratory research and analysis , 2004, J. Comput. Chem..