Improving inverse docking target identification with Z‐score selection

The utilization of inverse docking methods for target identification has been driven by an increasing demand for efficient tools for detecting potential drug side‐effects. Despite impressive achievements in the field of inverse docking, identifying true positives from a pool of potential targets still remains challenging. Notably, most of the developed techniques have low accuracies, limit the pool of possible targets that can be investigated or are not easy to use for non‐experts due to a lack of available scripts or webserver. Guided by our finding that the absolute docking score was a poor indication of a ligand's protein target, we developed a novel “combined Z‐score” method that used a weighted fraction of ligand and receptor‐based Z‐scores to identify the most likely binding target of a ligand. With our combined Z‐score method, an additional 14%, 3.6%, and 6.3% of all ligand–protein pairs of the Astex, DUD, and DUD‐E databases, respectively, were correctly predicted compared to a docking score‐based selection. The combined Z‐score had the highest area under the curve in a ROC curve analysis of all three datasets and the enrichment factor for the top 1% predictions using the combined Z‐score analysis was the highest for the Astex and DUD‐E datasets. Additionally, we developed a user‐friendly python script (compatible with both Python2 and Python3) that enables users to employ the combined Z‐score analysis for target identification using a user‐defined list of ligands and targets. We are providing this python script and a user tutorial as part of the supplemental information.

[1]  Yu Zong Chen,et al.  Drug Adverse Reaction Target Database (DART) , 2003, Drug safety.

[2]  Santosh A. Khedkar,et al.  Successful applications of computer aided drug discovery: moving drugs from concept to the clinic. , 2010, Current topics in medicinal chemistry.

[3]  N. P. Todorov,et al.  Receptor flexibility in de novo ligand design and docking. , 2005, Journal of medicinal chemistry.

[4]  Jonathan P. Davis,et al.  Successful Identification of Cardiac Troponin Calcium Sensitizers Using a Combination of Virtual Screening and ROC Analysis of Known Troponin C Binders , 2017, J. Chem. Inf. Model..

[5]  Kuo-Chen Chou,et al.  Predicting Drugs Side Effects Based on Chemical-Chemical Interactions and Protein-Chemical Interactions , 2013, BioMed research international.

[6]  J. Bajorath,et al.  Docking and scoring in virtual screening for drug discovery: methods and applications , 2004, Nature Reviews Drug Discovery.

[7]  M. Mezei,et al.  Molecular docking: a powerful approach for structure-based drug discovery. , 2011, Current computer-aided drug design.

[8]  Y.Z. Chen,et al.  Ligand–protein inverse docking and its potential use in the computer search of protein targets of a small molecule , 2001, Proteins.

[9]  Xin Chen,et al.  The interprotein scoring noises in glide docking scores , 2012, Proteins.

[10]  Kari Eldal Tvedt,et al.  Facing the challenges? , 2014, Appetite.

[11]  Jacob D. Durrant,et al.  Non-Bisphosphonate Inhibitors of Isoprenoid Biosynthesis Identified via Computer-Aided Drug Design , 2011, Chemical biology & drug design.

[12]  Richard D. Taylor,et al.  Improved protein–ligand docking using GOLD , 2003, Proteins.

[13]  J. Irwin,et al.  Benchmarking sets for molecular docking. , 2006, Journal of medicinal chemistry.

[14]  Yi Wang,et al.  DITOP: drug-induced toxicity related protein database , 2007, Bioinform..

[15]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[16]  J Andrew McCammon,et al.  Farnesyl Diphosphate Synthase Inhibitors from In Silico Screening , 2013, Chemical biology & drug design.

[17]  Matthias Rarey,et al.  Facing the Challenges of Structure-Based Target Prediction by Inverse Virtual Screening , 2014, J. Chem. Inf. Model..

[18]  Mark McGann,et al.  FRED Pose Prediction and Virtual Screening Accuracy , 2011, J. Chem. Inf. Model..

[19]  Xiaomin Luo,et al.  TarFisDock: a web server for identifying drug targets with docking approach , 2006, Nucleic Acids Res..

[20]  Lin He,et al.  SePreSA: a server for the prediction of populations susceptible to serious adverse drug reactions implementing the methodology of a chemical–protein interactome , 2009, Nucleic Acids Res..

[21]  Chenghua Shao,et al.  Trendspotting in the Protein Data Bank , 2013, FEBS letters.

[22]  Anil Vasudevan,et al.  Emerging Approaches for the Identification of Protein Targets of Small Molecules - A Practitioners' Perspective. , 2018, Journal of medicinal chemistry.

[23]  J. Mccammon,et al.  Discovery of Staphylococcus aureus Sortase A Inhibitors Using Virtual Screening and the Relaxed Complex Scheme , 2013, Chemical biology & drug design.

[24]  J. Mccammon,et al.  Accounting for Receptor Flexibility and Enhanced Sampling Methods in Computer‐Aided Drug Design , 2013, Chemical biology & drug design.

[25]  Matthew P. Repasky,et al.  Glide: a new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy. , 2004, Journal of medicinal chemistry.

[26]  Hongbin Huang,et al.  Reverse Screening Methods to Search for the Protein Targets of Chemopreventive Compounds , 2018, Front. Chem..

[27]  Hege S. Beard,et al.  Glide: a new approach for rapid, accurate docking and scoring. 2. Enrichment factors in database screening. , 2004, Journal of medicinal chemistry.

[28]  Arthur J. Olson,et al.  Robust Scoring Functions for Protein-Ligand Interactions with Quantum Chemical Charge Models , 2011, J. Chem. Inf. Model..

[29]  Zhenming Liu,et al.  The scoring bias in reverse docking and the score normalization strategy to improve success rate of target fishing , 2017, PloS one.

[30]  Steven J. M. Jones,et al.  A Computational Approach to Finding Novel Targets for Existing Drugs , 2011, PLoS Comput. Biol..

[31]  Edward W. Lowe,et al.  Computational Methods in Drug Discovery , 2014, Pharmacological Reviews.

[32]  Arthur J. Olson,et al.  AutoDock Vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading , 2009, J. Comput. Chem..

[33]  Paul N. Mortenson,et al.  Diverse, high-quality test set for the validation of protein-ligand docking performance. , 2007, Journal of medicinal chemistry.

[34]  Alexey Lagunin,et al.  ADVERPred-Web Service for Prediction of Adverse Effects of Drugs , 2017, J. Chem. Inf. Model..

[35]  J. Mccammon,et al.  Exploring the role of receptor flexibility in structure-based drug discovery. , 2014, Biophysical chemistry.

[36]  J Andrew McCammon,et al.  Taxodione and arenarone inhibit farnesyl diphosphate synthase by binding to the isopentenyl diphosphate site , 2014, Proceedings of the National Academy of Sciences.

[37]  Michael M. Mysinger,et al.  Directory of Useful Decoys, Enhanced (DUD-E): Better Ligands and Decoys for Better Benchmarking , 2012, Journal of medicinal chemistry.

[38]  D. E. Clark What has computer-aided molecular design ever done for drug discovery? , 2006, Expert opinion on drug discovery.

[39]  Yoshihiro Yamanishi,et al.  Relating drug–protein interaction network with drug side effects , 2012, Bioinform..

[40]  Xiaomin Luo,et al.  PDTD: a web-accessible protein database for drug target identification , 2008, BMC Bioinformatics.

[41]  J M Blaney,et al.  A geometric approach to macromolecule-ligand interactions. , 1982, Journal of molecular biology.

[42]  Michael Pryszlak,et al.  Giving Drugs a Second Chance: Overcoming Regulatory and Financial Hurdles in Repurposing Approved Drugs As Cancer Therapeutics , 2017, Front. Oncol..

[43]  Meekyum Olivia Kim,et al.  A Molecular Dynamics Investigation of Mycobacterium tuberculosis Prenyl Synthases: Conformational Flexibility and Implications for Computer‐aided Drug Discovery , 2015, Chemical biology & drug design.

[44]  P. Clemons,et al.  Target identification and mechanism of action in chemical biology and drug discovery. , 2013, Nature chemical biology.

[45]  Y. Z. Chen,et al.  Prediction of potential toxicity and side effect protein targets of a small molecule by a ligand-protein inverse docking approach. , 2001, Journal of molecular graphics & modelling.

[46]  Jens Meiler,et al.  ROSETTALIGAND: Protein–small molecule docking with full side‐chain flexibility , 2006, Proteins.

[47]  Jung-Hsin Lin,et al.  idTarget: a web server for identifying protein targets of small chemical molecules with robust scoring functions and a divide-and-conquer docking approach , 2012, Nucleic Acids Res..

[48]  Thomas Lengauer,et al.  Evaluation of the FLEXX incremental construction algorithm for protein–ligand docking , 1999, Proteins.

[49]  Gerhard Eisenbrand,et al.  Binding of the potential antitumour agent indirubin-5-sulphonate at the inhibitor site of rabbit muscle glycogen phosphorylase b. Comparison with ligand binding to pCDK2-cyclin A complex. , 2004, European journal of biochemistry.

[50]  Anton Simeonov,et al.  Drug repurposing screens and synergistic drug‐combinations for infectious diseases , 2017, British journal of pharmacology.

[51]  Xiaoqin Zou,et al.  Docking-based inverse virtual screening: methods, applications, and challenges , 2018, Biophysics reports.

[52]  L. Meijer,et al.  Inverse in silico screening for identification of kinase inhibitor targets. , 2007, Chemistry & biology.

[53]  Ruben Abagyan,et al.  ICM—A new method for protein modeling and design: Applications to docking and structure prediction from the distorted native conformation , 1994, J. Comput. Chem..