The scoring bias in reverse docking and the score normalization strategy to improve success rate of target fishing

Target fishing often relies on the use of reverse docking to identify potential target proteins of ligands from protein database. The limitation of reverse docking is the accuracy of current scoring funtions used to distinguish true target from non-target proteins. Many contemporary scoring functions are designed for the virtual screening of small molecules without special optimization for reverse docking, which would be easily influenced by the properties of protein pockets, resulting in scoring bias to the proteins with certain properties. This bias would cause lots of false positives in reverse docking, interferring the identification of true targets. In this paper, we have conducted a large-scale reverse docking (5000 molecules to 100 proteins) to study the scoring bias in reverse docking by DOCK, Glide, and AutoDock Vina. And we found that there were actually some frequency hits, namely interference proteins in all three docking procedures. After analyzing the differences of pocket properties between these interference proteins and the others, we speculated that the interference proteins have larger contact area (related to the size and shape of protein pockets) with ligands (for all three docking programs) or higher hydrophobicity (for Glide), which could be the causes of scoring bias. Then we applied the score normalization method to eliminate this scoring bias, which was effective to make docking score more balanced between different proteins in the reverse docking of benchmark dataset. Later, the Astex Diver Set was utilized to validate the effect of score normalization on actual cases of reverse docking, showing that the accuracy of target prediction significantly increased by 21.5% in the reverse docking by Glide after score normalization, though there was no obvious change in the reverse docking by DOCK and AutoDock Vina. Our results demonstrate the effectiveness of score normalization to eliminate the scoring bias and improve the accuracy of target prediction in reverse docking. Moreover, the properties of protein pockets causing scoring bias to certain proteins we found here can provide the theory basis to further optimize the scoring functions of docking programs for future research.

[1]  Weida Tong,et al.  In silico drug repositioning: what we need to know. , 2013, Drug discovery today.

[2]  Mire Zloh,et al.  Target fishing and docking studies of the novel derivatives of aryl-aminopyridines with potential anticancer activity. , 2012, Bioorganic & medicinal chemistry.

[3]  Richard M. Jackson,et al.  ReverseScreen3D: A Structure-Based Ligand Matching Method To Identify Protein Targets , 2011, J. Chem. Inf. Model..

[4]  C. Ung,et al.  Can an in silico drug-target search method be used to probe potential mechanisms of medicinal plant ingredients? , 2003, Natural product reports.

[5]  B Testa,et al.  In silico pharmacology for drug discovery: applications to targets and beyond , 2007, British journal of pharmacology.

[6]  Anders Wallqvist,et al.  Exploring Polypharmacology Using a ROCS-Based Target Fishing Approach , 2012, J. Chem. Inf. Model..

[7]  Didier Rognan,et al.  Ranking Targets in Structure-Based Virtual Screening of Three-Dimensional Protein Libraries: Methods and Problems , 2008, J. Chem. Inf. Model..

[8]  J. Medina-Franco,et al.  Shifting from the single to the multitarget paradigm in drug discovery. , 2013, Drug discovery today.

[9]  B. Tidor,et al.  Rational Approaches to Improving Selectivity in Drug Design , 2012, Journal of medicinal chemistry.

[10]  Xian Liu,et al.  In Silico target fishing: addressing a “Big Data” problem by ligand-based similarity rankings with data fusion , 2014, Journal of Cheminformatics.

[11]  Sean Ekins,et al.  In silico repositioning of approved drugs for rare and neglected diseases. , 2011, Drug discovery today.

[12]  Eric J. Deeds,et al.  Structural Properties of Non-Traditional Drug Targets Present New Challenges for Virtual Screening , 2013, J. Chem. Inf. Model..

[13]  Conrad C. Huang,et al.  UCSF Chimera—A visualization system for exploratory research and analysis , 2004, J. Comput. Chem..

[14]  A. Bender,et al.  Modeling Promiscuity Based on in vitro Safety Pharmacology Profiling Data , 2007, ChemMedChem.

[15]  R. Rothman,et al.  Serotonergic drugs and valvular heart disease , 2009, Expert opinion on drug safety.

[16]  Maurizio Recanatini,et al.  The role of fragment-based and computational methods in polypharmacology. , 2012, Drug discovery today.

[17]  Byron L. Lam,et al.  Acute Effects of Sildenafil (Viagra) on Blue‐on‐Yellow and White‐on‐White Humphrey Perimetry , 2000, Journal of neuro-ophthalmology : the official journal of the North American Neuro-Ophthalmology Society.

[18]  Didier Rognan,et al.  Structure‐Based Approaches to Target Fishing and Ligand Profiling , 2010, Molecular informatics.

[19]  Michael J. Keiser,et al.  Relating protein pharmacology by ligand chemistry , 2007, Nature Biotechnology.

[20]  Matthias Rarey,et al.  Facing the Challenges of Structure-Based Target Prediction by Inverse Virtual Screening , 2014, J. Chem. Inf. Model..

[21]  Ram Samudrala,et al.  Novel paradigms for drug discovery: computational multitarget screening. , 2008, Trends in pharmacological sciences.

[22]  Arthur J. Olson,et al.  AutoDock Vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading , 2009, J. Comput. Chem..

[23]  Y.Z. Chen,et al.  Ligand–protein inverse docking and its potential use in the computer search of protein targets of a small molecule , 2001, Proteins.

[24]  Matthew P. Repasky,et al.  Glide: a new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy. , 2004, Journal of medicinal chemistry.

[25]  Michael J. Keiser,et al.  Predicting new molecular targets for known drugs , 2009, Nature.

[26]  Xin Chen,et al.  The interprotein scoring noises in glide docking scores , 2012, Proteins.

[27]  Xiaomin Luo,et al.  TarFisDock: a web server for identifying drug targets with docking approach , 2006, Nucleic Acids Res..

[28]  G. V. Paolini,et al.  Global mapping of pharmacological space , 2006, Nature Biotechnology.

[29]  T. Ashburn,et al.  Drug repositioning: identifying and developing new uses for existing drugs , 2004, Nature Reviews Drug Discovery.

[30]  David B Jackson,et al.  Drug profiling: knowing where it hits. , 2010, Drug discovery today.

[31]  B. Roth,et al.  Magic shotguns versus magic bullets: selectively non-selective drugs for mood disorders and schizophrenia , 2004, Nature Reviews Drug Discovery.

[32]  Tom Halgren,et al.  New Method for Fast and Accurate Binding‐site Identification and Analysis , 2007, Chemical biology & drug design.

[33]  Michael M. Mysinger,et al.  Directory of Useful Decoys, Enhanced (DUD-E): Better Ligands and Decoys for Better Benchmarking , 2012, Journal of medicinal chemistry.

[34]  A. Bender,et al.  Analysis of Pharmacology Data and the Prediction of Adverse Drug Reactions and Off‐Target Effects from Chemical Structure , 2007, ChemMedChem.

[35]  Wang,et al.  Proteomics in drug discovery. , 1999, Drug discovery today.

[36]  Michael J. Keiser,et al.  Large Scale Prediction and Testing of Drug Activity on Side-Effect Targets , 2012, Nature.

[37]  Adrià Cereto-Massagué,et al.  Tools for in silico target fishing. , 2015, Methods.

[38]  Y. Z. Chen,et al.  Prediction of potential toxicity and side effect protein targets of a small molecule by a ligand-protein inverse docking approach. , 2001, Journal of molecular graphics & modelling.

[39]  Paul N. Mortenson,et al.  Diverse, high-quality test set for the validation of protein-ligand docking performance. , 2007, Journal of medicinal chemistry.

[40]  Sudipto Mukherjee,et al.  Evaluation of DOCK 6 as a pose generation and database enrichment tool , 2012, Journal of Computer-Aided Molecular Design.

[41]  I. Khanna,et al.  Drug discovery in pharmaceutical industry: productivity challenges and trends. , 2012, Drug discovery today.

[42]  J. Mestres,et al.  On the origins of drug polypharmacology , 2013 .