MedusaScore: An Accurate Force Field-Based Scoring Function for Virtual Drug Screening

Virtual screening is becoming an important tool for drug discovery. However, the application of virtual screening has been limited by the lack of accurate scoring functions. Here, we present a novel scoring function, MedusaScore, for evaluating protein-ligand binding. MedusaScore is based on models of physical interactions that include van der Waals, solvation, and hydrogen bonding energies. To ensure the best transferability of the scoring function, we do not use any protein-ligand experimental data for parameter training. We then test the MedusaScore for docking decoy recognition and binding affinity prediction and find superior performance compared to other widely used scoring functions. Statistical analysis indicates that one source of inaccuracy of MedusaScore may arise from the unaccounted entropic loss upon ligand binding, which suggests avenues of approach for further MedusaScore improvement.

[1]  Shuangye Yin,et al.  Eris: an automated estimator of protein stability , 2007, Nature Methods.

[2]  Anna Marabotti,et al.  Simple, intuitive calculations of free energy of binding for protein-ligand complexes. 1. Models without explicit constrained water. , 2002, Journal of medicinal chemistry.

[3]  Todd J. A. Ewing,et al.  DOCK 4.0: Search strategies for automated molecular docking of flexible molecule databases , 2001, J. Comput. Aided Mol. Des..

[4]  D. Beveridge,et al.  Free energy via molecular simulation: applications to chemical and biomolecular systems. , 1989, Annual review of biophysics and biophysical chemistry.

[5]  Nikolay V. Dokholyan,et al.  Identification and Rational Redesign of Peptide Ligands to CRIP1, A Novel Biomarker for Cancers , 2008, PLoS Comput. Biol..

[6]  Luhua Lai,et al.  Further development and validation of empirical scoring functions for structure-based binding affinity prediction , 2002, J. Comput. Aided Mol. Des..

[7]  Thomas Lengauer,et al.  A fast flexible docking method using an incremental construction algorithm. , 1996, Journal of molecular biology.

[8]  Shaomeng Wang,et al.  An Extensive Test of 14 Scoring Functions Using the PDBbind Refined Set of 800 Protein-Ligand Complexes , 2004, J. Chem. Inf. Model..

[9]  D. Baker,et al.  An orientation-dependent hydrogen bonding potential improves prediction of specificity and structure for proteins and protein-protein complexes. , 2003, Journal of molecular biology.

[10]  Luhua Lai,et al.  SCORE: A New Empirical Method for Estimating the Binding Affinity of a Protein-Ligand Complex , 1998 .

[11]  C L Brooks,et al.  Ligand-protein database: linking protein-ligand complex structures to binding data. , 2001, Journal of medicinal chemistry.

[12]  Natasja Brooijmans,et al.  Molecular recognition and docking algorithms. , 2003, Annual review of biophysics and biomolecular structure.

[13]  A. Tropsha,et al.  Development of quantitative structure-binding affinity relationship models based on novel geometrical chemical descriptors of the protein-ligand interfaces. , 2006, Journal of medicinal chemistry.

[14]  Gennady M Verkhivker,et al.  Molecular recognition of the inhibitor AG-1343 by HIV-1 protease: conformationally flexible docking by evolutionary programming. , 1995, Chemistry & biology.

[15]  G. Klebe,et al.  Knowledge-based scoring function to predict protein-ligand interactions. , 2000, Journal of molecular biology.

[16]  D. Baker,et al.  A simple physical model for binding energy hot spots in protein–protein complexes , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Peter A. Kollman,et al.  FREE ENERGY CALCULATIONS : APPLICATIONS TO CHEMICAL AND BIOCHEMICAL PHENOMENA , 1993 .

[18]  D. Baker,et al.  Design of a Novel Globular Protein Fold with Atomic-Level Accuracy , 2003, Science.

[19]  A. Tropsha,et al.  Beware of q2! , 2002, Journal of molecular graphics & modelling.

[20]  Jens Meiler,et al.  ROSETTALIGAND: Protein–small molecule docking with full side‐chain flexibility , 2006, Proteins.

[21]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[22]  Bethan Hughes,et al.  2007 FDA drug approvals: a year of flux , 2008, Nature Reviews Drug Discovery.

[23]  Feng Ding,et al.  Emergence of Protein Fold Families through Rational Design , 2006, PLoS Comput. Biol..

[24]  Renxiao Wang,et al.  Comparative evaluation of 11 scoring functions for molecular docking. , 2003, Journal of medicinal chemistry.

[25]  P Willett,et al.  Development and validation of a genetic algorithm for flexible docking. , 1997, Journal of molecular biology.

[26]  Ruben Abagyan,et al.  ICM—A new method for protein modeling and design: Applications to docking and structure prediction from the distorted native conformation , 1994, J. Comput. Chem..

[27]  Shaomeng Wang,et al.  How Does Consensus Scoring Work for Virtual Library Screening? An Idealized Computer Experiment , 2001, J. Chem. Inf. Comput. Sci..

[28]  Gisbert Schneider,et al.  Virtual screening and fast automated docking methods. , 2002, Drug discovery today.

[29]  C. E. Peishoff,et al.  A critical assessment of docking programs and scoring functions. , 2006, Journal of medicinal chemistry.

[30]  A. Tropsha,et al.  Beware of q 2 , 2002 .

[31]  G. Klebe,et al.  Statistical potentials and scoring functions applied to protein-ligand binding. , 2001, Current opinion in structural biology.

[32]  Gisbert Schneider,et al.  Computer-based de novo design of drug-like molecules , 2005, Nature Reviews Drug Discovery.

[33]  Hans-Joachim Böhm,et al.  The development of a simple empirical scoring function to estimate the binding constant for a protein-ligand complex of known three-dimensional structure , 1994, J. Comput. Aided Mol. Des..

[34]  F. Ding,et al.  Ab initio folding of proteins with all-atom discrete molecular dynamics. , 2008, Structure.

[35]  J M Blaney,et al.  A geometric approach to macromolecule-ligand interactions. , 1982, Journal of molecular biology.

[36]  M. Gilson,et al.  Ligand configurational entropy and protein binding , 2007, Proceedings of the National Academy of Sciences.

[37]  M. Karplus,et al.  Effective energy function for proteins in solution , 1999, Proteins.

[38]  David S. Goodsell,et al.  Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function , 1998, J. Comput. Chem..

[39]  Y. Martin,et al.  A general and fast scoring function for protein-ligand interactions: a simplified potential approach. , 1999, Journal of medicinal chemistry.

[40]  Peter A. Kollman,et al.  Free energy calculations on protein stability: Thr-157 .fwdarw. Val-157 mutation of T4 lysozyme , 1989 .

[41]  B Coupez,et al.  Docking and scoring--theoretically easy, practically impossible? , 2006, Current medicinal chemistry.

[42]  G. V. Paolini,et al.  Empirical scoring functions: I. The development of a fast empirical scoring function to estimate the binding affinity of ligands in receptor complexes , 1997, J. Comput. Aided Mol. Des..

[43]  Feng Ding,et al.  Modeling backbone flexibility improves protein stability estimation. , 2007, Structure.

[44]  P. A. Bash,et al.  Free energy calculations by computer simulation. , 1987, Science.

[45]  Gerhard Klebe,et al.  Recent developments in structure-based drug design , 2000, Journal of Molecular Medicine.

[46]  J D Dunitz,et al.  Win some, lose some: enthalpy-entropy compensation in weak intermolecular interactions. , 1995, Chemistry & biology.

[47]  Renxiao Wang,et al.  The PDBbind database: methodologies and updates. , 2005, Journal of medicinal chemistry.