Scoring functions for prediction of protein-ligand interactions.

The scoring functions for protein-ligand interactions plays central roles in computational drug design, virtual screening of chemical libraries for new lead identification, and prediction of possible binding targets of small chemical molecules. An ideal scoring function for protein-ligand interactions is expected to be able to recognize the native binding pose of a ligand on the protein surface among decoy poses, and to accurately predict the binding affinity (or binding free energy) so that the active molecules can be discriminated from the non-active ones. Due to the empirical nature of most, if not all, scoring functions for protein-ligand interactions, the general applicability of empirical scoring functions, especially to domains far outside training sets, is a major concern. In this review article, we will explore the foundations of different classes of scoring functions, their possible limitations, and their suitable application domains. We also provide assessments of several scoring functions on weakly-interacting protein-ligand complexes, which will be useful information in computational fragment-based drug design or virtual screening.

[1]  Renxiao Wang,et al.  The PDBbind database: methodologies and updates. , 2005, Journal of medicinal chemistry.

[2]  Gerhard Klebe,et al.  SFCscore: Scoring functions for affinity prediction of protein–ligand complexes , 2008, Proteins.

[3]  W Patrick Walters,et al.  A detailed comparison of current docking and scoring methods on systems of pharmaceutical relevance , 2004, Proteins.

[4]  G. Klebe,et al.  DrugScore(CSD)-knowledge-based scoring function derived from small molecule crystal data with superior recognition rate of near-native ligand poses and better affinity prediction. , 2005, Journal of medicinal chemistry.

[5]  J. Mccammon,et al.  Computational drug design accommodating receptor flexibility: the relaxed complex scheme. , 2002, Journal of the American Chemical Society.

[6]  Arthur J. Olson,et al.  Robust Scoring Functions for Protein-Ligand Interactions with Quantum Chemical Charge Models , 2011, J. Chem. Inf. Model..

[7]  Zhihai Liu,et al.  Comparative Assessment of Scoring Functions on a Diverse Test Set , 2009, J. Chem. Inf. Model..

[8]  Maria Kontoyianni,et al.  Evaluation of docking performance: comparative data on docking algorithms. , 2004, Journal of medicinal chemistry.

[9]  R. Glen,et al.  Molecular recognition of receptor sites using a genetic algorithm with a description of desolvation. , 1995, Journal of molecular biology.

[10]  Brian K. Shoichet,et al.  Statistical Potential for Modeling and Ranking of Protein-Ligand Interactions , 2011, J. Chem. Inf. Model..

[11]  David S. Goodsell,et al.  A semiempirical free energy force field with charge‐based desolvation , 2007, J. Comput. Chem..

[12]  Richard D. Smith,et al.  CSAR Benchmark Exercise of 2010: Combined Evaluation Across All Submitted Scoring Functions , 2011, J. Chem. Inf. Model..

[13]  Christopher W. Murray,et al.  Empirical scoring functions. II. The testing of an empirical scoring function for the prediction of ligand-receptor binding affinities and the use of Bayesian regression to improve the quality of the model , 1998, J. Comput. Aided Mol. Des..

[14]  Hans-Joachim Böhm,et al.  The development of a simple empirical scoring function to estimate the binding constant for a protein-ligand complex of known three-dimensional structure , 1994, J. Comput. Aided Mol. Des..

[15]  Matthias Rarey,et al.  Towards an Integrated Description of Hydrogen Bonding and Dehydration: Decreasing False Positives in Virtual Screening with the HYDE Scoring Function , 2008, ChemMedChem.

[16]  I. Kuntz,et al.  Automated docking with grid‐based energy evaluation , 1992 .

[17]  Li Xing,et al.  Evaluation and application of multiple scoring functions for a virtual screening experiment , 2004, J. Comput. Aided Mol. Des..

[18]  I. Muegge Effect of ligand volume correction on PMF scoring , 2001, J. Comput. Chem..

[19]  Jung-Hsin Lin Accommodating protein flexibility for structure-based drug design. , 2011, Current topics in medicinal chemistry.

[20]  I. Muegge A knowledge-based scoring function for protein-ligand interactions: Probing the reference state , 2000 .

[21]  U. Singh,et al.  A NEW FORCE FIELD FOR MOLECULAR MECHANICAL SIMULATION OF NUCLEIC ACIDS AND PROTEINS , 1984 .

[22]  Zhihai Liu,et al.  Evaluation of the performance of four molecular docking programs on a diverse set of protein‐ligand complexes , 2010, J. Comput. Chem..

[23]  Matthew P. Repasky,et al.  Glide: a new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy. , 2004, Journal of medicinal chemistry.

[24]  E. Jaeger,et al.  Comparison of automated docking programs as virtual screening tools. , 2005, Journal of Medicinal Chemistry.

[25]  Sourav Das,et al.  Binding Affinity Prediction with Property-Encoded Shape Distribution Signatures , 2010, J. Chem. Inf. Model..

[26]  P. Hajduk,et al.  Evaluation of PMF scoring in docking weak ligands to the FK506 binding protein. , 1999, Journal of medicinal chemistry.

[27]  Xiaoqin Zou,et al.  An iterative knowledge‐based scoring function to predict protein–ligand interactions: II. Validation of the scoring function , 2006, J. Comput. Chem..

[28]  C. E. Peishoff,et al.  A critical assessment of docking programs and scoring functions. , 2006, Journal of medicinal chemistry.

[29]  Peter Gedeck,et al.  Leave-Cluster-Out Cross-Validation Is Appropriate for Scoring Functions Derived from Diverse Protein Data Sets , 2010, J. Chem. Inf. Model..

[30]  C L Brooks,et al.  Ligand-protein database: linking protein-ligand complex structures to binding data. , 2001, Journal of medicinal chemistry.

[31]  Gennady M Verkhivker,et al.  Molecular recognition of the inhibitor AG-1343 by HIV-1 protease: conformationally flexible docking by evolutionary programming. , 1995, Chemistry & biology.

[32]  Xin Wen,et al.  BindingDB: a web-accessible database of experimentally determined protein–ligand binding affinities , 2006, Nucleic Acids Res..

[33]  Zhilong Xiu,et al.  Rescoring ligand docking poses. , 2010, Current opinion in drug discovery & development.

[34]  A. Fersht,et al.  Hydrogen bonding and biological specificity analysed by protein engineering , 1985, Nature.

[35]  Yanli Wang,et al.  Structure-Based Virtual Screening for Drug Discovery: a Problem-Centric Review , 2012, The AAPS Journal.

[36]  Elizabeth Yuriev,et al.  Challenges and advances in computational docking: 2009 in review , 2011, Journal of molecular recognition : JMR.

[37]  Richard D. Smith,et al.  Binding MOAD, a high-quality protein–ligand database , 2007, Nucleic Acids Res..

[38]  John B. O. Mitchell,et al.  Predicting protein-ligand binding affinities: a low scoring game? , 2004, Organic & biomolecular chemistry.

[39]  Xiaoqin Zou,et al.  Scoring functions and their evaluation methods for protein-ligand docking: recent advances and future directions. , 2010, Physical chemistry chemical physics : PCCP.

[40]  M Rarey,et al.  Detailed analysis of scoring functions for virtual screening. , 2001, Journal of medicinal chemistry.

[41]  P. Kollman,et al.  A well-behaved electrostatic potential-based method using charge restraints for deriving atomic char , 1993 .

[42]  M. Karplus,et al.  Effective energy function for proteins in solution , 1999, Proteins.

[43]  B Honig,et al.  Extracting hydrophobic free energies from experimental data: relationship to protein folding and theoretical models. , 1991, Biochemistry.

[44]  Xiaoqin Zou,et al.  Construction and Test of Ligand Decoy Sets Using MDock: Community Structure-Activity Resource Benchmarks for Binding Mode Prediction , 2011, J. Chem. Inf. Model..

[45]  Matthew P. Repasky,et al.  Extra precision glide: docking and scoring incorporating a model of hydrophobic enclosure for protein-ligand complexes. , 2006, Journal of medicinal chemistry.

[46]  I. Kuntz,et al.  Ligand solvation in molecular docking , 1999, Proteins.

[47]  Luhua Lai,et al.  Further development and validation of empirical scoring functions for structure-based binding affinity prediction , 2002, J. Comput. Aided Mol. Des..

[48]  Y. Martin,et al.  A general and fast scoring function for protein-ligand interactions: a simplified potential approach. , 1999, Journal of medicinal chemistry.

[49]  Araz Jakalian,et al.  Fast, efficient generation of high‐quality atomic charges. AM1‐BCC model: I. Method , 2000 .

[50]  Renxiao Wang,et al.  The PDBbind database: collection of binding affinities for protein-ligand complexes with known three-dimensional structures. , 2004, Journal of medicinal chemistry.

[51]  John B. O. Mitchell,et al.  Comments on "Leave-Cluster-Out Cross-Validation Is Appropriate for Scoring Functions Derived from Diverse Protein Data Sets": Significance for the Validation of Scoring Functions , 2011, J. Chem. Inf. Model..

[52]  Didier Rognan,et al.  Comparative evaluation of eight docking tools for docking and virtual screening accuracy , 2004, Proteins.

[53]  G. Klebe,et al.  Knowledge-based scoring function to predict protein-ligand interactions. , 2000, Journal of molecular biology.

[54]  Thomas Stützle,et al.  Empirical Scoring Functions for Advanced Protein-Ligand Docking with PLANTS , 2009, J. Chem. Inf. Model..

[55]  D. J. Price,et al.  Assessing scoring functions for protein-ligand interactions. , 2004, Journal of medicinal chemistry.

[56]  M. Sippl Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. , 1990, Journal of molecular biology.

[57]  Richard A. Friesner,et al.  Comparative Performance of Several Flexible Docking Programs and Scoring Functions: Enrichment Studies for a Diverse Set of Pharmaceutically Relevant Targets , 2007, J. Chem. Inf. Model..

[58]  Holger Claussen,et al.  Substantial improvements in large-scale redocking and screening using the novel HYDE scoring function , 2012, Journal of Computer-Aided Molecular Design.

[59]  Hans-Joachim Böhm,et al.  Prediction of binding constants of protein ligands: A fast method for the prioritization of hits obtained from de novo design or 3D database search programs , 1998, J. Comput. Aided Mol. Des..

[60]  G. Klebe,et al.  Approaches to the description and prediction of the binding affinity of small-molecule ligands to macromolecular receptors. , 2002, Angewandte Chemie.

[61]  G. V. Paolini,et al.  Empirical scoring functions: I. The development of a fast empirical scoring function to estimate the binding affinity of ligands in receptor complexes , 1997, J. Comput. Aided Mol. Des..

[62]  Renxiao Wang,et al.  Comparative evaluation of 11 scoring functions for molecular docking. , 2003, Journal of medicinal chemistry.

[63]  P Willett,et al.  Development and validation of a genetic algorithm for flexible docking. , 1997, Journal of molecular biology.

[64]  Richard D. Smith,et al.  CSAR Benchmark Exercise of 2010: Selection of the Protein–Ligand Complexes , 2011, J. Chem. Inf. Model..

[65]  Nicolas Moitessier,et al.  Docking Ligands into Flexible and Solvated Macromolecules. 5. Force-Field-Based Prediction of Binding Affinities of Ligands to Proteins , 2009, J. Chem. Inf. Model..

[66]  Jung-Hsien Chiang,et al.  AutoBind: automatic extraction of protein-ligand-binding affinity data from biological literature , 2012, Bioinform..

[67]  John B. O. Mitchell,et al.  A machine learning approach to predicting protein-ligand binding affinity with applications to molecular docking , 2010, Bioinform..

[68]  Jin Li,et al.  On Evaluating Molecular-Docking Methods for Pose Prediction and Enrichment Factors , 2006, J. Chem. Inf. Model..

[69]  M. Karplus,et al.  Simulation of activation free energies in molecular systems , 1996 .

[70]  Manfred J. Sippl,et al.  Boltzmann's principle, knowledge-based mean fields and protein folding. An approach to the computational determination of protein structures , 1993, J. Comput. Aided Mol. Des..

[71]  C. Pace,et al.  Contribution of hydrogen bonding to the conformational stability of ribonuclease T1. , 1992, Biochemistry.

[72]  Feng Ding,et al.  Correction: Emergence of Protein Fold Families through Rational Design , 2006, PLoS Comput. Biol..

[73]  Christopher R. Corbeil,et al.  Towards the development of universal, fast and highly accurate docking/scoring methods: a long way to go , 2008, British journal of pharmacology.

[74]  Ruben Abagyan,et al.  Comparative study of several algorithms for flexible ligand docking , 2003, J. Comput. Aided Mol. Des..

[75]  Dudley H. Williams,et al.  The cost of conformational order: entropy changes in molecular associations , 1992 .

[76]  Nikolay V. Dokholyan,et al.  MedusaScore: An Accurate Force Field-Based Scoring Function for Virtual Drug Screening , 2008, J. Chem. Inf. Model..

[77]  Natasja Brooijmans,et al.  Molecular recognition and docking algorithms. , 2003, Annual review of biophysics and biomolecular structure.

[78]  Feng Ding,et al.  Emergence of Protein Fold Families through Rational Design , 2006, PLoS Comput. Biol..

[79]  Ming-Jing Hwang,et al.  An interaction-motif-based scoring function for protein-ligand docking , 2010, BMC Bioinformatics.

[80]  D. Baker,et al.  An orientation-dependent hydrogen bonding potential improves prediction of specificity and structure for proteins and protein-protein complexes. , 2003, Journal of molecular biology.

[81]  Ajay N. Jain,et al.  Scoring functions for protein-ligand docking. , 2006, Current protein & peptide science.

[82]  Douglas M. Hawkins,et al.  The Problem of Overfitting , 2004, J. Chem. Inf. Model..

[83]  D. Rognan,et al.  Protein-based virtual screening of chemical databases. 1. Evaluation of different docking/scoring combinations. , 2000, Journal of medicinal chemistry.

[84]  A. Sali,et al.  Statistical potential for assessment and prediction of protein structures , 2006, Protein science : a publication of the Protein Society.

[85]  Ruth Nussinov,et al.  Principles of docking: An overview of search algorithms and a guide to scoring functions , 2002, Proteins.

[86]  Michael G. Lerner,et al.  Binding MOAD (Mother Of All Databases) , 2005, Proteins.

[87]  David S. Goodsell,et al.  Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function , 1998 .

[88]  Xiaoqin Zou,et al.  Inclusion of Solvation and Entropy in the Knowledge-Based Scoring Function for Protein-Ligand Interactions , 2010, J. Chem. Inf. Model..