An Improved PMF Scoring Function for Universally Predicting the Interactions of a Ligand with Protein, DNA, and RNA

An improved potential mean force (PMF) scoring function, named KScore, has been developed by using 23 redefined ligand atom types and 17 protein atom types, as well as 28 newly introduced atom types for nucleic acids (DNA and RNA). Metal ions and water molecules embedded in the binding sites of receptors are considered explicitly by two newly defined atom types. The individual potential terms were devised on the basis of the high-resolution crystal and NMR structures of 2,422 protein-ligand complexes, 300 DNA-ligand complexes, and 97 RNA-ligand complexes. The optimized atom pairwise distances and minima of the potentials overcome some of the disadvantages and ambiguities of current PMF potentials; thus, they more reasonably explain the atomic interaction between receptors and ligands. KScore was validated against five test sets of protein-ligand complexes and two sets of nucleic-acid-ligand complexes. The results showed acceptable correlations between KScore scores and experimentally determined binding affinities (log K i's or binding free energies). In particular, KScore can be used to rank the binding of ligands with metalloproteins; the linear correlation coefficient ( R) for the test set is 0.65. In addition to reasonably ranking protein-ligand interactions, KScore also yielded good results for scoring DNA/RNA--ligand interactions; the linear correlation coefficients for DNA-ligand and RNA-ligand complexes are 0.68 and 0.81, respectively. Moreover, KScore can appropriately reproduce the experimental structures of ligand-receptor complexes. Thus, KScore is an appropriate scoring function for universally ranking the interactions of ligands with protein, DNA, and RNA.

[1]  Shaomeng Wang,et al.  M-score: a knowledge-based potential scoring function accounting for protein atom mobility. , 2006, Journal of medicinal chemistry.

[2]  G. V. Paolini,et al.  Empirical scoring functions: I. The development of a fast empirical scoring function to estimate the binding affinity of ligands in receptor complexes , 1997, J. Comput. Aided Mol. Des..

[3]  D. Goodsell Sequence recognition of DNA by lexitropsins. , 2001, Current medicinal chemistry.

[4]  Gerhard Klebe,et al.  AffinDB: a freely accessible database of affinities for protein–ligand complexes from the PDB , 2005, Nucleic Acids Res..

[5]  Dennis A Dougherty,et al.  Cation-pi interactions in ligand recognition and catalysis. , 2002, Trends in pharmacological sciences.

[6]  E. Shakhnovich,et al.  SMoG: de Novo Design Method Based on Simple, Fast, and Accurate Free Energy Estimates. 1. Methodology and Supporting Evidence , 1996 .

[7]  D. Goodsell,et al.  Automated docking of substrates to proteins by simulated annealing , 1990, Proteins.

[8]  Ajay N. Jain Scoring noncovalent protein-ligand interactions: A continuous differentiable function tuned to compute binding affinities , 1996, J. Comput. Aided Mol. Des..

[9]  I. Kuntz,et al.  Automated docking with grid‐based energy evaluation , 1992 .

[10]  G. Klebe,et al.  DrugScore(CSD)-knowledge-based scoring function derived from small molecule crystal data with superior recognition rate of near-native ligand poses and better affinity prediction. , 2005, Journal of medicinal chemistry.

[11]  Xin Wen,et al.  BindingDB: a web-accessible database of experimentally determined protein–ligand binding affinities , 2006, Nucleic Acids Res..

[12]  Tarun Jain,et al.  Computational protocol for predicting the binding affinities of zinc containing metalloprotein–ligand complexes , 2007, Proteins.

[13]  K. Karlin,et al.  Nitric Oxide in Biological Denitrification: Fe/Cu Metalloenzyme and Metal Complex NOx Redox Chemistry , 2002 .

[14]  J. Madura,et al.  Docking Substrates to Metalloenzymes , 2000 .

[15]  Y. Martin,et al.  A general and fast scoring function for protein-ligand interactions: a simplified potential approach. , 1999, Journal of medicinal chemistry.

[16]  John B. O. Mitchell,et al.  Protein Ligand Database (PLD): additional understanding of the nature and specificity of protein-ligand complexes , 2003, Bioinform..

[17]  R. Cramer,et al.  Validation of the general purpose tripos 5.2 force field , 1989 .

[18]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[19]  C L Brooks,et al.  Ligand-protein database: linking protein-ligand complex structures to binding data. , 2001, Journal of medicinal chemistry.

[20]  D. Wemmer Ligands recognizing the minor groove of DNA: Development and applications , 1999, Biopolymers.

[21]  Honglin Li,et al.  GAsDock: a new approach for rapid flexible docking based on an improved multi-population genetic algorithm. , 2004, Bioorganic & medicinal chemistry letters.

[22]  Renxiao Wang,et al.  Comparative evaluation of 11 scoring functions for molecular docking. , 2003, Journal of medicinal chemistry.

[23]  Natasja Brooijmans,et al.  Molecular recognition and docking algorithms. , 2003, Annual review of biophysics and biomolecular structure.

[24]  J A McCammon,et al.  Theory of biomolecular recognition. , 1998, Current opinion in structural biology.

[25]  Song Liu,et al.  A knowledge-based energy function for protein-ligand, protein-protein, and protein-DNA complexes. , 2005, Journal of medicinal chemistry.

[26]  D. Beveridge,et al.  Free energy via molecular simulation: applications to chemical and biomolecular systems. , 1989, Annual review of biophysics and biophysical chemistry.

[27]  I. Kuntz,et al.  Ligand solvation in molecular docking , 1999, Proteins.

[28]  Saher Afshan Shaikh,et al.  A swift all-atom energy-based computational protocol to predict DNA-ligand binding affinity and DeltaTm. , 2007, Journal of medicinal chemistry.

[29]  T. Klabunde,et al.  The Dimetal Center in purple acid phosphatases , 1997 .

[30]  Renxiao Wang,et al.  The PDBbind database: collection of binding affinities for protein-ligand complexes with known three-dimensional structures. , 2004, Journal of medicinal chemistry.

[31]  Xiaomin Luo,et al.  PDTD: a web-accessible protein database for drug target identification , 2008, BMC Bioinformatics.

[32]  J. Gasteiger,et al.  FROM ATOMS AND BONDS TO THREE-DIMENSIONAL ATOMIC COORDINATES : AUTOMATIC MODEL BUILDERS , 1993 .

[33]  C. Fierke,et al.  Zinc-catalyzed sulfur alkyation:insights from protein farnesyltransferase. , 1999, Current opinion in chemical biology.

[34]  D. A. Dougherty,et al.  Cation-π Interactions in Chemistry and Biology: A New View of Benzene, Phe, Tyr, and Trp , 1996, Science.

[35]  K. Karlin,et al.  Nitric oxide in biological denitrification: Fe/Cu metalloenzyme and metal complex NO(x) redox chemistry. , 2002, Chemical reviews.

[36]  I. Kuntz,et al.  Inclusion of Solvation in Ligand Binding Free Energy Calculations Using the Generalized-Born Model , 1999 .

[37]  P. Kollman,et al.  Calculating structures and free energies of complex molecules: combining molecular mechanics and continuum models. , 2000, Accounts of chemical research.

[38]  John J Irwin,et al.  Virtual screening against metalloenzymes for inhibitors and substrates. , 2005, Biochemistry.

[39]  Holger Gohlke,et al.  DrugScoreRNAKnowledge-Based Scoring Function To Predict RNA-Ligand Interactions , 2007, J. Chem. Inf. Model..

[40]  Hans-Joachim Böhm,et al.  The development of a simple empirical scoring function to estimate the binding constant for a protein-ligand complex of known three-dimensional structure , 1994, J. Comput. Aided Mol. Des..

[41]  I. Muegge PMF scoring revisited. , 2006, Journal of medicinal chemistry.

[42]  E. Shakhnovich,et al.  SMall Molecule Growth 2001 (SMoG2001): an improved knowledge-based scoring function for protein-ligand interactions. , 2002, Journal of medicinal chemistry.

[43]  Renxiao Wang,et al.  The PDBbind database: methodologies and updates. , 2005, Journal of medicinal chemistry.

[44]  Hans-Joachim Böhm,et al.  Prediction of binding constants of protein ligands: A fast method for the prioritization of hits obtained from de novo design or 3D database search programs , 1998, J. Comput. Aided Mol. Des..

[45]  Johan Åqvist,et al.  Ligand binding affinity prediction by linear interaction energy methods , 1998, J. Comput. Aided Mol. Des..

[46]  Xiaoqin Zou,et al.  An iterative knowledge‐based scoring function to predict protein–ligand interactions: II. Validation of the scoring function , 2006, J. Comput. Chem..

[47]  Gerhard Klebe,et al.  Predicting binding modes, binding affinities and ‘hot spots’ for protein-ligand complexes using a knowledge-based scoring function , 2000 .

[48]  M. Gilson,et al.  The statistical-thermodynamic basis for computation of binding affinities: a critical review. , 1997, Biophysical journal.