An all atom energy based computational protocol for predicting binding affinities of protein–ligand complexes

We report here a computationally fast protocol for predicting binding affinities of non‐metallo protein–ligand complexes. The protocol builds in an all atom energy based empirical scoring function comprising electrostatics, van der Waals, hydrophobicity and loss of conformational entropy of protein side chains upon ligand binding. The method is designed to ensure transferability across diverse systems and has been validated on a heterogenous dataset of 161 complexes consisting of 55 unique protein targets. The scoring function trained on a dataset of 61 complexes yielded a correlation of r = 0.92 for the predicted binding free energies against the experimental binding affinities. Model validation and parameter analysis studies ensure the predictive ability of the scoring function. When tested on the remaining 100 protein–ligand complexes a correlation of r = 0.92 was recovered. The high correlation obtained underscores the potential applicability of the methodology in drug design endeavors. The scoring function has been web enabled at www.scfbio‐iitd.res.in/software/drugdesign/bappl.jsp as binding affinity prediction of protein–ligand (BAPPL) server.

[1]  W. C. Still,et al.  Semianalytical treatment of solvation for molecular mechanics and dynamics , 1990 .

[2]  Thomas Simonson,et al.  Free Energy Simulations Come of Age: Protein—Ligand Recognition , 2002 .

[3]  Araz Jakalian,et al.  Fast, efficient generation of high‐quality atomic charges. AM1‐BCC model: I. Method , 2000 .

[4]  John B. O. Mitchell,et al.  Protein Ligand Database (PLD): additional understanding of the nature and specificity of protein-ligand complexes , 2003, Bioinform..

[5]  Nidhi Arora,et al.  Strength of hydrogen bonds in helices , 1997, J. Comput. Chem..

[6]  A. Doig,et al.  Toward the semiquantitative estimation of binding constants guides for peptide peptide binding in aqueous solution , 1991 .

[7]  Song Liu,et al.  A knowledge-based energy function for protein-ligand, protein-protein, and protein-DNA complexes. , 2005, Journal of medicinal chemistry.

[8]  Todd J. A. Ewing,et al.  DOCK 4.0: Search strategies for automated molecular docking of flexible molecule databases , 2001, J. Comput. Aided Mol. Des..

[9]  G. V. Paolini,et al.  Empirical scoring functions: I. The development of a fast empirical scoring function to estimate the binding affinity of ligands in receptor complexes , 1997, J. Comput. Aided Mol. Des..

[10]  J. Åqvist,et al.  Ligand binding affinities from MD simulations. , 2002, Accounts of chemical research.

[11]  P. Kollman,et al.  A Second Generation Force Field for the Simulation of Proteins, Nucleic Acids, and Organic Molecules , 1995 .

[12]  Junmei Wang,et al.  Development and testing of a general amber force field , 2004, J. Comput. Chem..

[13]  S Vajda,et al.  Effect of conformational flexibility and solvation on receptor-ligand binding free energies. , 1994, Biochemistry.

[14]  A. D. McLachlan,et al.  Solvation energy in protein folding and binding , 1986, Nature.

[15]  M. Rami Reddy,et al.  Free energy calculations in rational drug design , 2001 .

[16]  C. Venkatachalam,et al.  LigScore: a novel scoring function for predicting binding affinities. , 2005, Journal of molecular graphics & modelling.

[17]  M. Murcko,et al.  Consensus scoring: A method for obtaining improved hit rates from docking databases of three-dimensional structures into proteins. , 1999, Journal of medicinal chemistry.

[18]  Ajay,et al.  Computational methods to predict binding free energy in ligand-receptor complexes. , 1995, Journal of medicinal chemistry.

[19]  M. Gilson,et al.  The statistical-thermodynamic basis for computation of binding affinities: a critical review. , 1997, Biophysical journal.

[20]  Matthew P. Repasky,et al.  Glide: a new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy. , 2004, Journal of medicinal chemistry.

[21]  E. Shakhnovich,et al.  SMoG: de Novo Design Method Based on Simple, Fast, and Accurate Free Energy Estimates. 1. Methodology and Supporting Evidence , 1996 .

[22]  Nidhi Arora,et al.  Strength of hydrogen bonds in α helices , 1997 .

[23]  Natasja Brooijmans,et al.  Molecular recognition and docking algorithms. , 2003, Annual review of biophysics and biomolecular structure.

[24]  David L. Beveridge,et al.  Free energy of an arbitrary charge distribution imbedded in coaxial cylindrical dielectric continua: application to conformational preferences of DNA in aqueous solutions , 1990 .

[25]  Martin Stahl,et al.  Scoring functions for protein-ligand interactions: a critical perspective. , 2004, Drug discovery today. Technologies.

[26]  Alexander Golbraikh,et al.  Rational selection of training and test sets for the development of validated QSAR models , 2003, J. Comput. Aided Mol. Des..

[27]  David S. Goodsell,et al.  Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function , 1998, J. Comput. Chem..

[28]  Luhua Lai,et al.  Further development and validation of empirical scoring functions for structure-based binding affinity prediction , 2002, J. Comput. Aided Mol. Des..

[29]  Peter A. Kollman,et al.  AMBER, a package of computer programs for applying molecular mechanics, normal mode analysis, molecular dynamics and free energy calculations to simulate the structural and energetic properties of molecules , 1995 .

[30]  M J Sternberg,et al.  Side‐chain conformational entropy in protein folding , 1995, Protein science : a publication of the Protein Society.

[31]  R. Bruccoleri,et al.  On the attribution of binding energy in antigen-antibody complexes McPC 603, D1.3, and HyHEL-5. , 1989, Biochemistry.

[32]  Yuan-Ping Pang,et al.  EUDOC: a computer program for identification of drug interaction sites in macromolecules and drug leads from chemical databases , 2001, J. Comput. Chem..

[33]  Martin Almlöf,et al.  Free energy calculations and ligand binding. , 2003, Advances in protein chemistry.

[34]  G. Klebe,et al.  Knowledge-based scoring function to predict protein-ligand interactions. , 2000, Journal of molecular biology.

[35]  G. Crippen,et al.  Prediction of Physicochemical Parameters by Atomic Contributions. , 1999 .

[36]  Thomas Lengauer,et al.  A fast flexible docking method using an incremental construction algorithm. , 1996, Journal of molecular biology.

[37]  Janet M. Thornton,et al.  BLEEP - potential of mean force describing protein-ligand interactions: II. Calculation of binding energies and comparison with experimental data , 1999, J. Comput. Chem..

[38]  K A Dill,et al.  Additivity Principles in Biochemistry* , 1997, The Journal of Biological Chemistry.

[39]  Hans-Joachim Böhm,et al.  Prediction of binding constants of protein ligands: A fast method for the prioritization of hits obtained from de novo design or 3D database search programs , 1998, J. Comput. Aided Mol. Des..

[40]  A. Tropsha,et al.  Beware of q2! , 2002, Journal of molecular graphics & modelling.

[41]  J. Bajorath,et al.  Docking and scoring in virtual screening for drug discovery: methods and applications , 2004, Nature Reviews Drug Discovery.

[42]  M J Sternberg,et al.  Empirical scale of side-chain conformational entropy in protein folding. , 1993, Journal of molecular biology.

[43]  B. Lee,et al.  The interpretation of protein structures: estimation of static accessibility. , 1971, Journal of molecular biology.

[44]  B. Jayaram,et al.  Free energy component analysis for drug design: a case study of HIV-1 protease-inhibitor binding. , 2001, Journal of medicinal chemistry.

[45]  Y. Martin,et al.  A general and fast scoring function for protein-ligand interactions: a simplified potential approach. , 1999, Journal of medicinal chemistry.

[46]  Renxiao Wang,et al.  Comparative evaluation of 11 scoring functions for molecular docking. , 2003, Journal of medicinal chemistry.

[47]  I. Kuntz Structure-Based Strategies for Drug Design and Discovery , 1992, Science.

[48]  B Jayaram,et al.  Protein Structure Evaluation using an All-Atom Energy Based Empirical Scoring Function , 2006, Journal of biomolecular structure & dynamics.

[49]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[50]  Hege S. Beard,et al.  Glide: a new approach for rapid, accurate docking and scoring. 2. Enrichment factors in database screening. , 2004, Journal of medicinal chemistry.

[51]  P. Kollman,et al.  A well-behaved electrostatic potential-based method using charge restraints for deriving atomic char , 1993 .

[52]  P. Kollman,et al.  Biomolecular simulations: recent developments in force fields, simulations of enzyme catalysis, protein-ligand, protein-protein, and protein-nucleic acid noncovalent interactions. , 2001, Annual review of biophysics and biomolecular structure.

[53]  Bhyravabhotla Jayaram,et al.  Solvation Free Energy of Biomacromolecules: Parameters for a Modified Generalized Born Model Consistent with the AMBER Force Field , 1998 .

[54]  P Willett,et al.  Development and validation of a genetic algorithm for flexible docking. , 1997, Journal of molecular biology.

[55]  W. L. Jorgensen The Many Roles of Computation in Drug Discovery , 2004, Science.

[56]  R. Krause,et al.  The Origin of Plagues: Old and New , 1992, Science.

[57]  C L Brooks,et al.  Ligand-protein database: linking protein-ligand complex structures to binding data. , 2001, Journal of medicinal chemistry.

[58]  G. Klebe,et al.  Statistical potentials and scoring functions applied to protein-ligand binding. , 2001, Current opinion in structural biology.

[59]  A V Finkelstein,et al.  The price of lost freedom: entropy of bimolecular complex formation. , 1989, Protein engineering.

[60]  A. Saltelli,et al.  Sensitivity analysis for chemical models. , 2005, Chemical reviews.

[61]  G. Klebe,et al.  Approaches to the Description and Prediction of the Binding Affinity of Small-Molecule Ligands to Macromolecular Receptors , 2002 .

[62]  Mark S. Gordon,et al.  General atomic and molecular electronic structure system , 1993, J. Comput. Chem..

[63]  H. Scheraga,et al.  Model for the conformational analysis of hydrated peptides. Effect of hydration on the conformational stability of the terminally blocked residues of the 20 naturally occurring amino acids , 1979 .

[64]  Garland R. Marshall,et al.  VALIDATE: A New Method for the Receptor-Based Prediction of Binding Affinities of Novel Ligands , 1996 .

[65]  A. J. Hopfinger,et al.  Polymer-Solvent Interactions for Homopolypeptides in Aqueous Solution , 1971 .

[66]  Bhyravabhotla Jayaram,et al.  Local dielectric environment of B-DNA in solution : Results from a 14 ns molecular dynamics trajectory , 1998 .

[67]  F. Momany,et al.  Validation of the general purpose QUANTA ®3.2/CHARMm® force field , 1992 .

[68]  Luhua Lai,et al.  SCORE: A New Empirical Method for Estimating the Binding Affinity of a Protein-Ligand Complex , 1998 .

[69]  D. J. Price,et al.  Assessing scoring functions for protein-ligand interactions. , 2004, Journal of medicinal chemistry.

[70]  Nidhi Arora,et al.  Energetics of Base Pairs in B-DNA in Solution: An Appraisal of Potential Functions and Dielectric Treatments , 1998 .

[71]  B. Jayaram,et al.  A Binding Affinity Based Computational Pathway for Active-Site Directed Lead Molecule Design: Some Promises and Perspectives , 2005 .