Algorithms for computational solvent mapping of proteins

Computational mapping methods place molecular probes (small molecules or functional groups) on a protein surface to identify the most favorable binding positions by calculating an interaction potential. We have developed a novel computational mapping program called CS‐Map (computational solvent mapping of proteins), which differs from earlier mapping methods in three respects: (i) it initially moves the ligands on the protein surface toward regions with favorable electrostatics and desolvation, (ii) the final scoring potential accounts for desolvation, and (iii) the docked ligand positions are clustered, and the clusters are ranked on the basis of their average free energies. To understand the relative importance of these factors, we developed alternative algorithms that use the DOCK and GRAMM programs for the initial search. Because of the availability of experimental solvent mapping data, lysozyme and thermolysin are considered as test proteins. Both DOCK and GRAMM speed up the initial search, and the combined algorithms yield acceptable mapping results. However, the DOCK‐based approaches place the consensus site farther from its experimentally determined position than CS‐Map, primarily because of the lack of a solvation term in the initial search. The GRAMM‐based program also finds the correct consensus site for thermolysin. We conclude that good sampling is the most important requirement for successful mapping, but accounting for desolvation and clustering of ligand positions also help to reduce the number of false positives. Proteins 2003;51:340–351. © 2003 Wiley‐Liss, Inc.

[1]  G. Otting,et al.  Organic solvents identify specific ligand binding sites on protein surfaces , 1997, Nature Biotechnology.

[2]  M. Karplus,et al.  Multiple copy simultaneous search and construction of ligands in binding sites: application to inhibitors of HIV-1 aspartic proteinase. , 1993, Journal of medicinal chemistry.

[3]  S. Vajda,et al.  Exploring potential solvation sites of proteins by multistart local minimization , 2000 .

[4]  W. C. Still,et al.  The GB/SA Continuum Model for Solvation. A Fast Analytical Method for the Calculation of Approximate Born Radii , 1997 .

[5]  S Vajda,et al.  Empirical free energy as a target function in docking and design: application to HIV‐1 protease inhibitors , 1996, FEBS letters.

[6]  Jacquelyn S. Fetrow,et al.  Structural genomics and its importance for gene function analysis , 2000, Nature Biotechnology.

[7]  M. Karplus,et al.  Functionality maps of binding sites: A multiple copy simultaneous search method , 1991, Proteins.

[8]  D. Ringe,et al.  Locating and characterizing binding sites on proteins , 1996, Nature Biotechnology.

[9]  Todd J. A. Ewing,et al.  Critical evaluation of search algorithms for automated molecular docking and database screening , 1997 .

[10]  S Vajda,et al.  Effect of conformational flexibility and solvation on receptor-ligand binding free energies. , 1994, Biochemistry.

[11]  B. Honig,et al.  Classical electrostatics in biology and chemistry. , 1995, Science.

[12]  P. Goodford A computational procedure for determining energetically favorable binding sites on biologically important macromolecules. , 1985, Journal of medicinal chemistry.

[13]  Wim G. J. Hol,et al.  In search of new lead compounds for trypanosomiasis drug design: A protein structure-based linked-fragment approach , 1992, J. Comput. Aided Mol. Des..

[14]  P. Hajduk,et al.  Discovering High-Affinity Ligands for Proteins , 1997, Science.

[15]  Hans-Joachim Böhm,et al.  The computer program LUDI: A new method for the de novo design of enzyme inhibitors , 1992, J. Comput. Aided Mol. Des..

[16]  U. Singh,et al.  A NEW FORCE FIELD FOR MOLECULAR MECHANICAL SIMULATION OF NUCLEIC ACIDS AND PROTEINS , 1984 .

[17]  John A. Nelder,et al.  A Simplex Method for Function Minimization , 1965, Comput. J..

[18]  Sergio H. Rotstein,et al.  GroupBuild: a fragment-based method for de novo drug design , 1994 .

[19]  D. Ringe,et al.  Proteins in organic solvents. , 2001, Current opinion in structural biology.

[20]  R E Hubbard,et al.  Experimental and computational mapping of the binding surface of a crystalline protein. , 2001, Protein engineering.

[21]  R E Hubbard,et al.  Locating interaction sites on proteins: The crystal structure of thermolysin soaked in 2% to 100% isopropanol , 1999, Proteins.

[22]  S Vajda,et al.  Prediction of protein complexes using empirical free energy functions , 1996, Protein science : a publication of the Protein Society.

[23]  Sandor Vajda,et al.  Computational mapping identifies the binding sites of organic solvents on proteins , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[24]  M Karplus,et al.  HOOK: A program for finding novel molecular architectures that satisfy the chemical and steric requirements of a macromolecule binding site , 1994, Proteins.

[25]  Robert E. Bruccoleri,et al.  Grid positioning independence and the reduction of self‐energy in the solution of the Poisson—Boltzmann equation , 1993, J. Comput. Chem..

[26]  I. Vakser,et al.  A systematic study of low-resolution recognition in protein--protein complexes. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[27]  Pieter F. W. Stouten,et al.  Fast prediction and visualization of protein binding pockets with PASS , 2000, J. Comput. Aided Mol. Des..

[28]  M. Karplus,et al.  A Comprehensive Analytical Treatment of Continuum Electrostatics , 1996 .

[29]  P. Hajduk,et al.  Discovering High-Affinity Ligands for Proteins: SAR by NMR , 1996, Science.

[30]  Panos M. Pardalos,et al.  Optimization in Computational Chemistry and Molecular Biology , 2000 .

[31]  B. Honig,et al.  Calculation of the total electrostatic energy of a macromolecular system: Solvation energies, binding energies, and conformational analysis , 1988, Proteins.

[32]  T. Stouch,et al.  Affinity and specificity of serine endopeptidase-protein inhibitor interactions. Empirical free energy calculations based on X-ray crystallographic structures. , 1993, Journal of molecular biology.

[33]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[34]  M J Sternberg,et al.  A continuum model for protein-protein interactions: application to the docking problem. , 1995, Journal of molecular biology.

[35]  H. Edelsbrunner,et al.  Anatomy of protein pockets and cavities: Measurement of binding site geometry and implications for ligand design , 1998, Protein science : a publication of the Protein Society.

[36]  R. Jernigan,et al.  Estimation of effective interresidue contact energies from protein crystal structures: quasi-chemical approximation , 1985 .

[37]  M. Swindells,et al.  Protein clefts in molecular recognition and function. , 1996, Protein science : a publication of the Protein Society.

[38]  C. DeLisi,et al.  Determination of atomic desolvation energies from the structures of crystallized proteins. , 1997, Journal of molecular biology.

[39]  M. Ondrechen,et al.  THEMATICS: A simple computational predictor of enzyme function from structure , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[40]  Karen N. Allen,et al.  An Experimental Approach to Mapping the Binding Surfaces of Crystalline Proteins , 1996 .

[41]  M. Lawrence,et al.  CLIX: A search algorithm for finding novel ligands capable of binding proteins of known three‐dimensional structure , 1992, Proteins.