Orientation-dependent coarse-grained potentials derived by statistical analysis of molecular structural databases

Abstract We present results obtained for anisotropic potentials for protein simulations extracted from the continually growing databases of protein structures. This work is based on the assumption that the detailed information on molecular conformations can be used to derive statistical (a.k.a. ‘knowledge-based’) potentials that can describe on a coarse-grained level the side chain–side chain interactions in peptides and proteins. The complexity of inter-residue interactions is reflected in a high degree of orientational anisotropy for the twenty amino acids. By including in this coarse-grained interaction model the possibility of quantifying the backbone–backbone and backbone–side chain interactions, important improvements are obtained in characterizing the native protein states. Results obtained from tests that involve the identification of native-like conformations from large sets of decoy structures are presented. The method for deriving orientation-dependent statistical potentials is also applied to obtain water–water interactions. Monte Carlo simulations using the new coarse-grained water model show that the locations of the minima and maxima of the oxygen–oxygen radial distribution function correspond well with experimental measurements.

[1]  Sanzo Miyazawa,et al.  Evaluation of short‐range interactions as secondary structure energies for protein fold and sequence recognition , 1999, Proteins.

[2]  P. Swarztrauber,et al.  SPHEREPACK 3.0: A Model Development Facility , 1999 .

[3]  R L Jernigan,et al.  Coordination geometry of nonbonded residues in globular proteins. , 1996, Folding & design.

[4]  Amalendu Chandra,et al.  Dynamical properties of the soft sticky dipole model of water: Molecular dynamics simulations , 1999 .

[5]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[6]  M. Sippl Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. , 1990, Journal of molecular biology.

[7]  A. Liwo,et al.  United‐residue force field for off‐lattice protein‐structure simulations: III. Origin of backbone hydrogen‐bonding cooperativity in united‐residue potentials , 1998 .

[8]  Greg L. Hura,et al.  What can x-ray scattering tell us about the radial distribution functions of water? , 2000 .

[9]  D. Thirumalai,et al.  Metastability of the folded states of globular proteins. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[10]  J. Banavar,et al.  Computer Simulation of Liquids , 1988 .

[11]  R. Jernigan,et al.  Estimation of effective interresidue contact energies from protein crystal structures: quasi-chemical approximation , 1985 .

[12]  J. Straub,et al.  Orientational potentials extracted from protein structures improve native fold recognition , 2004, Protein science : a publication of the Protein Society.

[13]  Ken A. Dill,et al.  A Simple Model of Water and the Hydrophobic Effect , 1998 .

[14]  R. Samudrala,et al.  An all-atom distance-dependent conditional probability discriminatory function for protein structure prediction. , 1998, Journal of molecular biology.

[15]  R Samudrala,et al.  Decoys ‘R’ Us: A database of incorrect conformations to improve protein structure prediction , 2000, Protein science : a publication of the Protein Society.

[16]  M J Sippl,et al.  Knowledge-based potentials for proteins. , 1995, Current opinion in structural biology.

[17]  R L Jernigan,et al.  Short‐range conformational energies, secondary structure propensities, and recognition of correct sequence‐structure matches , 1997, Proteins.

[18]  M. Levitt,et al.  Computer simulation of protein folding , 1975, Nature.

[19]  Michael W. Mahoney,et al.  A five-site model for liquid water and the reproduction of the density anomaly by rigid, nonpolarizable potential functions , 2000 .

[20]  Laxmikant V. Kale,et al.  NAMD2: Greater Scalability for Parallel Molecular Dynamics , 1999 .

[21]  K. Dill,et al.  Hydrophobicity in a simple model of water: Entropy penalty as a sum of competing terms via full, angular expansion , 2001 .

[22]  P. Swarztrauber,et al.  SPHEREPACK 3.0: A Model Development Facility , 1999 .

[23]  D. Thirumalai,et al.  Anisotropic coarse-grained statistical potentials improve the ability to identify nativelike protein structures , 2003, physics/0302009.

[24]  A. Liwo,et al.  Energy-based de novo protein folding by conformational space annealing and an off-lattice united-residue force field: application to the 10-55 fragment of staphylococcal protein A and to apo calbindin D9K. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[25]  James E. Bray,et al.  Assigning genomic sequences to CATH , 2000, Nucleic Acids Res..

[26]  R. Jernigan,et al.  Self‐consistent estimation of inter‐residue protein contact energies based on an equilibrium mixture approximation of residues , 1999, Proteins.

[27]  Greg L. Hura,et al.  A high-quality x-ray scattering experiment on liquid water at ambient conditions , 2000 .

[28]  C Kooperberg,et al.  Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. , 1997, Journal of molecular biology.

[29]  Yu Xia,et al.  Determination of optimal Chebyshev-expanded hydrophobic discrimination function for globular proteins , 2001, IBM J. Res. Dev..

[30]  K. Dill,et al.  Statistical potentials extracted from protein structures: how accurate are they? , 1996, Journal of molecular biology.

[31]  A. Godzik,et al.  Are proteins ideal mixtures of amino acids? Analysis of energy parameter sets , 1995, Protein science : a publication of the Protein Society.

[32]  A. Liwo,et al.  A united‐residue force field for off‐lattice protein‐structure simulations. II. Parameterization of short‐range interactions and determination of weights of energy terms by Z‐score optimization , 1997 .

[33]  E S Huang,et al.  Factors affecting the ability of energy functions to discriminate correct from incorrect folds. , 1997, Journal of molecular biology.

[34]  H. Scheraga,et al.  Medium- and long-range interaction parameters between amino acids for predicting three-dimensional structures of proteins. , 1976, Macromolecules.

[35]  M. Levitt,et al.  Energy functions that discriminate X-ray and near native folds from well-constructed decoys. , 1996, Journal of molecular biology.

[36]  D. Thirumalai,et al.  Pair potentials for protein folding: Choice of reference states and sensitivity of predicted native states to variations in the interaction schemes , 2008, Protein science : a publication of the Protein Society.

[37]  Berend Smit,et al.  Understanding molecular simulation: from algorithms to applications , 1996 .

[38]  R. Jernigan,et al.  Inter-residue potentials in globular proteins and the dominance of highly specific hydrophilic interactions at close separation. , 1997, Journal of molecular biology.

[39]  H. Stanley,et al.  Application of Statistical Physics to Understand Static and Dynamic Anomalies in Liquid Water , 2003 .

[40]  A. Liwo,et al.  A united‐residue force field for off‐lattice protein‐structure simulations. I. Functional forms and parameters of long‐range side‐chain interaction potentials from protein crystal data , 1997 .

[41]  M. Levitt,et al.  A novel approach to decoy set generation: designing a physical energy function having local minima with native structure characteristics. , 2003, Journal of molecular biology.

[42]  Toshiko Ichiye,et al.  Soft Sticky Dipole Potential for Liquid Water: A New Model , 1996 .

[43]  G. Arfken Mathematical Methods for Physicists , 1967 .

[44]  G. Casari,et al.  Identification of native protein folds amongst a large number of incorrect models. The calculation of low energy conformations from potentials of mean force. , 1990, Journal of molecular biology.