Anisotropic coarse-grained statistical potentials improve the ability to identify nativelike protein structures

We present a new method to extract distance and orientation dependent potentials between amino acid side chains using a database of protein structures and the standard Boltzmann device. The importance of orientation dependent interactions is first established by computing orientational order parameters for proteins with α-helical and β-sheet architecture. Extraction of the anisotropic interactions requires defining local reference frames for each amino acid that uniquely determine the coordinates of the neighboring residues. Using the local reference frames and histograms of the radial and angular correlation functions for a standard set of nonhomologue protein structures, we construct the anisotropic pair potentials. The performance of the orientation dependent potentials was studied using a large database of decoy proteins. The results demonstrate that the new distance and orientation dependent residue–residue potentials present a significantly improved ability to recognize native folds from a set of native and decoy protein structures.

[1]  D. Baker,et al.  Contact order, transition state placement and the refolding rates of single domain proteins. , 1998, Journal of molecular biology.

[2]  R L Jernigan,et al.  Coordination geometry of nonbonded residues in globular proteins. , 1996, Folding & design.

[3]  M. J. Ruiz-Montero,et al.  Numerical evidence for bcc ordering at the surface of a critical fcc nucleus. , 1995, Physical review letters.

[4]  Excluded volume in protein side-chain packing. , 2001, Journal of molecular biology.

[5]  K. Dill,et al.  Protein core assembly processes , 1993 .

[6]  M R Lee,et al.  Free-energy calculations highlight differences in accuracy between X-ray and NMR structures and add value to protein structure prediction. , 2001, Structure.

[7]  A. Caflisch,et al.  Folding simulations of a three-stranded antiparallel β-sheet peptide , 2000 .

[8]  D. Baker,et al.  Molecular dynamics in the endgame of protein structure prediction. , 2001, Journal of molecular biology.

[9]  K Schulten,et al.  VMD: visual molecular dynamics. , 1996, Journal of molecular graphics.

[10]  H. Scheraga,et al.  Medium- and long-range interaction parameters between amino acids for predicting three-dimensional structures of proteins. , 1976, Macromolecules.

[11]  N. Linial,et al.  On the design and analysis of protein folding potentials , 2000, Proteins.

[12]  R. Jernigan,et al.  Self‐consistent estimation of inter‐residue protein contact energies based on an equilibrium mixture approximation of residues , 1999, Proteins.

[13]  S Vajda,et al.  Discrimination of near‐native protein structures from misfolded models by empirical free energy functions , 2000, Proteins.

[14]  G. Casari,et al.  Identification of native protein folds amongst a large number of incorrect models. The calculation of low energy conformations from potentials of mean force. , 1990, Journal of molecular biology.

[15]  C. Tanford Macromolecules , 1994, Nature.

[16]  K. Dill,et al.  Statistical potentials extracted from protein structures: how accurate are they? , 1996, Journal of molecular biology.

[17]  E. Domany,et al.  Pairwise contact potentials are unsuitable for protein folding , 1998 .

[18]  J Meller,et al.  Linear programming optimization and a double statistical filter for protein threading protocols , 2001, Proteins.

[19]  M. Levitt,et al.  Protein folding: the endgame. , 1997, Annual review of biochemistry.

[20]  I Bahar,et al.  Packing of sidechains in low-resolution models for proteins. , 1998, Folding & design.

[21]  D. Thirumalai,et al.  Exploring the propensities of helices in PrP(C) to form beta sheet using NMR structures and sequence alignments. , 2002, Biophysical journal.

[22]  U. Bornscheuer,et al.  Protein Engineering , 2018, Methods in Molecular Biology.

[23]  P. Kollman,et al.  Pathways to a protein folding intermediate observed in a 1-microsecond simulation in aqueous solution. , 1998, Science.

[24]  D. Thirumalai,et al.  Pair potentials for protein folding: Choice of reference states and sensitivity of predicted native states to variations in the interaction schemes , 2008, Protein science : a publication of the Protein Society.

[25]  A. Godzik,et al.  Are proteins ideal mixtures of amino acids? Analysis of energy parameter sets , 1995, Protein science : a publication of the Protein Society.

[26]  B. Berne Modification of the overlap potential to mimic a linear site-site potential , 1981 .

[27]  A. Liwo,et al.  A united‐residue force field for off‐lattice protein‐structure simulations. II. Parameterization of short‐range interactions and determination of weights of energy terms by Z‐score optimization , 1997 .

[28]  R. Jernigan,et al.  Residue packing in proteins: Uniform distribution on a coarse-grained scale , 2002 .

[29]  M. Sippl Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. , 1990, Journal of molecular biology.

[30]  M J Sippl,et al.  Knowledge-based potentials for proteins. , 1995, Current opinion in structural biology.

[31]  A. Liwo,et al.  Energy-based de novo protein folding by conformational space annealing and an off-lattice united-residue force field: application to the 10-55 fragment of staphylococcal protein A and to apo calbindin D9K. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[32]  A. Liwo,et al.  A united‐residue force field for off‐lattice protein‐structure simulations. I. Functional forms and parameters of long‐range side‐chain interaction potentials from protein crystal data , 1997 .

[33]  Y. Vorobjev Block‐units method for conformational calculations of large nucleic acid chains. II. The two‐hierarchical approach and its application to conformational arrangement of the unusual TψC loop of rabbit tRNAval , 1990, Biopolymers.

[34]  Ron Elber,et al.  Maximum feasibility guideline in the design and analysis of protein folding potentials , 2002, J. Comput. Chem..

[35]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[36]  A. Liwo,et al.  United‐residue force field for off‐lattice protein‐structure simulations: III. Origin of backbone hydrogen‐bonding cooperativity in united‐residue potentials , 1998 .

[37]  P. Steinhardt,et al.  Bond-orientational order in liquids and glasses , 1983 .

[38]  A. Ben-Naim STATISTICAL POTENTIALS EXTRACTED FROM PROTEIN STRUCTURES : ARE THESE MEANINGFUL POTENTIALS? , 1997 .

[39]  F M Richards,et al.  An analysis of packing in the protein folding problem , 1993, Quarterly Reviews of Biophysics.

[40]  R. Samudrala,et al.  An all-atom distance-dependent conditional probability discriminatory function for protein structure prediction. , 1998, Journal of molecular biology.

[41]  R Samudrala,et al.  Decoys ‘R’ Us: A database of incorrect conformations to improve protein structure prediction , 2000, Protein science : a publication of the Protein Society.

[42]  Pieter Rein ten Wolde,et al.  Numerical calculation of the rate of crystal nucleation in a Lennard‐Jones system at moderate undercooling , 1996 .

[43]  J Chomilier,et al.  Voronoï tessellation reveals the condensed matter character of folded proteins. , 2000, Physical review letters.

[44]  Daan Frenkel,et al.  COMPUTER-SIMULATION STUDY OF FREE-ENERGY BARRIERS IN CRYSTAL NUCLEATION , 1992 .

[45]  C. Chothia,et al.  The Packing Density in Proteins: Standard Radii and Volumes , 1999 .

[46]  A. Maritan,et al.  Anisotropic effective interactions in a coarse‐grained tube picture of proteins , 2002, Proteins.

[47]  F. Young Biochemistry , 1955, The Indian Medical Gazette.

[48]  D. Eisenberg Three-dimensional structure of membrane and surface proteins. , 1984, Annual review of biochemistry.

[49]  Y. Vorobjev,et al.  Block‐units method for conformational calculations of large nucleic acid chains. I. Block‐units approximation of atomic structure and conformational energy of polynucleotides , 1990 .

[50]  R. Elber,et al.  Distance‐dependent, pair potential for protein folding: Results from linear optimization , 2000, Proteins.