Prediction and evaluation of side‐chain conformations for protein backbone structures

A common approach to protein modeling is to propose a backbone structure based on homology or threading and then to build side chains onto this backbone. A fast algorithm using the simple criteria of atomic overlap and overall rotamer probability is proposed for this purpose. The method was first tested in exhaustive searches of side‐chain configuration space in protein cores and was then applied to all side chains in 49 proteins of known structure, using simulated annealing to sample configuration space. The latter procedure obtains the correct rotamer for 57% and the correct χ1 value for 74% of the 6751 residues in the sample. When low‐temperature Monte Carlo simulations are initiated from the results of the simulated‐annealing runs, consensus configurations are obtained that yield slightly more accurate predictions. The Monte Carlo procedure also allows converged side‐chain entropies to be calculated for all residues. These prove to be accurate indicators of prediction reliability: for the half of the sample residues with the lowest entropies, the correct rotamer is obtained for 79% and the correct χ1 value for 84%. Side‐chain entropy and predictability are almost entirely uncorrelated with solvent‐accessible area. Some precedents for and implications of this observation are discussed. © 1996 Wiley‐Liss, Inc.
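
The link between Monte Carlo sampling and side‐chain entropy can be illustrated with a minimal sketch. It is not the authors' implementation: it treats a single residue with a fixed, hypothetical score per rotamer (standing in for an overlap penalty combined with a rotamer‐probability term), samples rotamers with the Metropolis criterion, and computes the Shannon entropy of the resulting occupancies. The function names, the score values, and the use of the natural logarithm are all assumptions made for illustration.

```python
import math
import random
from collections import Counter


def rotamer_entropy(counts):
    """Shannon entropy (natural log) of rotamer occupancy counts."""
    total = sum(counts)
    # Rotamers that were never visited contribute nothing to the sum.
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


def metropolis_sample(scores, temperature, steps, seed=0):
    """Sample rotamer indices for one residue with the Metropolis criterion.

    scores: hypothetical per-rotamer scores (lower is better); an assumption
            standing in for an overlap penalty plus a rotamer-probability term.
    Returns the number of steps spent in each rotamer.
    """
    rng = random.Random(seed)
    current = 0
    visits = Counter()
    for _ in range(steps):
        trial = rng.randrange(len(scores))
        delta = scores[trial] - scores[current]
        # Always accept downhill moves; accept uphill moves with
        # probability exp(-delta / temperature).
        if delta <= 0 or rng.random() < math.exp(-delta / temperature):
            current = trial
        visits[current] += 1
    return [visits[i] for i in range(len(scores))]


# One strongly favoured rotamer: occupancy concentrates, entropy is near zero.
tight = metropolis_sample([0.0, 50.0, 50.0], temperature=1.0, steps=1000)
# Three equally scored rotamers: occupancy spreads out, entropy approaches ln 3.
loose = metropolis_sample([0.0, 0.0, 0.0], temperature=1.0, steps=3000)
print(rotamer_entropy(tight), rotamer_entropy(loose))
```

In this toy setting a low entropy means the sampling keeps returning to the same rotamer, which is the sense in which the abstract uses entropy as an indicator of prediction reliability. In a full calculation the score of each rotamer would depend on the current rotamers of neighbouring residues, which this sketch omits.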
