An algorithm for determining the conformation of polypeptide segments in proteins by systematic search

The feasibility of determining the conformation of segments of polypeptide chain up to six residues in length in globular proteins by means of systematic search through the possibles conformations has been investigated. Trial conformations are generated by using representative sets of ϕ, ψ, and χ angels that have been derived from an examination of the distributions of these angles in refined protein structures. A set of filters based on simple rules that protein structures obey is used to reduce the number of conformations to a manageable total. The most important filters are the maintenance of chain integrity and the avoidance of tooshort van der Waals contacts with the rest of the protein and with other portions of the segment under construction. The procedure is intended to be used with approximate models so that allowance is made throughout for errors in the rest of the structure. All possible main chains are first constructed and then all possible side‐chain conformations are built onto each of these. The electrostatic energy, including a solvent screening term, and the exposed hyrophobic area are evaluated for each accepted conformation. The method has been tested on two segments of chain in the trypsin like enzyme from Streptomyces griseus. It is found that there is a wide spread of energies among the accepted conformations, and the lowest energy ones have satisfactorily small root mean square deviations from the X‐ray structure.

[1]  C. Tanford,et al.  Theory of Protein Titration Curves. I. General Equations for Impenetrable Spheres , 1957 .

[2]  W. Kauzmann Some factors in the interpretation of protein denaturation. , 1959, Advances in protein chemistry.

[3]  G. N. Ramachandran,et al.  Stereochemical criteria for polypeptide and protein chain conformations. II. Allowed conformations for a pair of peptide units. , 1965, Biophysical journal.

[4]  C. Levinthal Are there pathways for protein folding , 1968 .

[5]  B. Hartley Homologies in serine proteinases. , 1970, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[6]  N. Go,et al.  Ring Closure and Local Conformational Deformations of Chain Molecules , 1970 .

[7]  B. Lee,et al.  The interpretation of protein structures: estimation of static accessibility. , 1971, Journal of molecular biology.

[8]  D. Brooks,et al.  Content of ATP and ADP in Rabbit Blastocysts , 1971, Nature.

[9]  D. M. Shotton,et al.  Structural Similarities between α-Lytic Protease of Myxobacter 495 and Elastase , 1971 .

[10]  D. M. Blow,et al.  Structure of crystalline -chymotrypsin. V. The atomic structure of tosyl- -chymotrypsin at 2 A resolution. , 1972, Journal of molecular biology.

[11]  F. Richards The interpretation of protein structures: total volume, group volume distributions and packing density. , 1974, Journal of molecular biology.

[12]  H. Scheraga,et al.  Analysis of Conformations of Amino Acid Residues and Prediction of Backbone Topography in Proteins , 1974 .

[13]  S. Lifson,et al.  Energy functions for peptides and proteins. I. Derivation of a consistent force field including the hydrogen bond from amide crystals. , 1974, Journal of the American Chemical Society.

[14]  P K Warme,et al.  Computation of structures of homologous proteins. Alpha-lactalbumin from lysozyme. , 1974, Biochemistry.

[15]  B. Robson,et al.  Analysis of code relating sequences to conformation in globular prtoeins. Theory and application of expected information. , 1974, The Biochemical journal.

[16]  Harold L. Friedman,et al.  Image approximation to the reaction field , 1975 .

[17]  C. Chothia Structural invariants in protein folding , 1975, Nature.

[18]  C. Chothia The nature of the accessible and buried surfaces in proteins. , 1976, Journal of molecular biology.

[19]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[20]  E. Reingold,et al.  Combinatorial Algorithms: Theory and Practice , 1977 .

[21]  Robert M. Stroud,et al.  The accuracy of refined protein structures: comparison of two independently refined models of bovine trypsin , 1978 .

[22]  M. Levitt,et al.  Conformation of amino acid side-chains in proteins. , 1978, Journal of molecular biology.

[23]  Arieh Warshel,et al.  Calculations of chemical processes in solutions , 1979 .

[24]  F. Gurd,et al.  Electrostatic effects in hemoglobin: hydrogen ion equilibria in human deoxy- and oxyhemoglobin A. , 1979, Biochemistry.

[25]  A. T. Hagler,et al.  Consistent force field studies of intermolecular forces in hydrogen-bonded crystals. 2. A benchmark for the objective comparison of alternative force fields , 1979 .

[26]  M. James,et al.  Comparison of the predicted model of α-lytic protease with the X-ray structure , 1979, Nature.

[27]  L. Delbaere,et al.  Protein structure refinement: Streptomyces griseus serine protease A at 1.8 A resolution. , 1979, Journal of molecular biology.

[28]  S. Lifson,et al.  Consistent force field studies of intermolecular forces in hydrogen-bonded crystals. 1. Carboxylic acids, amides, and the C:O.cntdot..cntdot..cntdot.H- hydrogen bonds , 1979 .

[29]  William R. Taylor,et al.  Analysis and prediction of protein β-sheet structures by a combinatorial approach , 1980, Nature.

[30]  E N Baker,et al.  Structure of actinidin, after refinement at 1.7 A resolution. , 1980, Journal of molecular biology.

[31]  E. Baker,et al.  Crystallographic refinement of the structure of actinidin at 1.7 Å resolution by fast Fourier least‐squares methods , 1980 .

[32]  J. Greer Comparative model-building of the mammalian serine proteases. , 1981, Journal of molecular biology.

[33]  B. Furie,et al.  Computer-generated models of blood coagulation factor Xa, factor IXa, and thrombin based upon structural homology with other serine proteases. , 1982, The Journal of biological chemistry.

[34]  The prediction of molecular conformation. , 1982, Biochemical Society transactions.

[35]  R J Read,et al.  Structure of the complex of Streptomyces griseus protease B and the third domain of the turkey ovomucoid inhibitor at 1.8-A resolution. , 1983, Biochemistry.

[36]  H. Berendsen,et al.  Computer simulation of the dynamics of hydrated protein crystals and its comparison with x-ray data. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[37]  B. L. Sibanda,et al.  Three-dimensional structure, specificity and catalytic mechanism of renin , 1983, Nature.

[38]  Timothy F. Havel,et al.  The theory and practice of distance geometry , 1983, Bulletin of Mathematical Biology.

[39]  M. Levitt Protein folding by restrained energy minimization and molecular dynamics. , 1983, Journal of molecular biology.

[40]  R J Read,et al.  Critical evaluation of comparative model building of Streptomyces griseus trypsin. , 1984, Biochemistry.

[41]  C Sander,et al.  On the use of sequence homologies to predict protein structure: identical pentapeptides can have completely different conformations. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[42]  M. Karplus,et al.  An analysis of incorrectly folded protein models. Implications for structure predictions. , 1984, Journal of molecular biology.

[43]  G. Rose,et al.  Hydrophobicity of amino acid residues in globular proteins. , 1985, Science.

[44]  B. Honig,et al.  On the calculation of electrostatic interactions in proteins. , 1985, Journal of molecular biology.

[45]  J Moult,et al.  Electron density calculations as an extension of protein structure refinement. Streptomyces griseus protease A at 1.5 A resolution. , 1983, Journal of molecular biology.

[46]  W. DeGrado,et al.  A predicted structure of calmodulin suggests an electrostatic basis for its function. , 1985, Proceedings of the National Academy of Sciences of the United States of America.

[47]  V. Madison Cyclic peptides revisited , 1985 .

[48]  Kurt Wüthrich,et al.  Secondary structure of the α-amylase polypeptide inhibitor Tendamistat from Streptomyces tendae determined in solution by 1H nuclear magnetic resonance , 1985 .

[49]  B. L. Sibanda,et al.  β-Hairpin families in globular proteins , 1985, Nature.

[50]  H. Scheraga,et al.  Use of buildup and energy‐minimization procedures to compute low‐energy structures of the backbone of enkephalin , 1985, Biopolymers.

[51]  T. A. Jones,et al.  Using known substructures in protein model building and crystallography. , 1986, The EMBO journal.

[52]  A. Lesk,et al.  The relation between the divergence of sequence and structure in proteins. , 1986, The EMBO journal.

[53]  A. D. McLachlan,et al.  Solvation energy in protein folding and binding , 1986, Nature.