Prediction of the native conformation of a polypeptide by a statistical‐mechanical procedure. I. Backbone structure of enkephalin

A new methodology for theoretically predicting the native, three‐dimensional structure of a polypeptide is presented. Based on equilibrium statistical mechanics, an algorithm has been designed to determine the probable conformation of a polypeptide by calculating conditional free‐energy maps for each residue of the macromolecule. The conditional free‐energy map of each residue is computed from a set of probability integrals, obtained by summing over the interaction energies of all pairs of nonbonded atoms of the whole molecule. By locating the region(s) of lowest free energy for each map, the probable conformation for each residue can be identified. The native structure of the polypeptide is assumed to be the combination of the probable conformations of the individual residues. All multidimensional probability integrals are evaluated by an adaptive Monte Carlo algorithm (SMAPPS—Statistical‐Mechanical Algorithm for Predicting Protein Structure). The Monte Carlo algorithm searches the entire conformational space, adjusting itself automatically to concentrate its sampling in regions where the magnitude of the integrand is largest (“importance sampling”). No assumptions are made about the native conformation. The only prior knowledge necessary for the prediction of the native conformation is the amino acid sequence of the polypeptide. To test the effectiveness of the algorithm, SMAPPS was applied to the prediction of the native conformation of the backbone of Met‐enkephalin, a pentapeptide. In the calculations, only the backbone dihedral angles (ϕ and ψ) were allowed to vary; all side‐chain (χ) and peptide‐bond (ω) dihedral angles were kept fixed at the values corresponding to the alleged global minimum energy previously determined by direct energy minimization. For each conformation generated randomly by the Monte Carlo algorithm, the total conformational energy of the polypeptide was obtained from established empirical potential energy functions. Solvent effects were not included in the computations. With this initial application of SMAPPS, three distinct low‐free‐energy β‐bend structures of Met‐enkephalin were found. In particular, one of the structures has a conformation remarkably similar to the one associated with the previously alleged global minimum energy. The two additional structures of the pentapeptide have conformational energies lower than the previously computed low‐energy structure. However, the Monte Carlo results are in agreement with an improved energy‐minimization procedure. These initial results on the backbone structure of Met‐enkephalin indicate that an equilibrium statistical‐mechanical procedure, coupled with an adaptive Monte Carlo algorithm, can overcome many of the problems associated with the standard methods of direct energy minimization.

[1]  H. Scheraga,et al.  Use of buildup and energy‐minimization procedures to compute low‐energy structures of the backbone of enkephalin , 1985, Biopolymers.

[2]  Shiliang Yu,et al.  Equilibrium distribution of conformation for d-Ala2-Ser5-enkephalin predicted by Monte Carlo calculation , 1984 .

[3]  A. Cooper,et al.  Protein fluctuations and the thermodynamic uncertainty principle. , 1984, Progress in biophysics and molecular biology.

[4]  M GayDavid,et al.  Algorithm 611: Subroutines for Unconstrained Minimization Using a Model/Trust-Region Approach , 1983 .

[5]  C. Bush,et al.  Monte Carlo calculations on the conformations of models for the glycopeptide linkage of glycoproteins. , 1983, Archives of biochemistry and biophysics.

[6]  H. Scheraga,et al.  Computed conformational states of the 20 naturally occurring amino acid residues and of the prototype residue α-aminobutyric acid , 1983 .

[7]  Hiroshi Wako,et al.  Statistical mechanical treatment of α-helices and extended structures in proteins with inclusion of short- and medium-range interactions , 1983 .

[8]  H. Scheraga,et al.  Energy parameters in polypeptides. 9. Updating of geometrical parameters, nonbonded interactions, and hydrogen bond interactions for the naturally occurring amino acids , 1983 .

[9]  H. Scheraga,et al.  Acceleration of convergence in Monte Carlo simulations of aqueous solutions using the metropolis algorithm. Hydrophobic hydration of methane , 1982 .

[10]  D. C. Rapaport,et al.  Evolution and stability of polypeptide chain conformation: a simulation study , 1981 .

[11]  H. Stuhrmann Anomalous small angle scattering , 1981, Quarterly Reviews of Biophysics.

[12]  M Karplus,et al.  The internal dynamics of globular proteins. , 1981, CRC critical reviews in biochemistry.

[13]  H. Scheraga,et al.  Ising Model Treatment of Short-Range Interactions in Polypeptides and Its Application to the Structure of Bovine Pancreatic Trypsin Inhibitor , 1980 .

[14]  F. James,et al.  Monte Carlo theory and practice , 1980 .

[15]  C. Cantor,et al.  Biophysical Chemistry: Part II: Techniques for the Study of Biological Structure and Function , 1980 .

[16]  Harold A. Scheraga,et al.  Conformational analysis of proteins: Algorithms and data structures for array processing , 1980 .

[17]  S. Premilat,et al.  Statistical molecular models for angiotensin II and enkephalin related to NMR coupling constants , 1980 .

[18]  Arnold T. Hagler,et al.  COMPUTER SIMULATION OF THE CONFORMATIONAL PROPERTIES OF OLIGOPEPTIDES. COMPARISON OF THEORETICAL METHODS AND ANALYSIS OF EXPERIMENTAL RESULTS , 1979 .

[19]  Georg E. Schulz,et al.  Principles of Protein Structure , 1979 .

[20]  M. Mezei,et al.  Ab initio calculation of the free energy of liquid water , 1978 .

[21]  G. Lepage A new algorithm for adaptive multidimensional integration , 1978 .

[22]  B. Maigret,et al.  Effects of long range interactions on the conformational statistics of short polypeptide chains generated by a Monte Carlo method , 1977 .

[23]  H. Scheraga,et al.  Enkephalin: conformational analysis by means of empirical energy calculations. , 1977, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Statistical mechanical treatment of protein conformation. 6. Elimination of empirical rules for prediction by use of a high-order probability. Correlation between the amino acid sequences and conformations for homologous neurotoxin proteins. , 1977, Macromolecules.

[25]  H. Scheraga,et al.  Conformational analysis of the 20 naturally occurring amino acid residues using ECEPP. , 1977, Macromolecules.

[26]  Harold A. Scheraga,et al.  On the Use of Classical Statistical Mechanics in the Treatment of Polymer Chain Conformation , 1976 .

[27]  H. Scheraga,et al.  Statistical mechanical treatment of protein conformation. III. Prediction of protein conformation based on a three-state model. , 1976, Macromolecules.

[28]  H. Scheraga,et al.  Statistical mechanical treatment of protein conformation. II. A three-state model for specific-sequence copolymers of amino acids. , 1976, Macromolecules.

[29]  H A Scheraga,et al.  Statistical mechanical treatment of protein conformation. I. Conformational properties of amino acids in proteins. , 1976, Macromolecules.

[30]  H. Scheraga,et al.  Energy parameters in polypeptides. VII. Geometric parameters, partial atomic charges, nonbonded interactions, hydrogen bond interactions, and intrinsic torsional potentials for the naturally occurring amino acids , 1975 .

[31]  H. Scheraga,et al.  Experimental and theoretical aspects of protein folding. , 1975, Advances in protein chemistry.

[32]  G. Torrie,et al.  Monte Carlo free energy estimates using non-Boltzmann sampling: Application to the sub-critical Lennard-Jones fluid , 1974 .

[33]  S. Lifson,et al.  Energy functions for peptides and proteins. I. Derivation of a consistent force field including the hydrogen bond from amide crystals. , 1974, Journal of the American Chemical Society.

[34]  S. Lifson,et al.  Energy functions for peptides and proteins. II. The amide hydrogen bond and calculation of amide crystal properties. , 1974, Journal of the American Chemical Society.

[35]  F. Hesselink Monte Carlo generation of polypeptide random coil conformations; the persistence vector and the chain vector distribution. , 1974, Biophysical chemistry.

[36]  A. J. Hopfinger,et al.  Conformational Properties of Macromolecules , 1973 .

[37]  J. Hermans,et al.  Conformational statistics of short chains of poly(L‐alanine) and poly(glycine) generated by Monte Carlo method and the partition function of chains with constrained ends , 1973 .

[38]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[39]  H. Scheraga,et al.  Conformational Energy Calculations. Thermodynamic Parameters of the Helix-Coil Transition for Poly(L-lysine) in Aqueous Salt Solution , 1973 .

[40]  H. Scheraga,et al.  Role of medium-range interactions in proteins. , 1973, Proceedings of the National Academy of Sciences of the United States of America.

[41]  Monte Carlo Calculations on Polypeptide Chains. VII. Higher Even Moments of the End‐to‐End Distance and Radius of the Gyration for Hard‐Sphere Models of Randomly Coiling Random Copolymers of Glycine and l‐Alanine , 1972 .

[42]  N. Go,et al.  A Model for the Helix-Coil Transition in Specific-Sequence Copolymers of Amino Acids , 1971 .

[43]  H A Scheraga,et al.  Predictions of structural homologies in cytochrome c proteins. , 1971, Archives of biochemistry and biophysics.

[44]  H. Scheraga,et al.  Prediction of structural homology between bovine -lactalbumin and hen egg white lysozyme. , 1971, Archives of biochemistry and biophysics.

[45]  I. C. O. B. Nomenclature IUPAC-IUB Commission on Biochemical Nomenclature. Abbreviations and symbols for the description of the conformation of polypeptide chains. Tentative rules (1969). , 1970, Biochemistry.

[46]  N. Go,et al.  Helix probability profiles of denatured proteins and their correlation with native structures. , 1970, Proceedings of the National Academy of Sciences of the United States of America.

[47]  O. Jardetzky,et al.  Nuclear magnetic resonance spectroscopy of amino acids, peptides, and proteins. , 1970, Advances in protein chemistry.

[48]  P. A. W. Lewis,et al.  A Pseudo-Random Number Generator for the System/360 , 1969, IBM Syst. J..

[49]  R. Fraser,et al.  2 – X-Ray Methods , 1969 .

[50]  C. Anfinsen,et al.  The kinetics of formation of native ribonuclease during oxidation of the reduced polypeptide chain. , 1961, Proceedings of the National Academy of Sciences of the United States of America.

[51]  S. Lifson,et al.  On the Theory of Helix—Coil Transition in Polypeptides , 1961 .

[52]  White Fh,et al.  Regeneration of native secondary and tertiary structures by air oxidation of reduced ribonuclease. , 1961 .

[53]  B. Zimm,et al.  Theory of the Phase Transition between Helix and Random Coil in Polypeptide Chains , 1959 .

[54]  C. Anfinsen,et al.  Reductive cleavage of disulfide bridges in ribonuclease. , 1957, Science.

[55]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[56]  J. Kirkwood Statistical Mechanics of Fluid Mixtures , 1935 .

[57]  Lars Onsager,et al.  Theories of Concentrated Electrolytes. , 1933 .