MONSSTER: a method for folding globular proteins with a small number of distance restraints.

The MONSSTER (MOdeling of New Structures from Secondary and TEritary Restraints) method for folding of proteins using a small number of long-distance restraints (which can be up to seven times less than the total number of residues) and some knowledge of the secondary structure of regular fragments is described. The method employs a high-coordination lattice representation of the protein chain that incorporates a variety of potentials designed to produce protein-like behaviour. These include statistical preferences for secondary structure, side-chain burial interactions, and a hydrogen-bond potential. Using this algorithm, several globular proteins (1ctf, 2gbl, 2trx, 3fxn, 1mba, 1pcy and 6pti) have been folded to moderate-resolution, native-like compact states. For example, the 68 residue 1ctf molecule having ten loosely defined, long-range restraints was reproducibly obtained with a C alpha-backbone root-mean-square deviation (RMSD) from native of about 4. A. Flavodoxin with 35 restraints has been folded to structures whose average RMSD is 4.28 A. Furthermore, using just 20 restraints, myoglobin, which is a 146 residue helical protein, has been folded to structures whose average RMSD from native is 5.65 A. Plastocyanin with 25 long-range restraints adopts conformations whose average RMSD is 5.44 A. Possible applications of the proposed approach to the refinement of structures from NMR data, homology model-building and the determination of tertiary structure when the secondary structure and a small number of restraints are predicted are briefly discussed.

[1]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[2]  R. M. Burnett,et al.  Structure of the semiquinone form of flavodoxin from Clostridum MP. Extension of 1.8 A resolution and some comparisons with the oxidized state. , 1978, Journal of molecular biology.

[3]  M. Levitt,et al.  Automatic identification of secondary structure in globular proteins. , 1977, Journal of molecular biology.

[4]  L M Amzel,et al.  Preliminary refinement and structural analysis of the Fab fragment from human immunoglobulin new at 2.0 A resolution. , 1981, The Journal of biological chemistry.

[5]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[6]  J. Guss,et al.  Structure of oxidized poplar plastocyanin at 1.6 A resolution. , 1983, Journal of molecular biology.

[7]  N Go,et al.  Calculation of protein conformations by proton-proton distance constraints. A new efficient algorithm. , 1985, Journal of molecular biology.

[8]  Timothy F. Havel,et al.  An evaluation of the combined use of nuclear magnetic resonance and distance geometry for the determination of protein conformations in solution. , 1985, Journal of molecular biology.

[9]  A. Liljas,et al.  Structure of the C-terminal domain of the ribosomal protein L7/L12 from Escherichia coli at 1.7 A. , 1987, Journal of molecular biology.

[10]  C. Woodward,et al.  Structure of form III crystals of bovine pancreatic trypsin inhibitor. , 1987, Journal of molecular biology.

[11]  A Coda,et al.  Aplysia limacina myoglobin. Crystallographic analysis at 1.6 A resolution. , 1989, Journal of molecular biology.

[12]  H. Eklund,et al.  Crystal structure of thioredoxin from Escherichia coli at 1.68 A resolution. , 1990, Journal of molecular biology.

[13]  A V Finkelstein,et al.  The classification and origins of protein folding patterns. , 1990, Annual review of biochemistry.

[14]  A. Gronenborn,et al.  A novel, highly stable fold of the immunoglobulin binding domain of streptococcal protein G. , 1993, Science.

[15]  K Wüthrich,et al.  Efficient computation of three-dimensional protein structures in solution from nuclear magnetic resonance data using the program DIANA and the supporting programs CALIBA, HABAS and GLOMSA. , 1991, Journal of molecular biology.

[16]  R J Williams,et al.  Topological mirror images in protein structure computation: An underestimated problem , 1991, Proteins.

[17]  P. Kraulis A program to produce both detailed and schematic plots of protein structures , 1991 .

[18]  A. Kajava,et al.  Left‐handed topology of super‐secondary structure formed by aligned α‐helix and β‐hairpin , 1992 .

[19]  R. Levy,et al.  Global folding of proteins using a limited number of distance constraints. , 1993, Protein engineering.

[20]  Adam Godzik,et al.  Lattice representations of globular proteins: How good are they? , 1993, J. Comput. Chem..

[21]  B. Rost,et al.  Prediction of protein secondary structure at better than 70% accuracy. , 1993, Journal of molecular biology.

[22]  G M Clore,et al.  Exploring the limits of precision and accuracy of protein structures determined by nuclear magnetic resonance spectroscopy. , 1993, Journal of molecular biology.

[23]  A. Gronenborn,et al.  Where is NMR taking us? , 1994, Proteins.

[24]  J. Skolnick,et al.  Monte carlo simulations of protein folding. I. Lattice model and interaction scheme , 1994, Proteins.

[25]  A Kolinski,et al.  Prediction of the folding pathways and structure of the GCN4 leucine zipper. , 1994, Journal of molecular biology.

[26]  J. Skolnick,et al.  Monte carlo simulations of protein folding. II. Application to protein A, ROP, and crambin , 1994, Proteins.

[27]  W. Taylor,et al.  Global fold determination from a small number of distance restraints. , 1995, Journal of molecular biology.

[28]  M. Clamp,et al.  Lattice models of protein folding. , 1995, Biochemical Society transactions.

[29]  B. Kolmerer,et al.  The complete primary structure of human nebulin and its correlation to muscle structure. , 1995, Journal of molecular biology.

[30]  W. Braun,et al.  Automated assignment of simulated and experimental NOESY spectra of proteins by feedback filtering and self-correcting distance geometry. , 1995, Journal of molecular biology.

[31]  A Kolinski,et al.  Does a backwardly read protein sequence have a unique native state? , 1996, Protein engineering.

[32]  J. Skolnick,et al.  Lattice Models of Protein Folding, Dynamics and Thermodynamics , 1996 .

[33]  A Kolinski,et al.  Folding simulations and computer redesign of protein A three‐helix bundle motifs , 1996, Proteins.

[34]  A Kolinski,et al.  A method for the prediction of surface “U”‐turns and transglobular connections in small proteins , 1997, Proteins.

[35]  A. Godzik,et al.  Derivation and testing of pair potentials for protein folding. When is the quasichemical approximation correct? , 1997, Protein science : a publication of the Protein Society.