Tertiary structure prediction of mixed α/β proteins via energy minimization

We describe an improved algorithm for protein structure prediction, assuming that the location of secondary structural elements is known, with particular focus on prediction for proteins containing β‐strands. Hydrogen bonding terms are incorporated into the potential function, supplementing our previously developed residue‐residue potential which is based on a combination of database statistics and an excluded volume term. Two small mixed α/β proteins, 1‐CTF and BPTI, are studied. In order to obtain native‐like structures, it is necessary to allow the β‐strands in BPTI to distort substantially from an ideal geometry, and an automated algorithm to carry this out efficiently is presented. Simulated annealing Monte Carlo methods, which contain a genetic algorithm component as well, are used to produce an ensemble of low‐energy structures. For both proteins, a cluster of structures with low RMS deviation from the native structure is generated and the energetic ranking of this cluster is in the top 2 or 3 clusters obtained from simulations. These results are encouraging with regard to the possibility of constructing a robust procedure for tertiary folding which is applicable to β‐strand containing proteins. Proteins 33:240–252, 1998. © 1998 Wiley‐Liss, Inc.

[1]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[2]  F. Salemme,et al.  Conformational and geometrical properties of β-sheets in proteins: I. Parallel β-sheets☆ , 1981 .

[3]  F. Salemme,et al.  Conformational and geometrical properties of β-sheets in proteins: II. Antiparallel and mixed β-sheets , 1981 .

[4]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[5]  H. Charles Romesburg,et al.  Cluster analysis for researchers , 1984 .

[6]  A. Liljas,et al.  Structure of the C-terminal domain of the ribosomal protein L7/L12 from Escherichia coli at 1.7 A. , 1987, Journal of molecular biology.

[7]  P. Kraulis A program to produce both detailed and schematic plots of protein structures , 1991 .

[8]  G. Crippen,et al.  Contact potential that recognizes the correct folding of globular proteins. , 1992, Journal of molecular biology.

[9]  M J Sippl,et al.  Structure-derived hydrophobic potential. Hydrophobic potential derived from X-ray structures of globular proteins is able to identify native folds. , 1992, Journal of molecular biology.

[10]  R Unger,et al.  Genetic algorithms for protein folding simulations. , 1992, Journal of molecular biology.

[11]  Crevice‐forming mutants in the rigid core of bovine pancreatic trypsin inhibitor: Crystal structures of F22A, Y23A, N43G, and F45A , 1993, Protein science : a publication of the Protein Society.

[12]  Richard A. Friesner,et al.  Hierarchical algorithm for computer modeling of protein tertiary structure: folding of myoglobin to 6.2 .ANG. resolution , 1994 .

[13]  B Honig,et al.  An algorithm to generate low-resolution protein tertiary structures from knowledge of secondary structure. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[14]  R. Friesner,et al.  Computer modeling of protein folding: conformational and energetic analysis of reduced and detailed protein models. , 1995, Journal of molecular biology.

[15]  K. Dill,et al.  Statistical potentials extracted from protein structures: how accurate are they? , 1996, Journal of molecular biology.

[16]  Richard A. Friesner,et al.  Parallel implementation of a protein structure refinement algorithm , 1996 .

[17]  J. Skolnick,et al.  MONSSTER: a method for folding globular proteins with a small number of distance restraints. , 1997, Journal of molecular biology.