A novel method for sampling alpha-helical protein backbones.

We present a novel technique of sampling the configurations of helical proteins. Assuming knowledge of native secondary structure, we employ assembly rules gathered from a database of existing structures to enumerate the geometrically possible three-dimensional arrangements of the constituent helices. We produce a library of possible folds for 25 helical protein cores. In each case, our method finds significant numbers of conformations close to the native structure. In addition, we assign coordinates to all atoms for four of the 25 proteins and show that this has a small effect on the number of near-native conformations. In the context of database driven exhaustive enumeration our method performs extremely well, yielding significant percentages of conformations (between 0.02% and 82%) within 6 A of the native structure. The method's speed and efficiency make it a valuable tool for predicting protein structure.

[1]  M. Levitt A simplified representation of protein conformations for rapid simulation of protein folding. , 1976, Journal of molecular biology.

[2]  Irwin D. Kuntz,et al.  Application of distance geometry to protein tertiary structure calculations , 1979 .

[3]  F M Richards,et al.  Packing of alpha-helices: geometrical constraints and contact areas. , 1978, Journal of molecular biology.

[4]  W. Braun,et al.  Predicting the helix packing of globular proteins by self‐correcting distance geometry , 1995, Protein science : a publication of the Protein Society.

[5]  K. Dill,et al.  A lattice statistical mechanics model of the conformational and sequence spaces of proteins , 1989 .

[6]  P. Kraulis A program to produce both detailed and schematic plots of protein structures , 1991 .

[7]  C. Tanford Macromolecules , 1994, Nature.

[8]  R. Levy,et al.  Global folding of proteins using a limited number of distance constraints. , 1993, Protein engineering.

[9]  A. Kolinski,et al.  Simulations of the Folding of a Globular Protein , 1990, Science.

[10]  C. Chothia,et al.  Population statistics of protein structures: lessons from structural classifications. , 1997, Current opinion in structural biology.

[11]  G Chelvanayagam,et al.  A combinatorial distance-constraint approach to predicting protein tertiary models from known secondary structure. , 1998, Folding & design.

[12]  D. Covell,et al.  Lattice model simulations of polypeptide chain folding. , 1994, Journal of molecular biology.

[13]  M. Levitt,et al.  A lattice model for protein structure prediction at low resolution. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[14]  W. Taylor,et al.  Global fold determination from a small number of distance restraints. , 1995, Journal of molecular biology.

[15]  M. Peitsch,et al.  A protein conformational search space defined by secondary structure contacts. , 1998, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[16]  Timothy F. Havel An evaluation of computational strategies for use in the determination of protein structure from distance constraints obtained by nuclear magnetic resonance. , 1991, Progress in biophysics and molecular biology.

[17]  E. Huang,et al.  Ab initio fold prediction of small helical proteins using distance geometry and knowledge-based scoring functions. , 1999, Journal of molecular biology.

[18]  M. Levitt,et al.  Exploring conformational space with a simple lattice model for protein structure. , 1994, Journal of molecular biology.

[19]  J. Skolnick,et al.  Fold assembly of small proteins using monte carlo simulations driven by restraints derived from multiple sequence alignments. , 1998, Journal of molecular biology.

[20]  C Kooperberg,et al.  Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. , 1997, Journal of molecular biology.

[21]  S. Henikoff,et al.  Amino acid substitution matrices from protein blocks. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[22]  J. Skolnick,et al.  What is the probability of a chance prediction of a protein structure with an rmsd of 6 A? , 1998, Folding & design.

[23]  Population statistics of protein structures , 1997 .

[24]  M J Sternberg,et al.  On the use of chemically derived distance constraints in the prediction of protein structure with myoglobin as an example. , 1980, Journal of molecular biology.

[25]  F E Cohen,et al.  Protein folding: evaluation of some simple rules for the assembly of helices into tertiary structures with myoglobin as an example. , 1979, Journal of molecular biology.

[26]  N. Go Theoretical studies of protein folding. , 1983, Annual review of biophysics and bioengineering.

[27]  T L Blundell,et al.  Packing of secondary structural elements in proteins. Analysis and prediction of inter-helix distances. , 1993, Journal of molecular biology.

[28]  V A Eyrich,et al.  Prediction of protein tertiary structure to low resolution: performance for a large and structurally diverse test set. , 1999, Journal of molecular biology.

[29]  M. Levitt Accurate modeling of protein conformation by automatic segment matching. , 1992, Journal of molecular biology.

[30]  M. Levitt,et al.  Computer simulation of protein folding , 1975, Nature.

[31]  H. Scheraga,et al.  A Method for Predicting Nucleation Sites for Protein Folding Based on Hydrophobic Contacts , 1978 .

[32]  M. Sippl Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. , 1990, Journal of molecular biology.

[33]  D. Baker,et al.  Improved recognition of native‐like protein structures using a combination of sequence‐dependent and sequence‐independent features of proteins , 1999, Proteins.

[34]  M. Levitt,et al.  Energy functions that discriminate X-ray and near native folds from well-constructed decoys. , 1996, Journal of molecular biology.

[35]  Anthony K. Felts,et al.  Protein tertiary structure prediction using a branch and bound algorithm , 1999, Proteins.

[36]  Frederic M. Richards,et al.  Packing of α-helices: Geometrical constraints and contact areas☆ , 1978 .

[37]  S Brunak,et al.  Relationship between protein structure and geometrical constraints , 1996, Protein science : a publication of the Protein Society.

[38]  R. Samudrala,et al.  An all-atom distance-dependent conditional probability discriminatory function for protein structure prediction. , 1998, Journal of molecular biology.

[39]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.