Distance geometry generates native‐like folds for small helical proteins using the consensus distances of predicted protein structures

For successful ab initio protein structure prediction, a method is needed to identify native‐like structures from a set containing both native and non‐native protein‐like conformations. In this regard, the use of distance geometry has shown promise when accurate inter‐residue distances are available. We describe a method by which distance geometry restraints are culled from sets of 500 protein‐like conformations for four small helical proteins generated by the method of Simons et al. (1997). A consensus‐based approach was applied in which every inter‐Cα distance was measured, and the most frequently occurring distances were used as input restraints for distance geometry. For each protein, a structure with lower coordinate root‐mean‐square (RMS) error than the mean of the original set was constructed; in three cases the topology of the fold resembled that of the native protein. When the fold sets were filtered for the best scoring conformations with respect to an all‐atom knowledge‐based scoring function, the remaining subset of 50 structures yielded restraints of higher accuracy. A second round of distance geometry using these restraints resulted in an average coordinate RMS error of 4.38 Å.

[1]  René Dionne Etude Et Extension D’Un Algorithme De Murghland , 1978 .

[2]  R. Friesner,et al.  Computer modeling of protein folding: conformational and energetic analysis of reduced and detailed protein models. , 1995, Journal of molecular biology.

[3]  M. Levitt,et al.  Exploring conformational space with a simple lattice model for protein structure. , 1994, Journal of molecular biology.

[4]  P Argos,et al.  Folding the main chain of small proteins with the genetic algorithm. , 1994, Journal of molecular biology.

[5]  R. Srinivasan,et al.  LINUS: A hierarchic procedure to predict the fold of a protein , 1995, Proteins.

[6]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[7]  D. Covell Folding protein α‐carbon chains into compact forms by monte carlo methods , 1992 .

[8]  Timothy F. Havel,et al.  The sampling properties of some distance geometry algorithms applied to unconstrained polypeptide chains: A study of 1830 independently computed conformations , 1990, Biopolymers.

[9]  J. Skolnick,et al.  Monte carlo simulations of protein folding. I. Lattice model and interaction scheme , 1994, Proteins.

[10]  A Kolinski,et al.  Prediction of the folding pathways and structure of the GCN4 leucine zipper. , 1994, Journal of molecular biology.

[11]  D Baker,et al.  Local sequence-structure correlations in proteins. , 1996, Current opinion in biotechnology.

[12]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1978, Archives of biochemistry and biophysics.

[13]  Timothy F. Havel,et al.  The theory and practice of distance geometry , 1985 .

[14]  E S Huang,et al.  Factors affecting the ability of energy functions to discriminate correct from incorrect folds. , 1997, Journal of molecular biology.

[15]  D. Eisenberg,et al.  An evolutionary approach to folding small alpha-helical proteins that uses sequence information and an empirical guiding fitness function. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Roland L. Dunbrack,et al.  Prediction of protein side-chain rotamers from a backbone-dependent rotamer library: a new homology modeling tool. , 1997, Journal of molecular biology.

[17]  J. Ponder,et al.  The NMR solution structure of intestinal fatty acid-binding protein complexed with palmitate: application of a novel distance geometry algorithm. , 1996, Journal of molecular biology.

[18]  Irwin D. Kuntz,et al.  Application of distance geometry to protein tertiary structure calculations , 1979 .

[19]  D. Covell,et al.  Lattice model simulations of polypeptide chain folding. , 1994, Journal of molecular biology.

[20]  J. Skolnick,et al.  Monte carlo simulations of protein folding. II. Application to protein A, ROP, and crambin , 1994, Proteins.

[21]  W. Taylor,et al.  Global fold determination from a small number of distance restraints. , 1995, Journal of molecular biology.

[22]  M. Levitt,et al.  A lattice model for protein structure prediction at low resolution. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[23]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[24]  P Argos,et al.  Identifying the tertiary fold of small proteins with different topologies from sequence and secondary structure using the genetic algorithm and extended criteria specific for strand regions. , 1996, Journal of molecular biology.

[25]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1978, Archives of biochemistry and biophysics.

[26]  W. Braun,et al.  Predicting the helix packing of globular proteins by self‐correcting distance geometry , 1995, Protein science : a publication of the Protein Society.

[27]  K Yue,et al.  Folding proteins with a simple energy function and extensive conformational searching , 1996, Protein science : a publication of the Protein Society.

[28]  Roland L. Dunbrack,et al.  Backbone-dependent rotamer library for proteins. Application to side-chain prediction. , 1993, Journal of molecular biology.

[29]  S. Doniach,et al.  A computer model to dynamically simulate protein folding: Studies with crambin , 1989, Proteins.

[30]  I D Kuntz,et al.  Effects of limited input distance constraints upon the distance geometry algorithm , 1991, Biopolymers.

[31]  Timothy F. Havel,et al.  A distance geometry program for determining the structures of small proteins and other macromolecules from nuclear magnetic resonance measurements of intramolecular1H−1H proximities in solution , 1984 .

[32]  C Kooperberg,et al.  Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. , 1997, Journal of molecular biology.

[33]  D G Covell Folding protein alpha-carbon chains into compact forms by Monte Carlo methods. , 1992, Proteins.

[34]  J. Skolnick,et al.  MONSSTER: a method for folding globular proteins with a small number of distance restraints. , 1997, Journal of molecular biology.

[35]  W. Braun,et al.  Pattern recognition and self‐correcting distance geometry calculations applied to myohemerythrin , 1994, FEBS letters.

[36]  P Tufféry,et al.  Prediction of protein side chain conformations: a study on the influence of backbone accuracy on conformation stability in the rotamer space. , 1997, Protein engineering.

[37]  M. Levitt,et al.  Accuracy of side‐chain prediction upon near‐native protein backbones generated by ab initio folding methods , 1998, Proteins.

[38]  Conrad C. Huang,et al.  The MIDAS display system , 1988 .

[39]  K. Dill,et al.  A simple protein folding algorithm using a binary code and secondary structure constraints. , 1995, Protein engineering.

[40]  R. Samudrala,et al.  An all-atom distance-dependent conditional probability discriminatory function for protein structure prediction. , 1998, Journal of molecular biology.