Improved greedy algorithm for protein structure reconstruction

This article concerns the development of an improved greedy algorithm for protein structure reconstruction. Our stochastic greedy algorithm, which attempts to locate the ground state of an approximate energy function, exploits the fact that protein structures consist of overlapping structural building blocks that are not independent. Application of this approach to a series of 16 proteins with 50–250 amino acids leads to predicted models deviating from the experimental structures by 0.5 Å RMSD using an RMSD‐based energy function and within 1.5 to 4.8 Å RMSD using a Go‐based energy function. The Go‐based results are significant because they illustrate the strength of combining structural fragments and stochastic greedy algorithms in capturing the native structures of proteins stabilized by long‐range interactions separated by more than 30 amino acids. These results clearly open the door to less computationally demanding solutions to predict structures from sequences. © 2005 Wiley Periodicals, Inc. J Comput Chem 26: 506–513, 2005

[1]  U. Hansmann Parallel tempering algorithm for conformational studies of biological molecules , 1997, physics/9710041.

[2]  J. Onuchic,et al.  Prediction of folding mechanism for circular-permuted proteins. , 2001, Journal of molecular biology.

[3]  I D Kuntz,et al.  Peter Andrew Kollman , 2001, Proteins.

[4]  A Maritan,et al.  Recurrent oligomers in proteins: An optimal scheme reconciling accurate and concise backbone representations in automated folding and design studies , 2000, Proteins.

[5]  R. Abagyan,et al.  Biased probability Monte Carlo conformational searches and electrostatic calculations for peptides and proteins. , 1994, Journal of molecular biology.

[6]  D. Baker,et al.  Prospects for ab initio protein structural genomics. , 2001, Journal of molecular biology.

[7]  J. Gunn Sampling protein conformations using segment libraries and a genetic algorithm , 1997 .

[8]  Songde Ma,et al.  Protein folding simulations of the hydrophobic–hydrophilic model by combining tabu search with genetic algorithms , 2003 .

[9]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[10]  P. Argos,et al.  Knowledge‐based protein secondary structure assignment , 1995, Proteins.

[11]  David C. Jones,et al.  CATH--a hierarchic classification of protein domain structures. , 1997, Structure.

[12]  K Yue,et al.  Constraint-based assembly of tertiary protein structures from secondary structure elements. , 2000, Protein science : a publication of the Protein Society.

[13]  A C Camproux,et al.  A hidden markov model derived structural alphabet for proteins. , 2004, Journal of molecular biology.

[14]  M. Zuker,et al.  The alignment of protein structures in three dimensions. , 1989, Bulletin of mathematical biology.

[15]  Eric J. Sorin,et al.  β-hairpin folding simulations in atomistic detail using an implicit solvent model1 , 2001 .

[16]  Yuko Okamoto,et al.  Prediction of peptide conformation by multicanonical algorithm: New approach to the multiple‐minima problem , 1993, J. Comput. Chem..

[17]  H. Scheraga,et al.  Calculation of protein conformation by the build-up procedure. Application to bovine pancreatic trypsin inhibitor using limited simulated nuclear magnetic resonance data. , 1988, Journal of biomolecular structure & dynamics.

[18]  N. Go,et al.  Studies on protein folding, unfolding, and fluctuations by computer simulation. II. A. Three‐dimensional lattice model of lysozyme , 1978 .

[19]  M. Levitt,et al.  Small libraries of protein fragments model native protein structures accurately. , 2002, Journal of molecular biology.

[20]  Thomas Lengauer,et al.  A fast flexible docking method using an incremental construction algorithm. , 1996, Journal of molecular biology.

[21]  Christopher Bystroff,et al.  Fully automated ab initio protein structure prediction using I-STES, HMMSTR and ROSETTA , 2002, ISMB.

[22]  M. Levitt,et al.  The complexity and accuracy of discrete state models of protein structure. , 1995, Journal of molecular biology.

[23]  H. Scheraga,et al.  Use of buildup and energy‐minimization procedures to compute low‐energy structures of the backbone of enkephalin , 1985, Biopolymers.

[24]  Normand Mousseau,et al.  Sampling the complex energy landscape of a simple β-hairpin , 2003 .