Prediction of protein structure from ideal forms

For many years it has been accepted that the sequence of a protein can specify its three‐dimensional structure. However, there has been limited progress in explaining how the sequence dictates its fold, and no attempt to do this computationally without the use of specific structural data has ever succeeded for any protein larger than 100 residues. We describe a method that can predict complex folds of up to almost 200 residues using only basic principles that do not include any elements of sequence homology. The method does not simulate the folding chain but generates many thousands of models based on an idealized representation of structure. Each rough model is scored and the best are refined. On a set of five proteins, the correct fold scored well, and when tested on a set of larger proteins, the correct fold was ranked highest for some proteins of more than 150 residues, with others being close topological variants. All other methods that approach this level of success rely on the use of templates or fragments of known structures. Our method is unique in using a database of ideal models based on general packing rules that, in spirit, is closer to an ab initio approach. Proteins 2008. © 2008 Wiley‐Liss, Inc.
