Searching for the ideal forms of proteins.

A modification of the Structure Alignment Program (SAP), combined with a novel automatic method for defining structural elements, correctly identified the core folds of a variety of small beta/alpha proteins when they were compared with a series of ideal architectures. This approach opens the possibility not just of determining whether one structure is like another but, given a range of ideal forms, of determining what the protein is. Preliminary studies have shown it to work equally well on the all-alpha and all-beta classes of protein, each of which has corresponding ideal forms. Given the speed of the algorithm, it will be possible to compare all of these forms against the Protein Structure Database and determine the extent to which the current ideal forms can account for the variety of protein structure. Analysis of the structures that remain unaccounted for should provide a basis for the development of further forms.
