Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features

For a successful analysis of the relation between amino acid sequence and protein structure, an unambiguous and physically meaningful definition of secondary structure is essential. We have developed a set of simple and physically motivated criteria for secondary structure, programmed as a pattern‐recognition process of hydrogen‐bonded and geometrical features extracted from x‐ray coordinates. Cooperative secondary structure is recognized as repeats of the elementary hydrogen‐bonding patterns “turn” and “bridge.” Repeating turns are “helices,” repeating bridges are “ladders,” connected ladders are “sheets.” Geometric structure is defined in terms of the concepts torsion and curvature of differential geometry. Local chain “chirality” is the torsional handedness of four consecutive Cα positions and is positive for right‐handed helices and negative for ideal twisted β‐sheets. Curved pieces are defined as “bends.” Solvent “exposure” is given as the number of water molecules in possible contact with a residue. The end result is a compilation of the primary structure, including SS bonds, secondary structure, and solvent exposure of 62 different globular proteins. The presentation is in linear form: strip graphs for an overall view and strip tables for the details of each of 10.925 residues. The dictionary is also available in computer‐readable form for protein structure prediction work.

[1]  L. Pauling,et al.  Configurations of Polypeptide Chains With Favored Orientations Around Single Bonds: Two New Pleated Sheets. , 1951, Proceedings of the National Academy of Sciences of the United States of America.

[2]  P. Flory,et al.  Conformational energy estimates for statistically coiling polypeptide chains , 1967 .

[3]  M. Levitt,et al.  Refinement of protein conformations using a macromolecular energy minimization procedure. , 1969, Journal of molecular biology.

[4]  B. Lee,et al.  The interpretation of protein structures: estimation of static accessibility. , 1971, Journal of molecular biology.

[5]  R. Diamond A real-space refinement procedure for proteins , 1971 .

[6]  A. Shrake,et al.  Environment and exposure to solvent of protein atoms. Lysozyme and insulin. , 1973, Journal of molecular biology.

[7]  H. Scheraga,et al.  Chain reversals in proteins. , 1973, Biochimica et biophysica acta.

[8]  G. Schulz,et al.  Three-dimensional structure of adenyl kinase , 1974, Nature.

[9]  J. Deisenhofer,et al.  Crystallographic refinement of the structure of bovine pancreatic trypsin inhibitor at l.5 Å resolution , 1975 .

[10]  C. Chothia Structural invariants in protein folding , 1975, Nature.

[11]  N. W. Isaacs,et al.  A method for fitting satisfactory models to sets of atomic positions in protein structure refinements , 1976 .

[12]  R. Dickerson,et al.  The structure of Paracoccus denitrificans cytochrome c550. , 1976, The Journal of biological chemistry.

[13]  M. Levitt,et al.  Automatic identification of secondary structure in globular proteins. , 1977, Journal of molecular biology.

[14]  G. Rose,et al.  A new algorithm for finding the peptide chain turns in a globular protein. , 1977, Journal of molecular biology.

[15]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[16]  R. Stroud,et al.  Difference Fourier refinement of the structure of DIP‐trypsin at 1.5 Å with a minicomputer technique , 1977 .

[17]  George M. Church,et al.  A structure-factor least-squares refinement procedure for macromolecular structures using constrained and restrained parameters , 1977 .

[18]  P. Y. Chou,et al.  β-turns in proteins☆ , 1977 .

[19]  Michael Levitt,et al.  Refinement of Large Structures by Simultaneous Minimization of Energy and R Factor , 1978 .

[20]  S. Rackovsky,et al.  Differential Geometry and Polymer Conformation. 1. Comparison of Protein Conformations1a,b , 1978 .

[21]  H. Schenk,et al.  Computing in Crystallography , 1978 .

[22]  R. C. Agarwal A new least‐squares refinement technique based on the fast Fourier transform algorithm: erratum , 1978 .

[23]  K. Wüthrich,et al.  Nuclear magnetic resonance of labile protons in the basic pancreatic trypsin inhibitor. , 1979, Journal of molecular biology.

[24]  S. Lifson,et al.  Consistent force field studies of intermolecular forces in hydrogen-bonded crystals. 1. Carboxylic acids, amides, and the C:O.cntdot..cntdot..cntdot.H- hydrogen bonds , 1979 .

[25]  C. Sander,et al.  Specific recognition in the tertiary structure of β-sheets of proteins , 1980 .

[26]  K. Kopple,et al.  Reverse Turns in Peptides and Protein , 1980 .

[27]  R. Diamond,et al.  Computing in crystallography , 1980 .

[28]  A. Dunker,et al.  Determination of the secondary structure of proteins from the amide I band of the laser Raman spectrum. , 1981, Journal of molecular biology.

[29]  I. Wilson,et al.  Structure of the haemagglutinin membrane glycoprotein of influenza virus at 3 Å resolution , 1981, Nature.

[30]  Johnson Wc,et al.  Information content in the circular dichroism of proteins. , 1981 .

[31]  S. Provencher,et al.  Estimation of globular protein secondary structure from circular dichroism. , 1981, Biochemistry.

[32]  J. Richardson,et al.  The anatomy and taxonomy of protein structure. , 1981, Advances in protein chemistry.