Parallel and Antiparallel β-Strands Differ in Amino Acid Composition and Availability of Short Constituent Sequences

One of the important secondary structures in proteins is the β-strand. However, due to its complexity, it is less characterized than helical structures. Using the 1641 representative three-dimensional protein structure data from the Protein Data Bank, we characterized β-strand structures based on strand length and amino acid composition, focusing on differences between parallel and antiparallel β-strands. Antiparallel strands were more frequent and slightly longer than parallel strands. Overall, the majority of β-sheets were antiparallel sheets; however, mixed sheets were reasonably abundant, and parallel sheets were relatively rare. Notably, the nonpolar, aliphatic hydrocarbon amino acids, valine, isoleucine, and leucine were observed at a high frequency in both strands but were more abundant in parallel than in antiparallel strands. The relative amino acid occurrence in β-sheets, especially in parallel strands, was highly correlated with amino acid hydrophobicity. This correlation was not observed in α-helices and 3(10)-helices. In addition, we examined the frequency of 400 amino acid doublets and 8000 amino acid triplets in β-strands based on availability, a measurement of the relative counts of the doublets and triplets. We identified some triplets that were specifically found in either parallel or antiparallel strands. We further identified "zero-count triplets" which did not occur in either parallel or antiparallel strands, despite the fact that they were probabilistically supposed to occur several times. Taken together, the present study revealed essential features of β-strand structures and the differences between parallel and antiparallel β-strands, which can potentially be applied to the secondary structure prediction and the functional design of protein sequences in the future.

[1]  W. R. Krigbaum,et al.  Prediction of the amount of secondary structure in a globular protein from its aminoacid composition. , 1973, Proceedings of the National Academy of Sciences of the United States of America.

[2]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[3]  S. Cutler,et al.  C-terminal motif prediction in eukaryotic proteomes using comparative genomics and statistical over-representation across protein families , 2007, BMC Genomics.

[4]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[5]  T. Creamer,et al.  Side‐chain entropy effects on protein secondary structure formation , 2005, Proteins.

[6]  J. Nowick Exploring beta-sheet structure and interactions with chemical model systems. , 2008, Accounts of chemical research.

[7]  H. Scheraga,et al.  Proline‐induced constraints in α‐helices , 1987, Biopolymers.

[8]  Tomonori Gotoh,et al.  Availability of short amino acid sequences in proteins , 2005, Protein science : a publication of the Protein Society.

[9]  J. Thornton,et al.  Influence of proline residues on protein conformation. , 1991, Journal of molecular biology.

[10]  Hideo Matsuda,et al.  PDB-REPRDB: a database of representative protein chains from the Protein Data Bank (PDB) , 2001, Nucleic Acids Res..

[11]  J. Thornton,et al.  Prediction of strand pairing in antiparallel and parallel β‐sheets using information theory , 2002, Proteins.

[12]  Renate Kania,et al.  Storing and Annotating of Kinetic Data , 2007, Silico Biol..

[13]  Tao Zhang,et al.  Prediction of the parallel/antiparallel orientation of beta-strands using amino acid pairing preferences and support vector machines. , 2010, Journal of theoretical biology.

[14]  Tomonori Gotoh,et al.  Potential implications of availability of short amino acid sequences in proteins: an old and new approach to protein decoding and design. , 2008, Biotechnology annual review.

[15]  G Klebe,et al.  Cooperative effects in hydrogen‐bonding of protein secondary structure elements: A systematic analysis of crystal data using Secbase , 2005, Proteins.

[16]  P. S. Kim,et al.  Context-dependent secondary structure formation of a designed protein sequence , 1996, Nature.

[17]  B. Honig,et al.  Free energy determinants of secondary structure formation: II. Antiparallel beta-sheets. , 1995, Journal of molecular biology.

[18]  R. Doolittle,et al.  A simple method for displaying the hydropathic character of a protein. , 1982, Journal of molecular biology.

[19]  P. Y. Chou,et al.  Prediction of protein conformation. , 1974, Biochemistry.

[20]  R. Williams,et al.  Secondary structure predictions and medium range interactions. , 1987, Biochimica et biophysica acta.

[21]  L. Pauling,et al.  The structure of proteins; two hydrogen-bonded helical configurations of the polypeptide chain. , 1951, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Tamir Tuller,et al.  Forbidden penta‐peptides , 2007, Protein science : a publication of the Protein Society.

[23]  Shneior Lifson,et al.  Antiparallel and parallel β-strands differ in amino acid residue preferences , 1979, Nature.

[24]  J. Thornton,et al.  PROMOTIF—A program to identify and analyze structural motifs in proteins , 1996, Protein science : a publication of the Protein Society.

[25]  Jishou Ruan,et al.  The interstrand amino acid pairs play a significant role in determining the parallel or antiparallel orientation of beta-strands. , 2009, Biochemical and biophysical research communications.

[26]  M J Sternberg,et al.  On the conformation of proteins: hydrophobic ordering of strands in beta-pleated sheets. , 1977, Journal of molecular biology.

[27]  P. S. Kim,et al.  Context is a major determinant of β-sheet propensity , 1994, Nature.

[28]  V. Lim Algorithms for prediction of α-helical and β-structural regions in globular proteins , 1974 .

[29]  Tomonori Gotoh,et al.  Secondary Structure Characterization Based on Amino Acid Composition and Availability in Proteins , 2010, J. Chem. Inf. Model..

[30]  P. S. Kim,et al.  Measurement of the β-sheet-forming propensities of amino acids , 1994, Nature.

[31]  B. Persson,et al.  Characterization of oligopeptide patterns in large protein sets , 2007, BMC Genomics.

[32]  L. Pauling,et al.  Configurations of Polypeptide Chains With Favored Orientations Around Single Bonds: Two New Pleated Sheets. , 1951, Proceedings of the National Academy of Sciences of the United States of America.