Preference of Amino Acids in Different Protein Structural Classes: A Database Analysis

Understanding sequence-structure relationship is the key step in protein modeling and de novo protein design. Although almost 55,000 protein structures are solved and stored in protein data bank, elucidating sequence-structure relationship is still a challenging task. To understand sequence-structure relationship better, a statistical analysis of amino acid residues in four major structural classes of protein viz. &#945; proteins, &#946; proteins, &#945;/&#946; proteins and &#945;+&#946; proteins is performed. We use non-homologous proteins from (< 30% identity) October 2008 release Brookhaven Protein Data Bank (PDB) with resolution better than 2.5 angstrom. Interestingly, in comparison to the helical protein, the helical propensities of hydrophobic residues in mix proteins (containing both &#945; helix and &#946; sheet) are increased significantly. On the other hand, the helical propensities of hydrophilic residues are reduced in mixed proteins. A reverse trend is observed in strand propensity. The difference in helical propensity of hydrophobic and hydrophilic residues in different fold may be due to differential folding mechanism. The size of protein may also play a crucial role. A position specific analysis of helices is also done in all &#945; and &#945;/&#946; proteins. The detailed analysis of helix dissection revealed that, the presence of &#946; sheet influences the individual preference of amino acids in different positions within helix. This result indicates that the preference of amino acid in different positions (N terminus, C terminus and middle) within &#945; helix are influenced by long range interactions with other structural elements.

[1]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[2]  G. Petsko,et al.  Composition analysis of α-helices in thermophilic organisms , 1995 .

[3]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[4]  Tim J. P. Hubbard,et al.  SCOP database in 2002: refinements accommodate structural genomics , 2002, Nucleic Acids Res..

[5]  Giovanni Colonna,et al.  Amino acid propensities for secondary structures are influenced by the protein structural class. , 2006, Biochemical and biophysical research communications.

[6]  Yixin Chen,et al.  A fast parallel algorithm for finding the longest common sequence of multiple biosequences , 2006, BMC Bioinformatics.

[7]  J. Richardson,et al.  Amino acid preferences for specific locations at the ends of alpha helices. , 1988, Science.

[8]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[9]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1978, Archives of biochemistry and biophysics.

[10]  Zhirong Sun,et al.  Folding type‐specific secondary structure propensities of amino acids, derived from α‐helical, β‐sheet, α/β, and α+β proteins of known structures , 1998 .

[11]  Sandeep Kumar,et al.  Dissecting α‐helices: Position‐specific analysis of α‐helices in globular proteins , 1998, Proteins.

[12]  P. Y. Chou,et al.  Prediction of protein conformation. , 1974, Biochemistry.

[13]  Fei Gu,et al.  Improved Chou-Fasman method for protein secondary structure prediction , 2006, BMC Bioinformatics.

[14]  L Serrano,et al.  Position dependence of amino acid intrinsic helical propensities II: Non‐charged polar residues: Ser, Thr, Asn, and Gln , 1999, Protein science : a publication of the Protein Society.

[15]  T M Handel,et al.  Review: protein design--where we were, where we are, where we're going. , 2001, Journal of structural biology.

[16]  P. Argos,et al.  Protein structure prediction: recognition of primary, secondary, and tertiary structural features from amino acid sequence. , 1995, Critical reviews in biochemistry and molecular biology.

[17]  A. Doig Recent advances in helix-coil theory. , 2002, Biophysical chemistry.

[18]  J. Janin,et al.  Surface and inside volumes in globular proteins , 1979, Nature.

[19]  L. Serrano,et al.  Amino acid intrinsic α‐helical propensities III: Positional dependence at several positions of C terminus , 2002, Protein science : a publication of the Protein Society.

[20]  M. Levitt,et al.  Structure-based conformational preferences of amino acids. , 1999, Proceedings of the National Academy of Sciences of the United States of America.