An amino acid packing code for α-helical structure and protein design.

This work demonstrates that all packing in α-helices can be simplified to repetitive patterns of a single motif: the knob-socket. Using the precision of Voronoi Polyhedra/Delauney Tessellations to identify contacts, the knob-socket is a four-residue tetrahedral motif: a knob residue on one α-helix packs into the three-residue socket on another α-helix. The principle of the knob-socket model relates the packing between levels of protein structure: the intra-helical packing arrangements within secondary structure that permit inter-helix tertiary packing interactions. Within an α-helix, the three-residue sockets arrange residues into a uniform packing lattice. Inter-helix packing results from a definable pattern of interdigitated knob-socket motifs between two α-helices. Furthermore, the knob-socket model classifies three types of sockets: (1) free, favoring only intra-helical packing; (2) filled, favoring inter-helical interactions; and (3) non, disfavoring α-helical structure. The amino acid propensities in these three socket classes essentially represent an amino acid code for structure in α-helical packing. Using this code, we used a novel yet straightforward approach for the design of α-helical structure to validate the knob-socket model. Unique sequences for three peptides were created to produce a predicted amount of α-helical structure: mostly helical, some helical, and no helix. These three peptides were synthesized, and helical content was assessed using CD spectroscopy. The measured α-helicity of each peptide was consistent with the expected predictions. These results and analysis demonstrate that the knob-socket motif functions as the basic unit of packing and presents an intuitive tool to decipher the rules governing packing in protein structure.

[1]  Robert M. Stroud,et al.  A designed four helix bundle protein with native-like structure , 1997, Nature Structural Biology.

[2]  R L Jernigan,et al.  Coordination geometry of nonbonded residues in globular proteins. , 1996, Folding & design.

[3]  D N Woolfson,et al.  Amino acid pairing preferences in parallel beta-sheets in proteins. , 2006, Journal of molecular biology.

[4]  J. Fernández-Recio,et al.  Intrahelical side chain interactions in α‐helices: poor correlation between energetics and frequency , 1998, FEBS letters.

[5]  Alexey G. Murzin,et al.  General architecture of the α-helical globule , 1988 .

[6]  D. Baker,et al.  Contact order, transition state placement and the refolding rates of single domain proteins. , 1998, Journal of molecular biology.

[7]  Aleksey A. Porollo,et al.  Linear Regression Models for Solvent Accessibility Prediction in Proteins , 2005, J. Comput. Biol..

[8]  V. Lim Structural principles of the globular organization of protein chains. A stereochemical theory of globular protein secondary structure. , 1974, Journal of molecular biology.

[9]  P. S. Kim,et al.  High-resolution protein design with backbone freedom. , 1998, Science.

[10]  David B. Dahl,et al.  Characterizing the regularity of tetrahedral packing motifs in protein tertiary structure , 2010, Bioinform..

[11]  Junwen Wang,et al.  Exploring the sequence patterns in the α‐helices of proteins , 2003 .

[12]  L. Pauling,et al.  The structure of proteins; two hydrogen-bonded helical configurations of the polypeptide chain. , 1951, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Exceptional pairs of amino acid neighbors in α‐helices , 2003 .

[14]  A. Doig,et al.  Rotamer strain energy in protein helices - quantification of a major force opposing protein folding. , 2001, Journal of molecular biology.

[15]  Linus Pauling,et al.  The Structure of Proteins , 1939 .

[16]  S. L. Mayo,et al.  De novo protein design: fully automated sequence selection. , 1997, Science.

[17]  Ilya A Vakser,et al.  Shorter side chains optimize helix–helix packing , 2004, Protein science : a publication of the Protein Society.

[18]  R. B. Merrifield Solid phase peptide synthesis. I. the synthesis of a tetrapeptide , 1963 .

[19]  A G Murzin,et al.  General architecture of the alpha-helical globule. , 1988, Journal of molecular biology.

[20]  Jie Liang,et al.  Higher-order interhelical spatial interactions in membrane proteins. , 2003, Journal of molecular biology.

[21]  Harshinder Singh,et al.  Probabilistic model for two dependent circular variables , 2002 .

[22]  Alessandro Senes,et al.  Folding of helical membrane proteins: the role of polar, GxxxG-like and proline motifs. , 2004, Current opinion in structural biology.

[23]  Puzzle pieces defined: locating common packing units in tertiary protein contacts. , 1996, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[24]  J. Thornton,et al.  Atomic environments of arginine side chains in proteins. , 1993, Protein engineering.

[25]  Andrzej Kloczkowski,et al.  The origin and extent of coarse‐grained regularities in protein internal packing , 2003, Proteins.

[26]  Liam J. McGuffin,et al.  Protein structure prediction servers at University College London , 2005, Nucleic Acids Res..

[27]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[28]  N L Harris,et al.  Four helix bundle diversity in globular proteins. , 1994, Journal of molecular biology.

[29]  Ricardo L. Mancera,et al.  Computational Methods for the Prediction of the Structure and Interactions of Coiled-Coil Peptides , 2008 .

[30]  H. Scheraga,et al.  Role of hydrophobicity and solvent-mediated charge-charge interactions in stabilizing alpha-helices. , 1998, Biophysical journal.

[31]  Min Lu,et al.  Core side-chain packing and backbone conformation in Lpp-56 coiled-coil mutants. , 2002, Journal of molecular biology.

[32]  G M Maggiora,et al.  Energetics of the structure of the four-alpha-helix bundle in proteins. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[33]  P Argos,et al.  Principles of helix-helix packing in proteins: the helical lattice superposition model. , 1996, Journal of molecular biology.

[34]  W. DeGrado,et al.  Amino acid propensities are position-dependent throughout the length of alpha-helices. , 2004, Journal of molecular biology.

[35]  Vladimir G Tumanyan,et al.  Analysis of forces that determine helix formation in alpha-proteins. , 2004, Protein science : a publication of the Protein Society.

[36]  Aleksey Porollo,et al.  PROTEINS: Structure, Function, and Bioinformatics 56:753–767 (2004) Accurate Prediction of Solvent Accessibility Using Neural Networks–Based Regression , 2022 .

[37]  F E Cohen,et al.  Helix‐helix packing angle preferences for finite helix axes , 1998, Proteins.

[38]  Patrice Koehl,et al.  The ASTRAL Compendium in 2004 , 2003, Nucleic Acids Res..

[39]  V. Lim Algorithms for prediction of alpha-helical and beta-structural regions in globular proteins. , 1974, Journal of molecular biology.

[40]  M. Bansal,et al.  HELANAL: A Program to Characterize Helix Geometry in Proteins , 2000, Journal of biomolecular structure & dynamics.

[41]  Mark A. Schmitz,et al.  Semirational design of Jun-Fos coiled coils with increased affinity: Universal implications for leucine zipper prediction and design. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[42]  Derek N Woolfson,et al.  Extended knobs-into-holes packing in classical and complex coiled-coil assemblies. , 2003, Journal of structural biology.

[43]  Acr Martin,et al.  Amino Acid Pairing Preferences in Parallel β-Sheets in Proteins , 2006 .

[44]  D. Parry,et al.  Fifty years of coiled-coils and alpha-helical bundles: a close relationship between sequence and structure. , 2008, Journal of structural biology.

[45]  C. Chothia,et al.  Helix to helix packing in proteins. , 1981, Journal of molecular biology.

[46]  F. Crick,et al.  Is α-Keratin a Coiled Coil? , 1952, Nature.

[47]  L. Dure A repeating 11-mer amino acid motif and plant desiccation. , 1993, The Plant journal : for cell and molecular biology.

[48]  R. Schulz,et al.  Protein Structure Prediction , 2020, Methods in Molecular Biology.

[49]  A. Efimov Packing of α-helices in globular proteins. Layer-structure of globin hydrophobic cores , 1979 .

[50]  V. Lim Algorithms for prediction of α-helical and β-structural regions in globular proteins , 1974 .

[51]  D. Woolfson,et al.  A periodic table of coiled-coil protein structures. , 2009, Journal of molecular biology.

[52]  James U. Bowie,et al.  Helix packing angle preferences , 1997, Nature Structural Biology.

[53]  N. C. Price,et al.  How to study proteins by circular dichroism. , 2005, Biochimica et biophysica acta.

[54]  A. Goede,et al.  Spare parts for helix-helix interaction. , 1999, Protein engineering.

[55]  C. Chothia,et al.  Structural patterns in globular proteins , 1976, Nature.

[56]  G J Barton,et al.  Structural features can be unconserved in proteins with similar folds. An analysis of side-chain to side-chain contacts secondary structure and accessibility. , 1994, Journal of molecular biology.

[57]  Wei Wang,et al.  Identification of family-specific residue packing motifs and their use for structure-based protein function prediction: II. Case studies and applications , 2009, J. Comput. Aided Mol. Des..

[58]  Lu-Yong Wang,et al.  Covariation Analysis of Local Amino Acid Sequences in Recurrent Protein Local Structures , 2005, J. Bioinform. Comput. Biol..

[59]  P. Koehl,et al.  Helix‐sheet packing in proteins , 2010, Proteins.

[60]  P. Argos,et al.  Side-chain clusters in protein structures and their role in protein folding. , 1991, Journal of molecular biology.

[61]  Themis Lazaridis,et al.  Computational analysis of residue contributions to coiled‐coil topology , 2011, Protein science : a publication of the Protein Society.

[62]  F. Richards The interpretation of protein structures: total volume, group volume distributions and packing density. , 1974, Journal of molecular biology.

[63]  A. Tropsha,et al.  Four-body potentials reveal protein-specific correlations to stability changes caused by hydrophobic core mutations. , 2001, Journal of molecular biology.

[64]  Kristian Rother,et al.  Voronoia: analyzing packing in protein structures , 2008, Nucleic Acids Res..

[65]  David Eisenberg,et al.  GXXXG and GXXXA motifs stabilize FAD and NAD(P)-binding Rossmann folds through C(alpha)-H... O hydrogen bonds and van der waals interactions. , 2002, Journal of molecular biology.

[66]  Wei Liu,et al.  Comparison of helix interactions in membrane and soluble alpha-bundle proteins. , 2002, Biophysical journal.

[67]  L. Pauling,et al.  Compound Helical Configurations of Polypeptide Chains: Structure of Proteins of the α-Keratin Type , 1953, Nature.

[68]  Alexander Efimov Complementary packing of α‐helices in proteins , 1999 .

[69]  P. Harbury,et al.  Automated design of specificity in molecular recognition , 2003, Nature Structural Biology.

[70]  L. Pauling,et al.  The pleated sheet, a new layer configuration of polypeptide chains. , 1951, Proceedings of the National Academy of Sciences of the United States of America.

[71]  J. Sadoc Helices and helix packings derived from the {3, 3, 5} polytope , 2001 .

[72]  Markus Eilers,et al.  Comparison of Helix Interactions in Membrane and Soluble α-Bundle Proteins , 2002 .

[73]  N. Kurochkina Helix-helix interactions and their impact on protein motifs and assemblies. , 2010, Journal of theoretical biology.

[74]  Christopher M. Summa,et al.  D(n)-symmetrical tertiary templates for the design of tubular proteins. , 2001, Journal of molecular biology.

[75]  Coenraad Bron,et al.  Finding all cliques of an undirected graph , 1973 .

[76]  Derek N Woolfson,et al.  Preferred side-chain constellations at antiparallel coiled-coil interfaces , 2008, Proceedings of the National Academy of Sciences.

[77]  Nir Ben-Tal,et al.  Structural determinants of transmembrane helical proteins. , 2009, Structure.

[78]  Markus Gruber,et al.  Historical review: another 50th anniversary--new periodicities in coiled coils. , 2003, Trends in biochemical sciences.

[79]  A. Poupon Voronoi and Voronoi-related tessellations in studies of protein structure and interaction. , 2004, Current opinion in structural biology.

[80]  S. Wodak,et al.  Deviations from standard atomic volumes as a quality measure for protein crystal structures. , 1996, Journal of molecular biology.

[81]  David Eisenberg,et al.  GXXXG and AXXXA: common alpha-helical interaction motifs in proteins, particularly in extremophiles. , 2002, Biochemistry.

[82]  William F. DeGrado,et al.  Amino Acid Propensities are Position-dependent Throughout the Length of α-Helices , 2004 .

[83]  Vladimir G. Tumanyan,et al.  Analysis of forces that determine helix formation in α‐proteins , 2004 .

[84]  P. S. Kim,et al.  A switch between two-, three-, and four-stranded coiled coils in GCN4 leucine zipper mutants. , 1993, Science.

[85]  Energetic determinants of oligomeric state specificity in coiled coils. , 2007, Journal of the American Chemical Society.

[86]  Robert M Stroud,et al.  The channel architecture of aquaporin 0 at a 2.2-A resolution. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[87]  M. Gerstein,et al.  Statistical analysis of amino acid patterns in transmembrane helices: the GxxxG motif occurs frequently and in association with beta-branched residues at neighboring positions. , 2000, Journal of molecular biology.

[88]  Wei Wang,et al.  Mining protein family specific residue packing patterns from protein structure graphs , 2004, RECOMB.

[89]  S. Suh,et al.  Crystal structure of the ribosome recycling factor from Escherichia coli , 2000, The EMBO journal.

[90]  D. Baker,et al.  Prediction of local structure in proteins using a library of sequence-structure motifs. , 1998, Journal of molecular biology.

[91]  D. Frishman,et al.  Prediction of helix–helix contacts and interacting helices in polytopic membrane proteins using neural networks , 2009, Proteins.

[92]  G J Kleywegt,et al.  Recognition of spatial motifs in protein structures. , 1999, Journal of molecular biology.

[93]  Barry Honig,et al.  Helical packing patterns in membrane and soluble proteins. , 2004, Biophysical journal.

[94]  W. DeGrado,et al.  Helix-packing motifs in membrane proteins , 2006, Proceedings of the National Academy of Sciences.

[95]  Lenore Cowen,et al.  Recognition of beta-structural motifs using hidden Markov models trained with simulated evolution , 2010, Bioinform..

[96]  C. Chothia,et al.  The Packing Density in Proteins: Standard Radii and Volumes , 1999 .

[97]  David Eisenberg,et al.  GXXXG and AXXXA: Common α-Helical Interaction Motifs in Proteins, Particularly in Extremophiles† , 2002 .

[98]  C. Bron,et al.  Algorithm 457: finding all cliques of an undirected graph , 1973 .

[99]  Wen-Lian Hsu,et al.  Predicting helix–helix interactions from residue contacts in membrane proteins , 2009, Bioinform..

[100]  R. L. Baldwin,et al.  Helix stabilization by Glu-...Lys+ salt bridges in short peptides of de novo design. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[101]  Alexander Tropsha,et al.  Simplicial neighborhood analysis of protein packing (SNAPP): a computational geometry approach to studying proteins. , 2003, Methods in enzymology.

[102]  M. Schiffer,et al.  Use of helical wheels to represent the structures of proteins and to identify segments with helical potential. , 1967, Biophysical journal.

[103]  F. M. Richards,et al.  Calculation of molecular volumes and areas for structures of known geometry. , 1985, Methods in enzymology.

[104]  I. Jonassen,et al.  Discovery of local packing motifs in protein structures , 1999, Proteins.

[105]  Dieter Langosch,et al.  Interaction of transmembrane helices by a knobs‐into‐holes packing characteristic of soluble coiled coils , 1998, Proteins.

[106]  Georges Voronoi Nouvelles applications des paramètres continus à la théorie des formes quadratiques. Deuxième mémoire. Recherches sur les parallélloèdres primitifs. , 1908 .

[107]  M G Oakley,et al.  The design of antiparallel coiled coils. , 2001, Current opinion in structural biology.

[108]  J Walshaw,et al.  Socket: a program for identifying and analysing coiled-coil motifs within protein structures. , 2001, Journal of molecular biology.

[109]  Iosif I. Vaisman,et al.  Delaunay Tessellation of Proteins: Four Body Nearest-Neighbor Propensities of Amino Acid Residues , 1996, J. Comput. Biol..

[110]  Min Lu,et al.  Antiparallel Four-Stranded Coiled Coil Specified by a 3-3-1 Hydrophobic Heptad Repeat , 2006, Structure.

[111]  D Eisenberg,et al.  Crystal structure of a synthetic triple-stranded alpha-helical bundle. , 1993, Science.

[112]  Structure, stability and folding of the alpha-helix. , 2001, Biochemical Society symposium.

[113]  Aleksey A. Porollo,et al.  Combining prediction of secondary structure and solvent accessibility in proteins , 2005, Proteins.

[114]  Jerry Tsai,et al.  Characterizing conserved structural contacts by pair-wise relative contacts and relative packing groups. , 2005, Journal of molecular biology.

[115]  Arthur M Lesk,et al.  Contact patterns between helices and strands of sheet define protein folding patterns , 2007, Proteins.

[116]  C Geourjon,et al.  Protein structure prediction. Implications for the biologist. , 1997, Biochimie.

[117]  M Gerstein,et al.  Volume changes on protein folding. , 1994, Structure.

[118]  Min Lu,et al.  A seven-helix coiled coil , 2006, Proceedings of the National Academy of Sciences.

[119]  A. Lupas,et al.  Predicting coiled coils from protein sequences , 1991, Science.

[120]  Jin-An Feng,et al.  Exploring the sequence patterns in the alpha-helices of proteins. , 2003, Protein engineering.

[121]  Christian Cole,et al.  The Jpred 3 secondary structure prediction server , 2008, Nucleic Acids Res..

[122]  J. Snoeyink,et al.  Distance-based identification of structure motifs in proteins using constrained frequent subgraph mining. , 2006, Computational systems bioinformatics. Computational Systems Bioinformatics Conference.

[123]  F. Crick,et al.  The packing of α‐helices: simple coiled‐coils , 1953 .

[124]  W. DeGrado,et al.  Modeling transmembrane helical oligomers. , 1997, Current opinion in structural biology.

[125]  S L Mayo,et al.  De novo protein design: towards fully automated sequence selection. , 1997, Journal of molecular biology.

[126]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[127]  D. Engelman,et al.  The GxxxG motif: a framework for transmembrane helix-helix association. , 2000, Journal of molecular biology.

[128]  Dmitrij Frishman,et al.  Phenylalanine promotes interaction of transmembrane domains via GxxxG motifs. , 2007, Journal of molecular biology.

[129]  J. Janin,et al.  Surface and inside volumes in globular proteins , 1979, Nature.

[130]  Owen J. L. Rackham,et al.  The evolution and structure prediction of coiled coils across all genomes , 2010 .

[131]  W. DeGrado,et al.  Solution structure and dynamics of a de novo designed three-helix bundle protein. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[132]  Conrad C. Huang,et al.  UCSF Chimera—A visualization system for exploratory research and analysis , 2004, J. Comput. Chem..