Statistical analysis of amino acid patterns in transmembrane helices: the GxxxG motif occurs frequently and in association with beta-branched residues at neighboring positions.

To find motifs that mediate helix-helix interactions in membrane proteins, we have analyzed frequently occurring combinations of residues in a database of transmembrane domains. Our analysis was performed with a novel formalism, which we call TMSTAT, for exactly calculating the expectancies of all pairs and triplets of residues in individual sequences, taking into account differential sequence composition and the substantial effect of finite length in short segments. We found that the number of significantly over and under-represented pairs and triplets was much greater than the random expectation. Isoleucine, glycine and valine were the most common residues in these extreme cases. The main theme observed is patterns of small residues (Gly, Ala and Ser) at i and i+4 found in association with large aliphatic residues (Ile, Val and Leu) at neighboring positions (i.e. i+/-1 and i+/-2). The most over-represented pair is formed by two glycine residues at i and i+4 (GxxxG, 31.6 % above expectation, p<1x10(-33)) and it is strongly associated with the neighboring beta-branched residues Ile and Val. In fact, the GxxxG pair has been described as part of the strong interaction motif in the glycophorin A transmembrane dimer, in which the pair is associated with two Val residues (GVxxGV). GxxxG is also the major motif identified using TOXCAT, an in vivo selection system for transmembrane oligomerization motifs. In conjunction with these experimental observations, our results highlight the importance of the GxxxG+beta-branched motif in transmembrane helix-helix interactions. In addition, the special role for the beta-branched residues Ile and Val suggested here is consistent with the hypothesis that residues with constrained rotameric freedom in helical conformation might reduce the entropic cost of folding in transmembrane proteins. Additional material is available at http://engelman.csb.yale. edu/tmstat and http://bioinfo.mbb.yale. edu/tmstat.

[1]  F M Richards,et al.  Packing of alpha-helices: geometrical constraints and contact areas. , 1978, Journal of molecular biology.

[2]  Frederic M. Richards,et al.  Packing of α-helices: Geometrical constraints and contact areas☆ , 1978 .

[3]  C. Chothia,et al.  Helix to helix packing in proteins. , 1981, Journal of molecular biology.

[4]  R. Doolittle,et al.  A simple method for displaying the hydropathic character of a protein. , 1982, Journal of molecular biology.

[5]  T. Steitz,et al.  Identifying nonpolar transbilayer helices in amino acid sequences of membrane proteins. , 1986, Annual review of biophysics and biophysical chemistry.

[6]  M. Sternberg,et al.  Analysis of the relationship between side-chain conformation and secondary structure in globular proteins. , 1987, Journal of molecular biology.

[7]  D. Engelman,et al.  Membrane protein folding and oligomerization: the two-stage model. , 1990, Biochemistry.

[8]  G. Heijne Membrane protein structure prediction. Hydrophobicity analysis and the positive-inside rule. , 1992, Journal of molecular biology.

[9]  C. Deber,et al.  Glycine and β‐branched residues support and modulate peptide helicity in membrane environments , 1992, FEBS letters.

[10]  G von Heijne,et al.  Membrane protein structure prediction. Hydrophobicity analysis and the positive-inside rule. , 1992, Journal of molecular biology.

[11]  D. Engelman,et al.  Sequence specificity in the dimerization of transmembrane alpha-helices. , 1992, Biochemistry.

[12]  C. Deber,et al.  Influence of glycine residues on peptide conformation in membrane environments. , 2009, International journal of peptide and protein research.

[13]  U. Hobohm,et al.  Selection of representative protein data sets , 1992, Protein science : a publication of the Protein Society.

[14]  T. Salakoski,et al.  Selection of a representative set of structures from brookhaven protein data bank , 1992, Proteins.

[15]  C. Deber,et al.  Non-random distribution of amino acids in the transmembrane segments of human type I single span membrane proteins. , 1993, Journal of molecular biology.

[16]  Roland L. Dunbrack,et al.  Backbone-dependent rotamer library for proteins. Application to side-chain prediction. , 1993, Journal of molecular biology.

[17]  P. Argos,et al.  Rotamers: to be or not to be? An analysis of amino acid side-chain conformations in globular proteins. , 1993, Journal of molecular biology.

[18]  David C. Jones,et al.  A mutation data matrix for transmembrane proteins , 1994, FEBS letters.

[19]  U. Hobohm,et al.  Enlarged representative set of protein structures , 1994, Protein science : a publication of the Protein Society.

[20]  A T Brünger,et al.  A dimerization motif for transmembrane alpha-helices. , 1994, Nature structural biology.

[21]  R. L. Baldwin,et al.  Tests for helix‐stabilizing interactions between various nonpolar side chains in alanine‐based peptides , 1994, Protein science : a publication of the Protein Society.

[22]  D. Engelman,et al.  A dimerization motif for transmembrane α–helices , 1994, Nature Structural Biology.

[23]  Interactions between hydrophobic side chains within alpha-helices. , 1995, Protein science : a publication of the Protein Society.

[24]  George D. Rose,et al.  Interactions between hydrophobic side chains within α‐helices , 1995 .

[25]  J. Popot,et al.  On the distribution of amino acid residues in transmembrane alpha-helix bundles. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[26]  P. Argos,et al.  Intrahelical side chain-side chain contacts: the consequences of restricted rotameric states and implications for helix engineering and design. , 1996, Protein engineering.

[27]  P Argos,et al.  Principles of helix-helix packing in proteins: the helical lattice superposition model. , 1996, Journal of molecular biology.

[28]  J U Bowie,et al.  Helix packing in membrane proteins. , 1997, Journal of molecular biology.

[29]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence data bank and its supplement TrEMBL , 1997, Nucleic Acids Res..

[30]  James H. Prestegard,et al.  A Transmembrane Helix Dimer: Structure and Implications , 1997, Science.

[31]  C. Deber,et al.  Uncoupling Hydrophobicity and Helicity in Transmembrane Segments , 1998, The Journal of Biological Chemistry.

[32]  C. Chothia,et al.  Assessing sequence comparison methods with reliable structurally identified distant evolutionary relationships. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[33]  A. Brunger,et al.  Statistical analysis of predicted transmembrane α-helices , 1998 .

[34]  A. Brunger,et al.  Statistical analysis of predicted transmembrane alpha-helices. , 1998, Biochimica et biophysica acta.

[35]  J. Beckwith,et al.  How many membrane proteins are there? , 1998, Protein science : a publication of the Protein Society.

[36]  D. Engelman,et al.  Structure-based prediction of the stability of transmembrane helix-helix interactions: the sequence dependence of glycophorin A dimerization. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[37]  M. Gerstein Patterns of protein‐fold usage in eight microbial genomes: A comprehensive structural census , 1998, Proteins.

[38]  D. Engelman,et al.  Detergents modulate dimerization, but not helicity, of the glycophorin A transmembrane domain. , 1999, Journal of molecular biology.

[39]  H Luecke,et al.  Structure of bacteriorhodopsin at 1.55 A resolution. , 1999, Journal of molecular biology.

[40]  A. Bairoch,et al.  The SWISS-PROT protein sequence data bank and its supplement TrEMBL in 1999 , 1999, Nucleic Acids Res..

[41]  J. Ballesteros,et al.  The role of a conserved proline residue in mediating conformational changes associated with voltage gating of Cx32 gap junctions. , 1999, Biophysical journal.

[42]  D. Engelman,et al.  The GxxxG motif: a framework for transmembrane helix-helix association. , 2000, Journal of molecular biology.

[43]  Structure of Bacteriorhodopsin at 100 K , 2000 .