Charged residues next to transmembrane regions revisited: “Positive-inside rule” is complemented by the “negative inside depletion/outside enrichment rule”

BackgroundTransmembrane helices (TMHs) frequently occur amongst protein architectures as means for proteins to attach to or embed into biological membranes. Physical constraints such as the membrane’s hydrophobicity and electrostatic potential apply uniform requirements to TMHs and their flanking regions; consequently, they are mirrored in their sequence patterns (in addition to TMHs being a span of generally hydrophobic residues) on top of variations enforced by the specific protein’s biological functions.ResultsWith statistics derived from a large body of protein sequences, we demonstrate that, in addition to the positive charge preference at the cytoplasmic inside (positive-inside rule), negatively charged residues preferentially occur or are even enriched at the non-cytoplasmic flank or, at least, they are suppressed at the cytoplasmic flank (negative-not-inside/negative-outside (NNI/NO) rule). As negative residues are generally rare within or near TMHs, the statistical significance is sensitive with regard to details of TMH alignment and residue frequency normalisation and also to dataset size; therefore, this trend was obscured in previous work. We observe variations amongst taxa as well as for organelles along the secretory pathway. The effect is most pronounced for TMHs from single-pass transmembrane (bitopic) proteins compared to those with multiple TMHs (polytopic proteins) and especially for the class of simple TMHs that evolved for the sole role as membrane anchors.ConclusionsThe charged-residue flank bias is only one of the TMH sequence features with a role in the anchorage mechanisms, others apparently being the leucine intra-helix propensity skew towards the cytoplasmic side, tryptophan flanking as well as the cysteine and tyrosine inside preference. These observations will stimulate new prediction methods for TMHs and protein topology from a sequence as well as new engineering designs for artificial membrane proteins.

[1]  Hwee Kuan Lee,et al.  How bioinformatics influences health informatics: usage of biomolecular sequences, expression profiles and automated microscopic image analyses for clinical needs and public health , 2013, Health Information Science and Systems.

[2]  P Argos,et al.  Are knowledge‐based potentials derived from protein structure sets discriminative with respect to amino acid types? , 1998, Proteins.

[3]  Peter B. McGarvey,et al.  UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches , 2014, Bioinform..

[4]  Masao Sakaguchi,et al.  Function of Positive Charges Following Signal-Anchor Sequences during Translocation of the N-terminal Domain* , 2006, Journal of Biological Chemistry.

[5]  C. Tanford,et al.  The solubility of amino acids and two glycine peptides in aqueous ethanol and dioxane solutions. Establishment of a hydrophobicity scale. , 1971, The Journal of biological chemistry.

[6]  Frank Eisenhaber,et al.  A Decade after the First Full Human genome sequencing: when will We Understand our Own genome? , 2012, J. Bioinform. Comput. Biol..

[7]  R. Coleman,et al.  Lipid topogenesis. , 1981, Journal of lipid research.

[8]  David T. Jones,et al.  Improving the accuracy of transmembrane protein topology prediction using evolutionary information , 2007, Bioinform..

[9]  J. Killian,et al.  How proteins adapt to a membrane-water interface. , 2000, Trends in biochemical sciences.

[10]  Lukas Käll,et al.  Reliability of transmembrane predictions in whole‐genome data , 2002, FEBS letters.

[11]  G. von Heijne,et al.  Orientational preferences of neighboring helices can drive ER insertion of a marginally hydrophobic transmembrane helix. , 2012, Molecular cell.

[12]  R. Doolittle,et al.  A simple method for displaying the hydropathic character of a protein. , 1982, Journal of molecular biology.

[13]  R. R. Bahadur Some Limit Theorems in Statistics , 1987 .

[14]  G. von Heijne,et al.  Membrane protein structure: prediction versus reality. , 2007, Annual review of biochemistry.

[15]  D. Peake,et al.  Inhibition of sphingomyelin synthase (SMS) affects intracellular sphingomyelin accumulation and plasma membrane lipid organization. , 2007, Biochimica et biophysica acta.

[16]  Birgit Eisenhaber,et al.  On filtering false positive transmembrane protein predictions. , 2002, Protein engineering.

[17]  A. Krogh,et al.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. , 2001, Journal of molecular biology.

[18]  G. Meer,et al.  Membrane lipids: where they are and how they behave , 2008, Nature Reviews Molecular Cell Biology.

[19]  S. Grinstein,et al.  Determinants of the pH of the Golgi Complex* , 2000, The Journal of Biological Chemistry.

[20]  G. Heijne,et al.  Genome‐wide analysis of integral membrane proteins from eubacterial, archaean, and eukaryotic organisms , 1998, Protein science : a publication of the Protein Society.

[21]  Igor N. Berezovsky,et al.  The Recipe for Protein Sequence-Based Function Prediction and Its Implementation in the ANNOTATOR Software Environment. , 2016, Methods in molecular biology.

[22]  A. Palmer,et al.  Measuring steady-state and dynamic endoplasmic reticulum and Golgi Zn2+ with genetically encoded sensors , 2011, Proceedings of the National Academy of Sciences.

[23]  G. Heijne,et al.  Recognition of transmembrane helices by the endoplasmic reticulum translocon , 2005, Nature.

[24]  K Nishikawa,et al.  The amino acid composition is different between the cytoplasmic and extracellular sides in membrane proteins , 1992, FEBS letters.

[25]  G. Heijne Membrane-protein topology , 2006, Nature Reviews Molecular Cell Biology.

[26]  Gunnar von Heijne,et al.  Trans‐membrane Translocation of Proteins , 1979 .

[27]  A. Krogh,et al.  A combined transmembrane topology and signal peptide prediction method. , 2004, Journal of molecular biology.

[28]  G. Vonheijne,et al.  Control of topology and mode of assembly of a polytopic membrane protein by positively charged residues , 1989, Nature.

[29]  G. von Heijne,et al.  Quantitative analysis of SecYEG-mediated insertion of transmembrane α-helices into the bacterial inner membrane. , 2013, Journal of Molecular Biology.

[30]  D. Tipper,et al.  Transmembrane Protein Insertion Orientation in Yeast Depends on the Charge Difference across Transmembrane Segments, Their Total Hydrophobicity, and Its Distribution* , 1998, The Journal of Biological Chemistry.

[31]  G. von Heijne,et al.  The aromatic residues Trp and Phe have different effects on the positioning of a transmembrane helix in the microsomal membrane. , 1999, Biochemistry.

[32]  M. Inouye,et al.  Reversible topology of a bifunctional transmembrane protein depends upon the charge balance around its transmembrane domain , 1994, Molecular microbiology.

[33]  Sebastian Maurer-Stroh,et al.  Not all transmembrane helices are born equal: Towards the extension of the sequence homology concept to membrane proteins , 2011, Biology Direct.

[34]  R. Schülein,et al.  A Single Negatively Charged Residue Affects the Orientation of a Membrane Protein in the Inner Membrane of Escherichia coliOnly When It Is Located Adjacent to a Transmembrane Domain* , 1999, The Journal of Biological Chemistry.

[35]  S H White,et al.  MPtopo: A database of membrane protein topology , 2001, Protein science : a publication of the Protein Society.

[36]  P M Cullis,et al.  Affinities of amino acid side chains for solvent water. , 1981, Biochemistry.

[37]  Sebastian Maurer-Stroh,et al.  More Than 1,001 Problems with Protein Domain Databases: Transmembrane Regions, Signal Peptides and the Issue of Sequence Homology , 2010, PLoS Comput. Biol..

[38]  Erik L. L. Sonnhammer,et al.  Advantages of combined transmembrane topology and signal peptide prediction—the Phobius web server , 2007, Nucleic Acids Res..

[39]  C. Chothia The nature of the accessible and buried surfaces in proteins. , 1976, Journal of molecular biology.

[40]  G. von Heijne,et al.  Membrane insertion of marginally hydrophobic transmembrane helices depends on sequence context. , 2010, Journal of molecular biology.

[41]  Sebastian Maurer-Stroh,et al.  HPMV: Human protein mutation viewer - relating sequence mutations to protein sequence architecture and function changes , 2015, J. Bioinform. Comput. Biol..

[42]  Gunnar von Heijne,et al.  Fine-tuning the topology of a polytopic membrane protein: Role of positively and negatively charged amino acids , 1990, Cell.

[43]  A. Zachowski Phospholipids in animal eukaryotic membranes: transverse asymmetry and movement. , 1993, The Biochemical journal.

[44]  Arne Elofsson,et al.  A study of the membrane-water interface region of membrane proteins. , 2005, Journal of molecular biology.

[45]  G von Heijne,et al.  Net N-C charge imbalance may be important for signal sequence function in bacteria. , 1986, Journal of molecular biology.

[46]  S. Munro,et al.  A Comprehensive Comparison of Transmembrane Domains Reveals Organelle-Specific Properties , 2010, Cell.

[47]  Martin Hermansson,et al.  Both Sphingomyelin Synthases SMS1 and SMS2 Are Required for Sphingomyelin Homeostasis and Growth in Human HeLa Cells* , 2007, Journal of Biological Chemistry.

[48]  M. Bogdanov,et al.  Lipids and topological rules governing membrane protein assembly. , 2014, Biochimica et biophysica acta.

[49]  Gunnar von Heijne,et al.  A biphasic pulling force acts on transmembrane helices during translocon-mediated membrane integration , 2012, Nature Structural &Molecular Biology.

[50]  P. Oger,et al.  Adaptation of the membrane in Archaea. , 2013, Biophysical chemistry.

[51]  M. Sato,et al.  Testing the Charge Difference Hypothesis for the Assembly of a Eucaryotic Multispanning Membrane Protein* , 1998, The Journal of Biological Chemistry.

[52]  G. von Heijne,et al.  Charge-driven dynamics of nascent chain movement through the SecYEG translocon , 2014, Nature Structural &Molecular Biology.

[53]  G. von Heijne,et al.  Topogenic signals in integral membrane proteins. , 1988, European journal of biochemistry.

[54]  Sebastian Maurer-Stroh,et al.  Transmembrane helix: simple or complex , 2012, Nucleic Acids Res..

[55]  S. White,et al.  Membrane protein folding and stability: physical principles. , 1999, Annual review of biophysics and biomolecular structure.

[56]  G. Heijne,et al.  Molecular code for transmembrane-helix recognition by the Sec61 translocon , 2007, Nature.

[57]  J. Beltzer,et al.  Charged residues are major determinants of the transmembrane orientation of a signal-anchor sequence. , 1991, The Journal of biological chemistry.

[58]  Gunnar von Heijne,et al.  How translocons select transmembrane helices. , 2008, Annual review of biophysics.

[59]  G. von Heijne,et al.  Predicting the topology of eukaryotic membrane proteins. , 1993, European journal of biochemistry.

[60]  G. von Heijne,et al.  Position-specific Asp-Lys pairing can affect signal sequence function and membrane protein topology. , 1993, The Journal of biological chemistry.

[61]  P. Devaux,et al.  Transmembrane Asymmetry and Lateral Domains in Biological Membranes , 2004, Traffic.

[62]  R. Dalbey,et al.  The Proton Motive Force, Acting on Acidic Residues, Promotes Translocation of Amino-terminal Domains of Membrane Proteins When the Hydrophobicity of the Translocation Signal Is Low* , 1998, The Journal of Biological Chemistry.

[63]  H. Riezman,et al.  The ins and outs of sphingolipid synthesis. , 2005, Trends in cell biology.

[64]  Pietro De Camilli,et al.  Phosphoinositides in cell regulation and membrane dynamics , 2006, Nature.

[65]  R. Hegde,et al.  Protein Targeting and Degradation are Coupled for Elimination of Mislocalized Proteins , 2011, Nature.

[66]  Gunnar von Heijne,et al.  Mechanisms of integral membrane protein insertion and folding. , 2015, Journal of molecular biology.

[67]  Birgit Eisenhaber,et al.  TM or not TM: transmembrane protein prediction with low false positive rate using DAS-TMfilter , 2004, Bioinform..

[68]  G. von Heijne,et al.  Positional editing of transmembrane domains during ion channel assembly , 2013, Journal of Cell Science.

[69]  D. Eisenberg,et al.  Analysis of membrane and surface protein sequences with the hydrophobic moment plot. , 1984, Journal of molecular biology.

[70]  G. von Heijne,et al.  Asn‐ and Asp‐mediated interactions between transmembrane helices during translocon‐mediated membrane protein assembly , 2006, EMBO reports.

[71]  Marc A. Marti-Renom,et al.  Structure-based statistical analysis of transmembrane helices , 2013, European Biophysics Journal.

[72]  A. Lomize,et al.  Structural adaptations of proteins to different biological membranes. , 2013, Biochimica et biophysica acta.

[73]  Johan Nilsson,et al.  Comparative analysis of amino acid distributions in integral membrane proteins from 107 genomes , 2005, Proteins.

[74]  Gang Zhao,et al.  An amino acid “transmembrane tendency” scale that approaches the theoretical limit to accuracy for prediction of transmembrane helices: Relationship to biological hydrophobicity , 2006, Protein science : a publication of the Protein Society.

[75]  G von Heijne,et al.  The ‘positive‐inside rule’ applies to thylakoid membrane proteins , 1991, FEBS letters.

[76]  M. Sansom,et al.  Amino acid distributions in integral membrane protein structures. , 2001, Biochimica et biophysica acta.

[77]  G von Heijne,et al.  Trans-membrane translocation of proteins. The direct transfer model. , 1979, European journal of biochemistry.

[78]  T A Rapoport,et al.  Predicting the orientation of eukaryotic membrane-spanning proteins. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[79]  Tony Yeung,et al.  Membrane Phosphatidylserine Regulates Surface Charge and Protein Localization , 2008, Science.

[80]  J. Janin,et al.  Surface and inside volumes in globular proteins , 1979, Nature.

[81]  Fernanda L. Sirota,et al.  Beware of Moving targets: Reference proteome Content Fluctuates substantially over the Years , 2012, J. Bioinform. Comput. Biol..

[82]  D. Daleke Phospholipid Flippases* , 2007, Journal of Biological Chemistry.

[83]  I. Dukes,et al.  Endoplasmic reticulum calcium store regulates membrane potential in mouse islet beta-cells. , 1994, The Journal of biological chemistry.

[84]  R. R. Bahadur Rates of Convergence of Estimates and Test Statistics , 1967 .

[85]  G. von Heijne,et al.  Different positively charged amino acids have similar effects on the topology of a polytopic transmembrane protein in Escherichia coli. , 1992, The Journal of biological chemistry.

[86]  María Martín,et al.  UniProt: A hub for protein information , 2015 .

[87]  The Uniprot Consortium,et al.  UniProt: a hub for protein information , 2014, Nucleic Acids Res..

[88]  Erik L. L. Sonnhammer,et al.  A Hidden Markov Model for Predicting Transmembrane Helices in Protein Sequences , 1998, ISMB.

[89]  István Reményi,et al.  Expediting topology data gathering for the TOPDB database , 2014, Nucleic Acids Res..

[90]  Thijs Beuming,et al.  A knowledge-based scale for the analysis and prediction of buried and exposed faces of transmembrane domain proteins , 2004, Bioinform..

[91]  Ying Gao,et al.  Bioinformatics Applications Note Sequence Analysis Cd-hit Suite: a Web Server for Clustering and Comparing Biological Sequences , 2022 .