Cellular crowding imposes global constraints on the chemistry and evolution of proteomes

In living cells, functional protein–protein interactions compete with a much larger number of nonfunctional, or promiscuous, interactions. Several cellular properties contribute to avoiding unwanted protein interactions, including regulation of gene expression, cellular compartmentalization, and high specificity and affinity of functional interactions. Here we investigate whether other mechanisms exist that shape the sequence and structure of proteins to favor their correct assembly into functional protein complexes. To examine this question, we project evolutionary and cellular abundance information onto 397, 196, and 631 proteins of known 3D structure from Escherichia coli, Saccharomyces cerevisiae, and Homo sapiens, respectively. On the basis of amino acid frequencies in interface patches versus the solvent-accessible protein surface, we define a propensity or “stickiness” scale for each of the 20 amino acids. We find that the propensity to interact in a nonspecific manner is inversely correlated with abundance. In other words, high abundance proteins have less sticky surfaces. We also find that stickiness constrains protein evolution, whereby residues in sticky surface patches are more conserved than those found in nonsticky patches. Finally, we find that the constraint imposed by stickiness on protein divergence is proportional to protein abundance, which provides mechanistic insights into the correlation between protein conservation and protein abundance. Overall, the avoidance of nonfunctional interactions significantly influences the physico-chemical and evolutionary properties of proteins. Remarkably, the effects observed are consistently larger in E. coli and S. cerevisiae than in H. sapiens, suggesting that promiscuous protein–protein interactions may be freer to accumulate in the human lineage.

[1]  Angelo D. Favia,et al.  Protein promiscuity and its implications for biotechnology , 2009, Nature Biotechnology.

[2]  C. Chothia,et al.  The atomic structure of protein-protein recognition sites. , 1999, Journal of molecular biology.

[3]  S. Rackovsky,et al.  Hydrophobicity, hydrophilicity, and the radial and orientational distributions of residues in native proteins. , 1977, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Sergei Maslov,et al.  Topology of protein interaction network shapes protein abundances and strengths of their functional and nonspecific interactions , 2011, Proceedings of the National Academy of Sciences.

[5]  D. Eisenberg Three-dimensional structure of membrane and surface proteins. , 1984, Annual review of biochemistry.

[6]  Michele Vendruscolo,et al.  A relationship between mRNA expression levels and protein solubility in E. coli. , 2009, Journal of molecular biology.

[7]  G. Rose,et al.  The Levinthal paradox of the interactome , 2011, Protein science : a publication of the Protein Society.

[8]  Minoru Kanehisa,et al.  AAindex: amino acid index database, progress report 2008 , 2007, Nucleic Acids Res..

[9]  Dan S. Tawfik,et al.  Slow protein evolutionary rates are dictated by surface–core association , 2011, Proceedings of the National Academy of Sciences.

[10]  M. Levitt A simplified representation of protein conformations for rapid simulation of protein folding. , 1976, Journal of molecular biology.

[11]  Andreas Prlic,et al.  Ensembl 2008 , 2007, Nucleic Acids Res..

[12]  P. Ponnuswamy,et al.  Hydrophobic character of amino acid residues in globular proteins , 1978, Nature.

[13]  G. Rose,et al.  Hydrophobicity of amino acid residues in globular proteins. , 1985, Science.

[14]  H. Cid,et al.  Hydrophobicity and structural classes in proteins. , 1992, Protein engineering.

[15]  D. D. Jones,et al.  Amino acid properties and side-chain orientation in proteins: a cross correlation appraoch. , 1975, Journal of theoretical biology.

[16]  Milton T. W. Hearn,et al.  Physicochemical Basis of Amino Acid Hydrophobicity Scales: Evaluation of Four New Scales of Amino Acid Hydrophobicity Coefficients Derived from RP-HPLC of Peptides , 1995 .

[17]  R D MacElroy,et al.  A simple experimental model for hydrophobic interactions in proteins. , 1984, The Journal of biological chemistry.

[18]  M. Prabhakaran,et al.  The distribution of physical, chemical and conformational properties in signal and nascent peptides. , 1990, The Biochemical journal.

[19]  M J Sippl,et al.  Structure-derived hydrophobic potential. Hydrophobic potential derived from X-ray structures of globular proteins is able to identify native folds. , 1992, Journal of molecular biology.

[20]  P. Argos,et al.  Structural prediction of membrane-bound proteins. , 2005, European journal of biochemistry.

[21]  Michele Vendruscolo,et al.  Life on the edge: a link between gene expression levels and aggregation rates of human proteins. , 2007, Trends in biochemical sciences.

[22]  Johan A. Grahnen,et al.  Binding constraints on the evolution of enzymes and signalling proteins: the important role of negative pleiotropy , 2011, Proceedings of the Royal Society B: Biological Sciences.

[23]  S. Teichmann,et al.  Tight Regulation of Unstructured Proteins: From Transcript Synthesis to Protein Degradation , 2008, Science.

[24]  D. Tieleman,et al.  Hydrophobicity scales: a thermodynamic looking glass into lipid-protein interactions. , 2011, Trends in biochemical sciences.

[25]  Ben Lehner,et al.  Intrinsic Protein Disorder and Interaction Promiscuity Are Widely Associated with Dosage Sensitivity , 2009, Cell.

[26]  Yu Xia,et al.  Structural determinants of protein evolution are context-sensitive at the residue level. , 2009, Molecular biology and evolution.

[27]  Cyrus Chothia,et al.  The selection of acceptable protein mutations , 2007, Proceedings of the National Academy of Sciences.

[28]  D. Mould,et al.  Development of hydrophobicity parameters to analyze proteins which bear post- or cotranslational modifications. , 1991, Analytical biochemistry.

[29]  Emmanuel D Levy,et al.  Protein abundance is key to distinguish promiscuous from functional phosphorylation based on evolutionary information , 2012, Philosophical Transactions of the Royal Society B: Biological Sciences.

[30]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[31]  Itay Mayrose,et al.  Rate4Site: an algorithmic tool for the identification of functional regions in proteins by surface mapping of evolutionary determinants within their homologues , 2002, ISMB.

[32]  A. D. McLachlan,et al.  Solvation energy in protein folding and binding , 1986, Nature.

[33]  T. Steitz,et al.  Identifying nonpolar transbilayer helices in amino acid sequences of membrane proteins. , 1986, Annual review of biophysics and biophysical chemistry.

[34]  R. Doolittle,et al.  A simple method for displaying the hydropathic character of a protein. , 1982, Journal of molecular biology.

[35]  Dan S. Tawfik Messy biology and the origins of evolutionary innovations. , 2010, Nature chemical biology.

[36]  John Kuriyan,et al.  The origin of protein interactions and allostery in colocalization , 2007, Nature.

[37]  E. Levy A simple definition of structural regions in proteins and its use in analyzing interface evolution. , 2010, Journal of molecular biology.

[38]  Michele Vendruscolo,et al.  Physicochemical principles that regulate the competition between functional and dysfunctional association of proteins , 2009, Proceedings of the National Academy of Sciences.

[39]  Stephen H. White,et al.  Experimentally determined hydrophobicity scale for proteins at membrane interfaces , 1996, Nature Structural Biology.

[40]  Jian-Rong Yang,et al.  Impact of translational error-induced and error-free misfolding on the rate of protein evolution , 2010, Molecular systems biology.

[41]  T Tsujita,et al.  Dependence of conformational stability on hydrophobicity of the amino acid residue in a series of variant proteins substituted at a unique position of tryptophan synthase alpha subunit. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[42]  Jian-Rong Yang,et al.  Protein misinteraction avoidance causes highly expressed proteins to evolve slowly , 2012, Proceedings of the National Academy of Sciences.

[43]  C. Pál,et al.  Highly expressed genes in yeast evolve slowly. , 2001, Genetics.

[44]  P. Ponnuswamy Hydrophobic characteristics of folded proteins. , 1993, Progress in biophysics and molecular biology.

[45]  K Nishikawa,et al.  Distinct character in hydrophobicity of amino acid compositions of mitochondrial proteins , 1990, Proteins.

[46]  Margaret E. Johnson,et al.  Nonspecific binding limits the number of proteins in a cell and shapes their interaction networks , 2010, Proceedings of the National Academy of Sciences.

[47]  Sarah A. Teichmann,et al.  3D Complex: A Structural Classification of Protein Complexes , 2006, PLoS Comput. Biol..

[48]  Sergei Maslov,et al.  Constraints imposed by non-functional protein–protein interactions on gene expression and proteome size , 2008, Molecular systems biology.

[49]  T. Venanzi Hydrophobicity parameters and the bitter taste of L-amino acids. , 1984, Journal of theoretical biology.

[50]  J. Doye,et al.  Inhibition of protein crystallization by evolutionary negative design , 2004, Physical biology.

[51]  M. Kanehisa,et al.  Local hydrophobicity stabilizes secondary structures in proteins , 1980, Biopolymers.

[52]  R Cowan,et al.  Hydrophobicity indices for amino acid residues as determined by high-performance liquid chromatography. , 1990, Peptide research.

[53]  H. Scheraga,et al.  Statistical analysis of the physical properties of the 20 naturally occurring amino acids , 1985 .

[54]  Adrian H. Elcock,et al.  Diffusion, Crowding & Protein Stability in a Dynamic Molecular Model of the Bacterial Cytoplasm , 2010, PLoS Comput. Biol..

[55]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[56]  Ariel Fernández,et al.  Nonadaptive origins of interactome complexity , 2011, Nature.

[57]  C C Bigelow,et al.  On the average hydrophobicity of proteins and the relation between it and protein structure. , 1967, Journal of theoretical biology.

[58]  David Baker,et al.  Role of the Biomolecular Energy Gap in Protein Design, Structure, and Evolution , 2012, Cell.

[59]  J. Janin,et al.  Quantifying biological specificity: the statistical mechanics of molecular recognition. , 1996, Proteins.

[60]  C. Dobson,et al.  Competition between folding, native-state dimerisation and amyloid aggregation in beta-lactoglobulin. , 2009, Journal of molecular biology.

[61]  A. Bertolotti,et al.  Exposure of Hydrophobic Surfaces Initiates Aggregation of Diverse ALS-Causing Superoxide Dismutase-1 Mutants , 2010, Journal of molecular biology.

[62]  Emmanuel D Levy,et al.  PiQSi: protein quaternary structure investigation. , 2007, Structure.

[63]  J. Janin,et al.  Protein–protein interaction and quaternary structure , 2008, Quarterly Reviews of Biophysics.

[64]  P. Ponnuswamy,et al.  Hydrophobic packing and spatial arrangement of amino acid residues in globular proteins. , 1980, Biochimica et biophysica acta.

[65]  R. Wolfenden,et al.  Water, protein folding, and the genetic code. , 1979, Science.

[66]  C. von Mering,et al.  PaxDb, a Database of Protein Abundance Averages Across All Three Domains of Life , 2012, Molecular & Cellular Proteomics.

[67]  Sarath Chandra Janga,et al.  Operons and the effect of genome redundancy in deciphering functional relationships using phylogenetic profiles , 2007, Proteins.

[68]  Z. Derewenda,et al.  The role of entropy and polarity in intermolecular contacts in protein crystals. , 2009, Acta crystallographica. Section D, Biological crystallography.

[69]  Orly Dym,et al.  Following evolutionary paths to protein-protein interactions with high affinity and selectivity , 2009, Nature Structural &Molecular Biology.

[70]  D. Goldsack,et al.  Contribution of the free energy of mixing of hydrophobic side chains to the stability of the tertiary structure of proteins. , 1973, Journal of theoretical biology.

[71]  D. Eisenberg,et al.  Correlation of sequence hydrophobicities measures similarity in three-dimensional protein structure. , 1983, Journal of molecular biology.

[72]  E. Shakhnovich,et al.  Native atomic burials, supplemented by physically motivated hydrogen bond constraints, contain sufficient information to determine the tertiary structure of small globular proteins , 2008, Proteins.

[73]  J. M. Zimmerman,et al.  The characterization of amino acid sequences in proteins by statistical methods. , 1968, Journal of theoretical biology.

[74]  M. Charton,et al.  The structural dependence of amino acid hydrophobicity parameters. , 1982, Journal of theoretical biology.

[75]  C. Chothia Structural invariants in protein folding , 1975, Nature.

[76]  Claus O. Wilke,et al.  Mistranslation-Induced Protein Misfolding as a Dominant Constraint on Coding-Sequence Evolution , 2008, Cell.

[77]  N. Friedman,et al.  Natural history and evolutionary principles of gene duplication in fungi , 2007, Nature.