Protein abundances and interactions coevolve to promote functional complexes while suppressing non-specific binding

How do living cells achieve sufficient abundances of functional protein complexes while minimizing promiscuous non-functional interactions? Here we study this problem using a first-principles model of the cell whose phenotypic traits are determined directly from its genome through the biophysical properties of protein structures and binding interactions in a crowded cellular environment. The model cell includes three independent prototypical pathways whose protein-protein interaction (PPI) sub-networks differ in topology but contribute equally to cell fitness. Model cells evolve through genotypic mutations and phenotypic protein copy number variations. We found a strong relationship between the evolved physicochemical properties of protein interactions and protein abundances, caused by a "frustration" effect: strengthening functional interactions produces hydrophobic interfaces, which make proteins prone to promiscuous binding. The balance is achieved by lowering the concentrations of hub proteins while raising the solubilities and abundances of functional monomers. Based on these principles we generated and analyzed a possible realization of the proteome-wide PPI network in yeast. In this simulation we found that high-throughput affinity capture-mass spectrometry experiments can detect functional interactions with high fidelity only for high-abundance proteins, while missing most interactions for low-abundance proteins.
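The trade-off between functional and promiscuous binding can be illustrated with a minimal law-of-mass-action sketch. This is not the model used in the paper: it assumes a single functional pair A and B, a hypothetical pool of non-specific partners C present in large excess (so free C is approximately its total), a single non-specific dissociation constant `kd_ns`, and arbitrary concentration units. All function and parameter names are illustrative.

```python
import math


def dimer_concentration(a_tot, b_tot, kd):
    """Equilibrium [AB] for A + B <-> AB, from the law of mass action.

    Solves (a_tot - x) * (b_tot - x) = kd * x for the physical root x.
    """
    s = a_tot + b_tot + kd
    return (s - math.sqrt(s * s - 4.0 * a_tot * b_tot)) / 2.0


def functional_dimer_with_crowders(a_tot, b_tot, kd_func, c_tot, kd_ns):
    """Equilibrium [AB] when A and B also bind a crowder pool C.

    Assumption (not from the paper): C is in large excess, so free C is
    approximately c_tot. Each monomer is then sequestered by the factor
    alpha = 1 + c_tot / kd_ns, which is equivalent to an effective
    functional dissociation constant kd_func * alpha**2.
    """
    alpha = 1.0 + c_tot / kd_ns
    return dimer_concentration(a_tot, b_tot, kd_func * alpha ** 2)


if __name__ == "__main__":
    # Stronger non-specific binding (smaller kd_ns) depletes the functional complex.
    for kd_ns in (100.0, 10.0, 1.0):
        ab = functional_dimer_with_crowders(1.0, 1.0, 0.1, 10.0, kd_ns)
        print(f"kd_ns={kd_ns:6.1f}  [AB]={ab:.4f}")
```

Under these assumptions, making non-specific binding stronger lowers the functional complex concentration even when the functional affinity is unchanged, which is the qualitative tension the abstract describes.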
