Global landscape of protein complexes in the yeast Saccharomyces cerevisiae

Identification of protein–protein interactions often provides insight into protein function, and many cellular processes are performed by stable protein complexes. We used tandem affinity purification to process 4,562 different tagged proteins of the yeast Saccharomyces cerevisiae. Each preparation was analysed by both matrix-assisted laser desorption/ionization–time of flight mass spectrometry and liquid chromatography tandem mass spectrometry to increase coverage and accuracy. Machine learning was used to integrate the mass spectrometry scores and assign probabilities to the protein–protein interactions. Among 4,087 different proteins identified with high confidence by mass spectrometry from 2,357 successful purifications, our core data set (median precision of 0.69) comprises 7,123 protein–protein interactions involving 2,708 proteins. A Markov clustering algorithm organized these interactions into 547 protein complexes averaging 4.9 subunits per complex, about half of them absent from the MIPS database, as well as 429 additional interactions between pairs of complexes. The data (all of which are available online) will help future studies on individual proteins as well as functional genomics and systems biology.

[1]  Michael Shales,et al.  Extensive homology among the largest subunits of eukaryotic and prokaryotic RNA polymerases , 1985, Cell.

[2]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[3]  D. Shore,et al.  An essential yeast gene encoding a TTAGGG repeat-binding protein , 1993, Molecular and cellular biology.

[4]  B. Barrell,et al.  Life with 6000 Genes , 1996, Science.

[5]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[6]  J. Yates,et al.  Direct analysis and identification of proteins in mixtures by LC/MS/MS and database searching at the low-femtomole level. , 1997, Analytical chemistry.

[7]  S. Fields,et al.  A biochemical genomics approach for identifying genes by the activity of their products. , 1999, Science.

[8]  G. Fourel,et al.  Cohabitation of insulators and silencing elements in yeast subtelomeric regions , 1999, The EMBO journal.

[9]  J. Yates,et al.  Direct analysis of protein complexes using mass spectrometry , 1999, Nature Biotechnology.

[10]  B. Séraphin,et al.  A generic protein purification method for protein complex characterization and proteome exploration , 1999, Nature Biotechnology.

[11]  Ronald W. Davis,et al.  Functional characterization of the S. cerevisiae genome by gene deletion and parallel analysis. , 1999, Science.

[12]  Kei-Hoi Cheung,et al.  Large-scale analysis of the yeast genome by transposon tagging and gene disruption , 1999, Nature.

[13]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[14]  James R. Knight,et al.  A comprehensive analysis of protein–protein interactions in Saccharomyces cerevisiae , 2000, Nature.

[15]  Yudong D. He,et al.  Functional Discovery via a Compendium of Expression Profiles , 2000, Cell.

[16]  L. Aravind The BED finger, a novel DNA-binding domain in chromatin-boundary-element-binding proteins and transposases. , 2000, Trends in biochemical sciences.

[17]  Nevan J. Krogan,et al.  Characterization of a Six-Subunit Holo-Elongator Complex Required for the Regulated Expression of a Group of Genes in Saccharomyces cerevisiae , 2001, Molecular and Cellular Biology.

[18]  Gary D Bader,et al.  Systematic Genetic Analysis with Ordered Arrays of Yeast Deletion Mutants , 2001, Science.

[19]  R. Ozawa,et al.  A comprehensive two-hybrid analysis to explore the yeast protein interactome , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[20]  M. Curcio,et al.  Multiple regulators of Ty1 transposition in Saccharomyces cerevisiae have conserved roles in genome maintenance. , 2001, Genetics.

[21]  Gary D Bader,et al.  Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry , 2002, Nature.

[22]  M. Gerstein,et al.  Subcellular localization of the yeast proteome. , 2002, Genes & development.

[23]  P. Bork,et al.  Functional organization of the yeast proteome by systematic analysis of protein complexes , 2002, Nature.

[24]  B. Snel,et al.  Comparative assessment of large-scale data sets of protein–protein interactions , 2002, Nature.

[25]  G. Cagney,et al.  RNA Polymerase II Elongation Factors of Saccharomyces cerevisiae: a Targeted Proteomics Approach , 2002, Molecular and Cellular Biology.

[26]  A. Shilatifard,et al.  dELL is an essential RNA polymerase II elongation factor with a general role in development , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[27]  Anton J. Enright,et al.  An efficient algorithm for large-scale detection of protein families. , 2002, Nucleic acids research.

[28]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[29]  E. O’Shea,et al.  Global analysis of protein expression in yeast , 2003, Nature.

[30]  E. O’Shea,et al.  Global analysis of protein localization in budding yeast , 2003, Nature.

[31]  Dennis P Wall,et al.  A simple dependence between protein evolution rate and the number of protein-protein interactions , 2003, BMC Evolutionary Biology.

[32]  Michael Hampsey,et al.  Tails of Intrigue Phosphorylation of RNA Polymerase II Mediates Histone Methylation , 2003, Cell.

[33]  M. Snyder,et al.  Protein chip technology. , 2003, Current opinion in chemical biology.

[34]  Carole A. Goble,et al.  Investigating Semantic Similarity Measures Across the Gene Ontology: The Relationship Between Sequence and Annotation , 2003, Bioinform..

[35]  Karl-Dieter Entian,et al.  Catabolite degradation of fructose-1,6-bisphosphatase in the yeast Saccharomyces cerevisiae: a genome-wide screen identifies eight novel GID genes and indicates the existence of two degradation pathways. , 2003, Molecular biology of the cell.

[36]  M. Gerstein,et al.  A Bayesian Networks Approach for Predicting Protein-Protein Interactions from Genomic Data , 2003, Science.

[37]  Dmitrij Frishman,et al.  MIPS: analysis and annotation of proteins from whole genomes in 2005 , 2005, Nucleic Acids Res..

[38]  M. Gerstein,et al.  Analyzing protein function on a genomic scale: the importance of gold-standard positives and negatives for network prediction. , 2004, Current opinion in microbiology.

[39]  Mark Gerstein,et al.  Analyzing cellular biochemistry in terms of molecular networks. , 2003, Annual review of biochemistry.

[40]  T. Hughes,et al.  High-definition macromolecular composition of yeast RNA-processing complexes. , 2004, Molecular cell.

[41]  G. Cagney,et al.  Proteasome involvement in the repair of DNA double-strand breaks. , 2004, Molecular cell.

[42]  Philipp Korber,et al.  SWRred Not Shaken Mixing the Histones , 2004, Cell.

[43]  P. Sadhale,et al.  Rpb4 and Rpb7: A Sub‐complex Integral to Multi‐subunit RNA Polymerases Performs a Multitude of Functions , 2005, IUBMB life.

[44]  D. Ingber,et al.  High-Betweenness Proteins in the Yeast Protein Interaction Network , 2005, Journal of biomedicine & biotechnology.

[45]  Nevan J. Krogan,et al.  Cotranscriptional Set2 Methylation of Histone H3 Lysine 36 Recruits a Repressive Rpd3 Complex , 2005, Cell.

[46]  A. Emili,et al.  Interaction network containing conserved and essential protein complexes in Escherichia coli , 2005, Nature.

[47]  Bing Li,et al.  Histone H3 Methylation by Set2 Directs Deacetylation of Coding Regions by Rpd3S to Suppress Spurious Intragenic Transcription , 2005, Cell.

[48]  Sean R. Collins,et al.  Exploration of the Function and Organization of the Yeast Early Secretory Pathway through an Epistatic Miniarray Profile , 2005, Cell.