Relating whole-genome expression data with protein-protein interactions.

We investigate the relationship of protein-protein interactions with mRNA expression levels, by integrating a variety of data sources for yeast. We focus on known protein complexes that have clearly defined interactions between their subunits. We find that subunits of the same protein complex show significant coexpression, both in terms of similarities of absolute mRNA levels and expression profiles, e.g., we can often see subunits of a complex having correlated patterns of expression over a time course. We classify the yeast protein complexes as either permanent or transient, with permanent ones being maintained through most cellular conditions. We find that, generally, permanent complexes, such as the ribosome and proteasome, have a particularly strong relationship with expression, while transient ones do not. However, we note that several transient complexes, such as the RNA polymerase II holoenzyme and the replication complex, can be subdivided into smaller permanent ones, which do have a strong relationship to gene expression. We also investigated the interactions in aggregated, genome-wide data sets, such as the comprehensive yeast two-hybrid experiments, and found them to have only a weak relationship with gene expression, similar to that of transient complexes. (Further details on genecensus.org/expression/interactions and bioinfo.mbb.yale.edu/expression/interactions.)

[1]  W. H. Mager,et al.  Quantitative analysis of the protein composition of yeast ribosomes. , 1978, European journal of biochemistry.

[2]  J. Broach,et al.  Genome dynamics, protein synthesis, and energetics , 1991 .

[3]  F. R. Papa,et al.  The yeast DOA4 gene encodes a deubiquitinating enzyme related to a product of the human tre-2 oncogene , 1993, Nature.

[4]  R. Young,et al.  RNA Polymerase II Holoenzyme Contains SWI/SNF Regulators Involved in Chromatin Remodeling , 1996, Cell.

[5]  K Tanaka,et al.  Structure and functions of the 20S and 26S proteasomes. , 1996, Annual review of biochemistry.

[6]  D. Mastronarde,et al.  Nuclear pore complex number and distribution throughout the Saccharomyces cerevisiae cell cycle by three-dimensional reconstruction from electron micrographs of nuclear envelopes. , 1997, Molecular biology of the cell.

[7]  M. Bucci,et al.  In Vivo Dynamics of Nuclear Pore Complexes in Yeast , 1997, The Journal of cell biology.

[8]  Wei Zhou,et al.  Characterization of the Yeast Transcriptome , 1997, Cell.

[9]  J. Seilhamer,et al.  A comparison of selected mRNA and protein abundances in human liver , 1997, Electrophoresis.

[10]  O. Aparicio,et al.  Components and Dynamics of DNA Replication Complexes in S. cerevisiae: Redistribution of MCM Proteins and Cdc45p during S Phase , 1997, Cell.

[11]  R. Planta Regulation of ribosome synthesis in yeast , 1997, Yeast.

[12]  M. Holcombe,et al.  Information Processing in Cells and Tissues , 1998, Springer US.

[13]  A. Persidis High-throughput screening , 1998, Bio/Technology.

[14]  J. Barker,et al.  Large-scale temporal gene expression mapping of central nervous system development. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[15]  Ronald W. Davis,et al.  A genome-wide transcriptional analysis of the mitotic cell cycle. , 1998, Molecular cell.

[16]  Michael R. Green,et al.  Dissecting the Regulatory Circuitry of a Eukaryotic Genome , 1998, Cell.

[17]  G. Church,et al.  Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole-genome mRNA quantitation , 1998, Nature Biotechnology.

[18]  P. D’haeseleer,et al.  Mining the gene expression matrix: inferring gene relationships from large scale gene expression data , 1998 .

[19]  N M Luscombe,et al.  New tools and resources for analysing protein structures and their interactions. , 1998, Acta crystallographica. Section D, Biological crystallography.

[20]  J. Yates,et al.  TRAPP, a highly conserved novel complex on the cis‐Golgi that mediates vesicle docking and fusion , 1998, The EMBO journal.

[21]  P. Brown,et al.  Identification of eukaryotic mRNAs that are translated at reduced cap binding complex eIF4F concentrations using a cDNA microarray. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Dmitrij Frishman,et al.  MIPS: a database for genomes and protein sequences , 1999, Nucleic Acids Res..

[23]  B. Futcher,et al.  A Sampling of the Yeast Proteome , 1999, Molecular and Cellular Biology.

[24]  M. Nomura,et al.  Regulation of Ribosome Biosynthesis in Escherichia coli and Saccharomyces cerevisiae: Diversity and Common Principles , 1999, Journal of bacteriology.

[25]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[26]  S. Gygi,et al.  Correlation between Protein and mRNA Abundance in Yeast , 1999, Molecular and Cellular Biology.

[27]  L. Samson,et al.  Global response of Saccharomyces cerevisiae to an alkylating agent. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[28]  F. R. Papa,et al.  Interaction of the Doa4 deubiquitinating enzyme with the yeast 26S proteasome. , 1999, Molecular biology of the cell.

[29]  T. P. Flores,et al.  Protein structural topology: Automated analysis and diagrammatic representation , 2008, Protein science : a publication of the Protein Society.

[30]  C. Gordon,et al.  The 26S proteasome of the fission yeast Schizosaccharomyces pombe. , 1999, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[31]  Laurie J. Heyer,et al.  Exploring expression data: identification and analysis of coexpressed genes. , 1999, Genome research.

[32]  C. Hill,et al.  Structural basis for the activation of 20S proteasomes by 11S regulators , 2000, Nature.

[33]  Hans-Werner Mewes,et al.  Integrative Analysis of Protein Interaction Data , 2000, ISMB.

[34]  Andrea Califano,et al.  Analysis of Gene Expression Microarrays for Phenotype Classification , 2000, ISMB.

[35]  G. Cagney,et al.  Large-scale functional analysis using peptide or protein arrays , 2000, Nature Biotechnology.

[36]  James R. Knight,et al.  A comprehensive analysis of protein–protein interactions in Saccharomyces cerevisiae , 2000, Nature.

[37]  T. Gaasterland,et al.  Making the most of microarray data , 2000, Nature Genetics.

[38]  M. Sacher,et al.  TRAPP stably associates with the Golgi and is required for vesicle docking , 2000, The EMBO journal.

[39]  M Gerstein,et al.  Structural proteomics: prospects for high throughput sample preparation. , 2000, Progress in biophysics and molecular biology.

[40]  Yudong D. He,et al.  Functional Discovery via a Compendium of Expression Profiles , 2000, Cell.

[41]  M Gerstein,et al.  Analysis of the yeast transcriptome with structural and functional categories: characterizing highly expressed proteins. , 2000, Nucleic acids research.

[42]  P. Uetz,et al.  Systematic and large-scale two-hybrid screens. , 2000, Current opinion in microbiology.

[43]  Mark Gerstein,et al.  Structural proteomics of an archaeon , 2000, Nature Structural Biology.

[44]  B. Schwikowski,et al.  A network of protein–protein interactions in yeast , 2000, Nature Biotechnology.

[45]  D. Eisenberg,et al.  Protein function in the post-genomic era , 2000, Nature.

[46]  Ioannis Xenarios,et al.  DIP: the Database of Interacting Proteins , 2000, Nucleic Acids Res..

[47]  C. Li,et al.  Analyzing high‐density oligonucleotide gene expression array data , 2001, Journal of cellular biochemistry.

[48]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[49]  D Haussler,et al.  Knowledge-based analysis of microarray gene expression data by using support vector machines. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[50]  M. Gerstein,et al.  A Bayesian system integrating expression data with sequence patterns for localizing proteins: comprehensive application to the yeast genome. , 2000, Journal of molecular biology.

[51]  P. Uetz,et al.  High-throughput screening for protein-protein interactions using two-hybrid assay. , 2000, Methods in enzymology.

[52]  T. Ito,et al.  Toward a protein-protein interaction map of the budding yeast: A comprehensive system to examine two-hybrid interactions in all possible combinations between the yeast proteins. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[53]  M. Gerstein,et al.  The current excitement in bioinformatics-analysis of whole-genome expression data: how does it relate to protein structure and function? , 2000, Current opinion in structural biology.

[54]  Gary D. Bader,et al.  BIND-a data specification for storing and describing biomolecular interactions, molecular complexes and pathways , 2000, Bioinform..

[55]  M Gerstein,et al.  Genome-wide analysis relating expression level with protein subcellular localization. , 2000, Trends in genetics : TIG.

[56]  C. Chothia,et al.  Determination of protein function, evolution and interactions by structural genomics. , 2001, Current opinion in structural biology.

[57]  M. Gerstein,et al.  RNA expression patterns change dramatically in human neutrophils exposed to bacteria. , 2001, Blood.

[58]  T. Takagi,et al.  Assessment of prediction accuracy of protein function from protein–protein interaction data , 2001, Yeast.

[59]  Jeffrey T. Chang,et al.  Basic microarray analysis: grouping and feature reduction. , 2001, Trends in biotechnology.

[60]  M. Gerstein,et al.  Genomic and proteomic analysis of the myeloid differentiation program. , 2001, Blood.

[61]  M. Gerstein,et al.  Beyond synexpression relationships: local clustering of time-shifted and inverted gene expression profiles identifies new, biologically relevant interactions. , 2001, Journal of molecular biology.

[62]  M. Vidal,et al.  High-throughput yeast two-hybrid assays for large-scale protein interaction mapping. , 2001, Methods.

[63]  R. Ozawa,et al.  A comprehensive two-hybrid analysis to explore the yeast protein interactome , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[64]  Mark Gerstein,et al.  Analysis of mRNA expression and protein abundance data: an approach for the comparison of the enrichment of features in the cellular population of proteins and transcripts , 2002, Bioinform..