A data integration methodology for systems biology: experimental verification.

The integration of data from multiple global assays is essential to understanding dynamic spatiotemporal interactions within cells. In a companion paper, we reported a data integration methodology, designated Pointillist, that can handle multiple data types from technologies with different noise characteristics. Here we demonstrate its application to the integration of 18 data sets relating to galactose utilization in yeast. These data include global changes in mRNA and protein abundance, genome-wide protein-DNA interaction data, database information, and computational predictions of protein-DNA and protein-protein interactions. We divided the integration task into the determination of three network components: key system elements (genes and proteins), protein-protein interactions, and protein-DNA interactions. Results indicate that the reconstructed network efficiently focuses on and recapitulates the known biology of galactose utilization. It also provided new insights, some of which were verified experimentally. The methodology described here addresses a critical need across all domains of molecular and cell biology: the effective integration of large and disparate data sets.
