GIMS: an integrated data storage and analysis environment for genomic and functional data

Effective analyses in functional genomics require access to many kinds of biological data. For example, the analysis of upregulated genes in a microarray experiment might be aided by information concerning protein interactions or proteins' cellular locations. However, such information is often stored in different formats at different sites, in ways that may not be amenable to integrated analysis. The Genome Information Management System (GIMS) is an object database that integrates genomic data with data on the transcriptome, protein–protein interactions, metabolic pathways and annotations, such as gene ontology terms and identifiers. The resulting system supports the running of analyses over this integrated data resource, and provides comprehensive facilities for handling and interrelating the results of these analyses. GIMS has been used to store Saccharomyces cerevisiae data, and we demonstrate how the integrated storage of diverse types of data can be beneficial for analysis, using combinations of complex queries. As an example, we describe how GIMS has been used to analyse a collection of aryl alcohol dehydrogenase gene deletion mutants. The GIMS database can be accessed remotely using a Java application that can be downloaded from http://img.cs.man.ac.uk/gims. Copyright © 2003 John Wiley & Sons, Ltd.

[1]  Philip Lijnzaad,et al.  The Ensembl genome database project , 2002, Nucleic Acids Res..

[2]  Susumu Goto,et al.  LIGAND: database of chemical compounds and reactions in biological pathways , 2002, Nucleic Acids Res..

[3]  S G Oliver,et al.  Analysis of the seven-member AAD gene set demonstrates that genetic redundancy in yeast may be more apparent than real. , 1999, Genetics.

[4]  James I. Garrels,et al.  YPD-A database for the proteins of Saccharomyces cerevisiae , 1996, Nucleic Acids Res..

[5]  D. Botstein,et al.  The transcriptional program of sporulation in budding yeast. , 1998, Science.

[6]  Andrew Hayes,et al.  Hybridization array technology coupled with chemostat culture: Tools to interrogate gene expression in Saccharomyces cerevisiae. , 2002, Methods.

[7]  P. Brown,et al.  Exploring the metabolic and genetic control of gene expression on a genomic scale. , 1997, Science.

[8]  Hans-Werner Mewes,et al.  MIPS: a database for protein sequences, homology data and yeast genome information , 1997, Nucleic Acids Res..

[9]  M. Gerstein,et al.  Relating whole-genome expression data with protein-protein interactions. , 2002, Genome research.

[10]  B. Snel,et al.  Comparative assessment of large-scale data sets of protein–protein interactions , 2002, Nature.

[11]  Ivar Jacobson,et al.  The Unified Modeling Language User Guide , 1998, J. Database Manag..

[12]  David Botstein,et al.  SGD: Saccharomyces Genome Database , 1998, Nucleic Acids Res..

[13]  R. Ozawa,et al.  A comprehensive two-hybrid analysis to explore the yeast protein interactome , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[14]  강문설 [서평]「The Unified Modeling Language User Guide」 , 1999 .

[15]  Gary D Bader,et al.  Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometry , 2002, Nature.

[16]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[17]  S. Oliver,et al.  Disruption of seven hypothetical aryl alcohol dehydrogenase genes from Saccharomyces cerevisiae and construction of a multiple knock‐out strain , 1999, Yeast.

[18]  James R. Knight,et al.  A comprehensive analysis of protein–protein interactions in Saccharomyces cerevisiae , 2000, Nature.

[19]  Magnus Rattray,et al.  Making sense of microarray data distributions , 2002, Bioinform..

[20]  Carole A. Goble,et al.  Conceptual modelling of genomic information , 2000, Bioinform..

[21]  X. Parés,et al.  Characterization of the Saccharomyces cerevisiae YMR318C (ADH6) gene product as a broad specificity NADPH-dependent alcohol dehydrogenase: relevance in aldehyde reduction. , 2002, The Biochemical journal.

[22]  P. Bork,et al.  Functional organization of the yeast proteome by systematic analysis of protein complexes , 2002, Nature.