Treasures and traps in genome-wide data sets: case examples from yeast

Since the publication of the Saccharomyces cerevisiae genome sequence, much effort has gone into developing high-throughput techniques that generate comprehensive information about the function and dynamics of all genes in this yeast's genome. The resulting data sets typically contain large amounts of reliable and valuable biological information. Nevertheless, such large-scale studies also carry uncertainties, which we discuss in this review, and these uncertainties increase with the complexity of the organism under study. On the basis of the results from yeast, we should learn much from human and mouse genomic data sets; as with the yeast data sets, however, they might also contain misleading results.
