Systematic condition-dependent annotation of metabolic genes.

The task of deriving a functional annotation for genes is complex as their involvement in various processes depends on multiple factors such as environmental conditions and genetic backup mechanisms. This study employs a large-scale model of the metabolism of Saccharomyces cerevisiae to investigate the function of yeast genes and derive a condition-dependent annotation (CDA) for their involvement in major metabolic processes under various genetic and environmental conditions. The resulting CDA is validated on a large scale and is shown to be superior to the corresponding Gene Ontology (GO) annotation, by showing that genes annotated with the same CDA term tend to be more coherently conserved in evolution and display greater expression coherency than those annotated with the same GO term. The CDA gives rise to new kinds of functional condition-dependent metabolic pathways, some of which are described and further examined via substrate auxotrophy measurements of knocked-out strains. The CDA presented is likely to serve as a new reference source for metabolic gene annotation.

[1]  D. Eisenberg,et al.  Assigning protein functions by comparative genome analysis: protein phylogenetic profiles. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[2]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[3]  R. Maier,et al.  Oxidative-Stress Resistance Mutants of Helicobacter pylori , 2002, Journal of bacteriology.

[4]  Ronald W. Davis,et al.  Functional characterization of the S. cerevisiae genome by gene deletion and parallel analysis. , 1999, Science.

[5]  O. Ozier-Kalogeropoulos,et al.  A simple and efficient method for direct gene deletion in Saccharomyces cerevisiae. , 1993, Nucleic acids research.

[6]  B. Palsson,et al.  Large-scale evaluation of in silico gene deletions in Saccharomyces cerevisiae. , 2003, Omics : a journal of integrative biology.

[7]  Gavin Sherlock,et al.  Global analysis of gene function in yeast by quantitative phenotypic profiling , 2006, Molecular systems biology.

[8]  Y. Pilpel,et al.  Transcription control reprogramming in genetic backup circuits , 2005, Nature Genetics.

[9]  Thomas R. Gruber,et al.  Toward principles for the design of ontologies used for knowledge sharing? , 1995, Int. J. Hum. Comput. Stud..

[10]  S. Oliver,et al.  Plasticity of genetic interactions in metabolic networks of yeast , 2007, Proceedings of the National Academy of Sciences.

[11]  Gavin Sherlock,et al.  The Stanford Microarray Database accommodates additional microarray platforms and data formats , 2004, Nucleic Acids Res..

[12]  M. Dante,et al.  Multifunctional yeast high-copy-number shuttle vectors. , 1992, Gene.

[13]  U. Sauer,et al.  Metabolic functions of duplicate genes in Saccharomyces cerevisiae. , 2005, Genome research.

[14]  B. Garvik,et al.  Principles for the Buffering of Genetic Variation , 2001, Science.

[15]  Y. Surdin-Kerjan,et al.  Cysteine biosynthesis in Saccharomyces cerevisiae occurs through the transsulfuration pathway which has been built up by enzyme recruitment , 1993, Journal of bacteriology.

[16]  Jianzhi Zhang,et al.  Toward a Molecular Understanding of Pleiotropy , 2006, Genetics.

[17]  B. Marshall,et al.  Urea protects Helicobacter (Campylobacter) pylori from the bactericidal effect of acid. , 1990, Gastroenterology.

[18]  B. Palsson,et al.  The Escherichia coli MG1655 in silico metabolic genotype: its definition, characteristics, and capabilities. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[19]  R. Mahadevan,et al.  The effects of alternate optimal solutions in constraint-based genome-scale metabolic models. , 2003, Metabolic engineering.

[20]  J A Eisen,et al.  Phylogenomics: improving functional predictions for uncharacterized genes by evolutionary analysis. , 1998, Genome research.

[21]  C. Deytieux,et al.  Fine measurement of ergosterol requirements for growth of Saccharomyces cerevisiae during alcoholic fermentation , 2005, Applied Microbiology and Biotechnology.

[22]  B. Palsson,et al.  Towards multidimensional genome annotation , 2006, Nature Reviews Genetics.

[23]  G. Church,et al.  Modular epistasis in yeast metabolism , 2005, Nature Genetics.

[24]  Gary D Bader,et al.  Global Mapping of the Yeast Genetic Interaction Network , 2004, Science.

[25]  Li Ni,et al.  A procedure for assessing GO annotation consistency , 2005, ISMB.

[26]  E. Ruppin,et al.  Regulatory on/off minimization of metabolic flux changes after genetic perturbations. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[27]  Bruno van Swinderen,et al.  Flexibility in a Gene Network Affecting a Simple Behavior in Drosophila melanogaster , 2005, Genetics.

[28]  B. Palsson,et al.  Theory for the systemic definition of metabolic pathways and their use in interpreting metabolic function from a pathway-oriented perspective. , 2000, Journal of theoretical biology.

[29]  D. Fell,et al.  A general definition of metabolic pathways useful for systematic organization and analysis of complex metabolic networks , 2000, Nature Biotechnology.

[30]  Ronald W. Davis,et al.  Functional profiling of the Saccharomyces cerevisiae genome , 2002, Nature.

[31]  Sean R. Collins,et al.  Exploration of the Function and Organization of the Yeast Early Secretory Pathway through an Epistatic Miniarray Profile , 2005, Cell.

[32]  B. Palsson,et al.  In silico predictions of Escherichia coli metabolic capabilities are consistent with experimental data , 2001, Nature Biotechnology.

[33]  E. Ruppin,et al.  Multiple knockout analysis of genetic robustness in the yeast metabolic network , 2006, Nature Genetics.

[34]  Markus J. Herrgård,et al.  Reconstruction and validation of Saccharomyces cerevisiae iND750, a fully compartmentalized genome-scale metabolic model. , 2004, Genome research.

[35]  D. E. Levin,et al.  Saccharomyces cerevisiae contains four fatty acid activation (FAA) genes: an assessment of their role in regulating protein N- myristoylation and cellular lipid metabolism , 1994, The Journal of cell biology.

[36]  B. Palsson,et al.  Expanded Metabolic Reconstruction of Helicobacter pylori (iIT341 GSM/GPR): an In Silico Genome-Scale Characterization of Single- and Double-Deletion Mutants , 2005, Journal of bacteriology.

[37]  M. Goebl,et al.  Two glycogen synthase isoforms in Saccharomyces cerevisiae are coded by distinct genes that are differentially controlled. , 1991, The Journal of biological chemistry.

[38]  Jan Ihmels,et al.  Principles of transcriptional control in the metabolic network of Saccharomyces cerevisiae , 2004, Nature Biotechnology.

[39]  B. Palsson,et al.  Saccharomyces cerevisiae phenotypes can be predicted by using constraint-based analysis of a genome-scale reconstructed metabolic network , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[40]  E. Dubois,et al.  Specific induction of catabolism and its relation to repression of biosynthesis in arginine metabolism of Saccharomyces cerevisiae. , 1978, Journal of molecular biology.

[41]  Markus J. Herrgård,et al.  Integrated analysis of regulatory and metabolic networks reveals novel regulatory mechanisms in Saccharomyces cerevisiae. , 2006, Genome research.

[42]  B. Palsson,et al.  Escherichia coli K-12 undergoes adaptive evolution to achieve in silico predicted optimal growth , 2002, Nature.