Investigating metabolite essentiality through genome-scale analysis of Escherichia coli production capabilities

MOTIVATION A phenotype mechanism is classically derived through the study of a set of mutants and comparison of their biochemical capabilities. One method of comparing mutant capabilities is to characterize producible and knocked out metabolites. However such an effect is difficult to manually assess, especially for a large biochemical network and a complex media. Current algorithmic approaches towards analyzing metabolic networks either do not address this specific property or are computationally infeasible on the genome-scale. RESULTS We have developed a novel genome-scale computational approach that identifies the full set of biochemical species that are knocked out from the metabolome following a gene deletion. Results from this approach are combined with data from in vivo mutant screens to examine the essentiality of metabolite production for a phenotype. This approach can also be a useful tool for metabolic network annotation validation and refinement in newly sequenced organisms. Combining an in silico genome-scale model of Escherichia coli metabolism with in vivo survival data, we uncover possible essential roles for several cell membranes, cell walls, and quinone species. We also identify specific biomass components whose production appears to be non-essential for survival, contrary to the assumptions of previous models. AVAILABILITY Programs are available upon request from the authors in the form of Matlab script files. SUPPLEMENTARY INFORMATION http://www.cis.upenn.edu/biocomp/manuscripts/bioinformatics_bti245/supp-info.html.

[1]  B. Bochner Innovations: New technologies to assess genotype–phenotype relationships , 2003, Nature Reviews Genetics.

[2]  B. Palsson,et al.  Theory for the systemic definition of metabolic pathways and their use in interpreting metabolic function from a pathway-oriented perspective. , 2000, Journal of theoretical biology.

[3]  D. Fell,et al.  A general definition of metabolic pathways useful for systematic organization and analysis of complex metabolic networks , 2000, Nature Biotechnology.

[4]  R. Heinrich,et al.  Metabolic Pathway Analysis: Basic Concepts and Scientific Applications in the Post‐genomic Era , 1999, Biotechnology progress.

[5]  P D Karp,et al.  Pathway Databases: A Case Study in Computational Symbolic Theories , 2001, Science.

[6]  B. Palsson,et al.  An expanded genome-scale model of Escherichia coli K-12 (iJR904 GSM/GPR) , 2003, Genome Biology.

[7]  B. Palsson,et al.  In silico predictions of Escherichia coli metabolic capabilities are consistent with experimental data , 2001, Nature Biotechnology.

[8]  Markus J. Herrgård,et al.  Integrating high-throughput and computational data elucidates bacterial networks , 2004, Nature.

[9]  Jason A. Papin,et al.  Comparison of network-based pathway analysis methods. , 2004, Trends in biotechnology.

[10]  Masanori Arita The metabolic world of Escherichia coli is not small. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Bruce L. Clarke Stoichiometric network analysis of the oxalate–persulfate–silver oscillator , 1992 .

[12]  B. Palsson,et al.  Escherichia coli K-12 undergoes adaptive evolution to achieve in silico predicted optimal growth , 2002, Nature.

[13]  Jason A. Papin,et al.  Metabolic pathways in the post-genome era. , 2003, Trends in biochemical sciences.

[14]  G. Church,et al.  Analysis of optimality in natural and perturbed metabolic networks , 2002 .

[15]  B. Palsson,et al.  Assessment of the metabolic capabilities of Haemophilus influenzae Rd through a genome-scale pathway analysis. , 2000, Journal of theoretical biology.

[16]  B. Palsson,et al.  The Escherichia coli MG1655 in silico metabolic genotype: its definition, characteristics, and capabilities. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Nikos Kyrpides,et al.  Genomes OnLine Database (GOLD): a monitor of genome projects world-wide , 2001, Nucleic Acids Res..

[18]  Ney Lemke,et al.  Essentiality and damage in metabolic networks , 2004, Bioinform..

[19]  B. Palsson,et al.  Metabolic Flux Balancing: Basic Concepts, Scientific and Practical Use , 1994, Bio/Technology.

[20]  G. Church,et al.  Genome-Scale Metabolic Model of Helicobacter pylori 26695 , 2002, Journal of bacteriology.

[21]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[22]  B. Palsson,et al.  Combining pathway analysis with flux balance analysis for the comprehensive study of metabolic systems. , 2000, Biotechnology and bioengineering.

[23]  B O Palsson,et al.  Metabolic modeling of microbial strains in silico. , 2001, Trends in biochemical sciences.

[24]  R. Heinrich,et al.  The Regulation of Cellular Systems , 1996, Springer US.

[25]  Stephen S Fong,et al.  Metabolic gene–deletion strains of Escherichia coli evolve to computationally predicted growth phenotypes , 2004, Nature Genetics.

[26]  J. W. Campbell,et al.  Experimental Determination and System Level Analysis of Essential Genes in Escherichia coli MG1655 , 2003, Journal of bacteriology.

[27]  B. Palsson,et al.  Toward Metabolic Phenomics: Analysis of Genomic Data Using Flux Balances , 1999, Biotechnology progress.

[28]  B. Palsson,et al.  Saccharomyces cerevisiae phenotypes can be predicted by using constraint-based analysis of a genome-scale reconstructed metabolic network , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[29]  Thomas Pfeiffer,et al.  Exploring the pathway structure of metabolism: decomposition into subnetworks and application to Mycoplasma pneumoniae , 2002, Bioinform..