Integration of metabolic databases for the reconstruction of genome-scale metabolic networks

BackgroundGenome-scale metabolic reconstructions have been recognised as a valuable tool for a variety of applications ranging from metabolic engineering to evolutionary studies. However, the reconstruction of such networks remains an arduous process requiring a high level of human intervention. This process is further complicated by occurrences of missing or conflicting information and the absence of common annotation standards between different data sources.ResultsIn this article, we report a semi-automated methodology aimed at streamlining the process of metabolic network reconstruction by enabling the integration of different genome-wide databases of metabolic reactions. We present results obtained by applying this methodology to the metabolic network of the plant Arabidopsis thaliana. A systematic comparison of compounds and reactions between two genome-wide databases allowed us to obtain a high-quality core consensus reconstruction, which was validated for stoichiometric consistency. A lower level of consensus led to a larger reconstruction, which has a lower quality standard but provides a baseline for further manual curation.ConclusionThis semi-automated methodology may be applied to other organisms and help to streamline the process of genome-scale network reconstruction in order to accelerate the transfer of such models to applications.

[1]  M. Hawes,et al.  Flavonoids: from cell cycle regulation to biotechnology , 2005, Biotechnology Letters.

[2]  B. Palsson,et al.  The Escherichia coli MG1655 in silico metabolic genotype: its definition, characteristics, and capabilities. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Masanori Arita The metabolic world of Escherichia coli is not small. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Hugh D. Spence,et al.  Minimum information requested in the annotation of biochemical models (MIRIAM) , 2005, Nature Biotechnology.

[5]  David Weininger,et al.  SMILES. 2. Algorithm for generation of unique SMILES notation , 1989, J. Chem. Inf. Comput. Sci..

[6]  R. Albert,et al.  The large-scale organization of metabolic networks , 2000, Nature.

[7]  Monica L. Mo,et al.  Global reconstruction of the human metabolic network based on genomic and bibliomic data , 2007, Proceedings of the National Academy of Sciences.

[8]  D. Fell,et al.  Challenges to be faced in the reconstruction of metabolic networks from public databases. , 2006, Systems biology.

[9]  Francisco Marco,et al.  Involvement of polyamines in plant response to abiotic stress , 2006, Biotechnology Letters.

[10]  B. Palsson,et al.  Genome-scale reconstruction of the Saccharomyces cerevisiae metabolic network. , 2003, Genome research.

[11]  D. Fell,et al.  A Genome-Scale Metabolic Model of Arabidopsis and Some of Its Properties1[C][W] , 2009, Plant Physiology.

[12]  Anne Kümmel,et al.  In silico genome-scale reconstruction and validation of the Staphylococcus aureus metabolic network. , 2005, Biotechnology and bioengineering.

[13]  H. Ginsburg Caveat emptor: limitations of the automated reconstruction of metabolic pathways in Plasmodium. , 2009, Trends in parasitology.

[14]  Robert E. Buntrock,et al.  Chemical Registries-in the Fourth Decade of Service , 2001, J. Chem. Inf. Comput. Sci..

[15]  L. Nielsen,et al.  Modeling Hybridoma Cell Metabolism Using a Generic Genome‐Scale Metabolic Model of Mus musculus , 2008, Biotechnology progress.

[16]  P. Schulze-Lefert,et al.  A Glucosinolate Metabolism Pathway in Living Plant Cells Mediates Broad-Spectrum Antifungal Defense , 2009, Science.

[17]  David Rogers,et al.  Cheminformatics analysis and learning in a data pipelining environment , 2006, Molecular Diversity.

[18]  J. Gershenzon,et al.  The secondary metabolism of Arabidopsis thaliana: growing like a weed. , 2005, Current opinion in plant biology.

[19]  Bernhard O. Palsson,et al.  Metabolic Reconstruction and Modeling of Nitrogen Fixation in Rhizobium etli , 2007, PLoS Comput. Biol..

[20]  Michael Darsow,et al.  ChEBI: a database and ontology for chemical entities of biological interest , 2007, Nucleic Acids Res..

[21]  Leonore Reiser,et al.  Huala, E. et al. The Arabidopsis Information Resource (TAIR): a comprehensive database and web-based information retrieval, analysis, and visualization system for a model plant. Nucleic Acids Res. 29, 102-105 , 2001 .

[22]  Vinay Satish Kumar,et al.  A Genome-Scale Metabolic Reconstruction of Mycoplasma genitalium, iPS189 , 2009, PLoS Comput. Biol..

[23]  David A. Fell,et al.  Detection of stoichiometric inconsistencies in biomolecular models , 2008, Bioinform..

[24]  Jun Dong,et al.  Understanding network concepts in modules , 2007, BMC Systems Biology.

[25]  A. Barabasi,et al.  Targets Drug Genomes Identify Novel Antimicrobial Staphylococcus Aureus of Multiple Reconstruction and Flux Balance Analysis Comparative Genome-scale Metabolic Supplemental Material , 2009 .

[26]  Costas D Maranas,et al.  Elucidation and structural analysis of conserved pools for genome-scale metabolic reconstructions. , 2005, Biophysical journal.

[27]  An-Ping Zeng,et al.  Reconstruction of metabolic networks from genome data and analysis of their global structure for various organisms , 2003, Bioinform..

[28]  Michael Hucka,et al.  LibSBML: an API Library for SBML , 2008, Bioinform..

[29]  D. Fell,et al.  The small world inside large metabolic networks , 2000, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[30]  S. Heller,et al.  An Open Standard for Chemical Structure Representation: The IUPAC Chemical Identifier , 2003 .

[31]  J. Nielsen,et al.  Metabolic model integration of the bibliome, genome, metabolome and reactome of Aspergillus niger , 2008, Molecular systems biology.

[32]  P. D. Karp,et al.  The outcomes of pathway database computations depend on pathway ontology , 2006, Nucleic acids research.

[33]  Peter D. Karp,et al.  MetaCyc and AraCyc. Metabolic Pathway Databases for Plant Research1[w] , 2005, Plant Physiology.

[34]  Bernhard Ø Palsson,et al.  Understanding human metabolic physiology: a genome-to-systems approach. , 2009, Trends in biotechnology.

[35]  B. Palsson,et al.  A protocol for generating a high-quality genome-scale metabolic reconstruction , 2010 .

[36]  Fidel Ramírez,et al.  Computing topological parameters of biological networks , 2008, Bioinform..

[37]  Frederick M. Ausubel,et al.  Glucosinolate Metabolites Required for an Arabidopsis Innate Immune Response , 2009, Science.

[38]  P. Willett,et al.  Similarity-based virtual screening using 2D fingerprints. , 2006, Drug discovery today.

[39]  Peter Willett,et al.  Similarity-based virtual screening using 2D fingerprints. , 2006, Drug discovery today.

[40]  Peter Dörmann,et al.  Functional diversity of tocochromanols in plants , 2006, Planta.

[41]  E. Almaas Biological impacts and context of network theory , 2007, Journal of Experimental Biology.

[42]  Sophia Ananiadou,et al.  Learning string similarity measures for gene/protein name dictionary look-up using logistic regression , 2007, Bioinform..

[43]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[44]  L. Quek,et al.  AraGEM, a Genome-Scale Reconstruction of the Primary Metabolic Network in Arabidopsis1[W] , 2009, Plant Physiology.

[45]  Wen Huang,et al.  The Arabidopsis Information Resource (TAIR): a comprehensive database and web-based information retrieval, analysis, and visualization system for a model plant , 2001, Nucleic Acids Res..

[46]  B. Palsson,et al.  An expanded genome-scale model of Escherichia coli K-12 (iJR904 GSM/GPR) , 2003, Genome Biology.

[47]  D. Fell,et al.  Getting to grips with the plant metabolic network. , 2008, The Biochemical journal.

[48]  O. Demin,et al.  The Edinburgh human metabolic network reconstruction and its functional analysis , 2007, Molecular systems biology.

[49]  Markus J. Herrgård,et al.  A consensus yeast metabolic network reconstruction obtained from a community approach to systems biology , 2008, Nature Biotechnology.

[50]  Adam M. Feist,et al.  The growing scope of applications of genome-scale metabolic reconstructions using Escherichia coli , 2008, Nature Biotechnology.

[51]  B. Palsson,et al.  Genome-scale Reconstruction of Metabolic Network in Bacillus subtilis Based on High-throughput Phenotyping and Gene Essentiality Data* , 2007, Journal of Biological Chemistry.

[52]  T. Insel,et al.  NIH Molecular Libraries Initiative , 2004, Science.

[53]  Yoshihiro Yamanishi,et al.  KEGG for linking genomes to life and the environment , 2007, Nucleic Acids Res..