Consistency analysis and improvement of matabolic databases for the integration of metabolic models

Modern systems biology uses analysis and modeling of large metabolic networks. These models can be assembled by integrating data from sources SBML files and online databases. Sometimes this integration can be challenging, as information can be hidden in human-readable texts or annotational layers not directly accessible with the help of common methods. Here, it is shown how algebraic analysis can be used to unravel structural information hidden in the kinetic laws of SBML models. Additionally, this work will demonstrate the Organization Theory approach and its application for inconsistency detection on the Biomodels Database. The usefulness of combining algebraic analysis and OT is shown by comparing the gathered results with data originating from other methods, like FBA. It is shown how scientific methods can be prone to an incorrect interpretation of the data given as well as their representation format. Complementing the analysis of data given in SBML model files, we also present a tool designed to help identifying microbial communities suited to perform biodegradation tasks. The preliminaries needed to perform such a task are discussed together with problems that hinder automatic solution of metabolic research questions. Problems usually occuring in the work with databases are specified and investigated in dato; using the KEGG databse as the main source. Strategies to circumvent the found problems by rule-based network descriptions are sketched out. A detailed description of the idea of rule-based databases for metabolic and biological data will be given. Subsequently, possible applications are listed, giving examples for reasonably simple models. A new formalism will be presented which might suit the task better than more general formalisms like BGNL, which is indeed a very powerful, yet rather tedious methodology. Finally, we will give an account of the advantages and challenges of networks modeled with the rule-based description introduced in this work.

[1]  F. Srienc,et al.  Elementary mode analysis: a useful metabolic pathway analysis tool for characterizing cellular metabolism , 2009, Applied Microbiology and Biotechnology.

[2]  Leen Stougie,et al.  Graph-Based Analysis of the Metabolic Exchanges between Two Co-Resident Intracellular Symbionts, Baumannia cicadellinicola and Sulcia muelleri, with Their Insect Host, Homalodisca coagulata , 2010, PLoS Comput. Biol..

[3]  Jochen H M Prehn,et al.  Systems analysis of effector caspase activation and its control by X‐linked inhibitor of apoptosis protein , 2006, The EMBO journal.

[4]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[5]  Christoph Steinbeck,et al.  The ChEBI reference database and ontology for biologically relevant chemistry: enhancements for 2013 , 2012, Nucleic Acids Res..

[6]  James R Faeder,et al.  Rule-based modeling of biochemical systems with BioNetGen. , 2009, Methods in molecular biology.

[7]  Christoph Kaleta,et al.  Computing chemical organizations in biological networks , 2008, Bioinform..

[8]  R. K. De,et al.  Comparing methods for metabolic network analysis and an application to metabolic engineering. , 2013, Gene.

[9]  Christoph Kaleta,et al.  Phenotype prediction in regulated metabolic networks , 2008, BMC Systems Biology.

[10]  Ursula Kummer,et al.  A model of the oscillatory metabolism of activated neutrophils. , 2003, Biophysical journal.

[11]  B. Palsson,et al.  Systems approach to refining genome annotation , 2006, Proceedings of the National Academy of Sciences.

[12]  Christoph Steinbeck,et al.  Rhea—a manually curated resource of biochemical reactions , 2011, Nucleic Acids Res..

[13]  D. Fell,et al.  Detection of elementary flux modes in biochemical networks: a promising tool for pathway analysis and metabolic engineering. , 1999, Trends in biotechnology.

[14]  B O Palsson,et al.  Metabolic modeling of microbial strains in silico. , 2001, Trends in biochemical sciences.

[15]  Frederick Soddy,et al.  Atomic transmutation : the greatest discovery ever made from memoirs of Frederick Soddy , 1953 .

[16]  J. Timmer,et al.  Data-Based Mathematical Modeling of Vectorial Transport across Double-Transfected Polarized Cells , 2007, Drug Metabolism And Disposition.

[17]  A. Valencia,et al.  Systemic approaches to biodegradation. , 2009, FEMS microbiology reviews.

[18]  Melanie I. Stefan,et al.  BioModels Database: An enhanced, curated and annotated resource for published quantitative kinetic models , 2010, BMC Systems Biology.

[19]  Satoru Shiono,et al.  Control mechanism of JAK/STAT signal transduction pathway , 2003, FEBS letters.

[20]  Susumu Goto,et al.  KEGG for integration and interpretation of large-scale molecular data sets , 2011, Nucleic Acids Res..

[21]  Thomas Bäck,et al.  Evolutionary algorithms in theory and practice - evolution strategies, evolutionary programming, genetic algorithms , 1996 .

[22]  A Goldbeter,et al.  A minimal cascade model for the mitotic oscillator involving cyclin and cdc2 kinase. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[23]  A Goldbeter,et al.  Complex intracellular calcium oscillations. A theoretical exploration of possible mechanisms. , 1997, Biophysical chemistry.

[24]  Oliver Ebenhöh,et al.  Expanding Metabolic Networks: Scopes of Compounds, Robustness, and Evolution , 2005, Journal of Molecular Evolution.

[25]  W. Fontana,et al.  “The arrival of the fittest”: Toward a theory of biological organization , 1994 .

[26]  Hiroaki Kitano,et al.  The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models , 2003, Bioinform..

[27]  M. Feldman,et al.  Large-scale reconstruction and phylogenetic analysis of metabolic environments , 2008, Proceedings of the National Academy of Sciences.

[28]  Encoding Rules,et al.  SMILES, a Chemical Language and Information System. 1. Introduction to Methodology , 1988 .

[29]  D. Kim,et al.  A hidden oncogenic positive feedback loop caused by crosstalk between Wnt and ERK Pathways , 2007, Oncogene.

[30]  Kiyoko F. Aoki-Kinoshita,et al.  From genomics to chemical genomics: new developments in KEGG , 2005, Nucleic Acids Res..

[31]  Peter Dittrich,et al.  Analyzing molecular reaction networks , 2006, Molecular biotechnology.

[32]  P. Karp,et al.  Genome annotation errors in pathway databases due to semantic ambiguity in partial EC numbers , 2005, Nucleic acids research.

[33]  James R. Faeder,et al.  Compartmental rule-based modeling of biochemical systems , 2009, Proceedings of the 2009 Winter Simulation Conference (WSC).

[34]  M. Feinberg,et al.  Dynamics of open chemical systems and the algebraic structure of the underlying reaction network , 1974 .

[35]  Costas D. Maranas,et al.  MetRxn: a knowledgebase of metabolites and reactions spanning metabolic models and databases , 2012, BMC Bioinformatics.

[36]  G.E. Moore,et al.  Cramming More Components Onto Integrated Circuits , 1998, Proceedings of the IEEE.

[37]  J. Rohwer Kinetic modelling of plant metabolic pathways. , 2012, Journal of experimental botany.

[38]  Bernhard O. Palsson,et al.  BiGG: a Biochemical Genetic and Genomic knowledgebase of large scale metabolic reconstructions , 2010, BMC Bioinformatics.

[39]  Axel Drefahl,et al.  CurlySMILES: a chemical language to customize and annotate encodings of molecular and nanodevice structures , 2011, J. Cheminformatics.

[40]  Alfonso Valencia,et al.  The organization of the microbial biodegradation network from a systems‐biology perspective , 2003, EMBO reports.

[41]  Peter Dittrich,et al.  Chemical organizations in atmospheric photochemistries—A new method to analyze chemical reaction networks , 2007 .

[42]  Peter D. Karp,et al.  A survey of metabolic databases emphasizing the MetaCyc family , 2011, Archives of Toxicology.

[43]  G. Valiente,et al.  Validation of metabolic pathway databases based on chemical substructure search. , 2007, Biomolecular engineering.

[44]  Adam M. Feist,et al.  The biomass objective function. , 2010, Current opinion in microbiology.

[45]  M. Kanehisa A database for post-genome analysis. , 1997, Trends in genetics : TIG.

[46]  Víctor de Lorenzo,et al.  Systems biology approaches to bioremediation. , 2008 .

[47]  Jacky L. Snoep,et al.  BioModels Database: a free, centralized database of curated, published, quantitative kinetic models of biochemical and cellular systems , 2005, Nucleic Acids Res..

[48]  Annegret Wagler,et al.  A mathematical approach to solve the network reconstruction problem , 2008, Math. Methods Oper. Res..

[49]  A. Jayaraman,et al.  Modeling regulatory mechanisms in IL‐6 signal transduction in hepatocytes , 2006, Biotechnology and bioengineering.

[50]  Vinay Satish Kumar,et al.  Optimization based automated curation of metabolic reconstructions , 2007, BMC Bioinformatics.

[51]  D. Fell,et al.  A general definition of metabolic pathways useful for systematic organization and analysis of complex metabolic networks , 2000, Nature Biotechnology.

[52]  Leen Stougie,et al.  Enumerating Precursor Sets of Target Metabolites in a Metabolic Network , 2008, WABI.

[53]  Christoph Kaleta,et al.  Using chemical organization theory for model checking , 2009, Bioinform..

[54]  B. Palsson,et al.  Theory for the systemic definition of metabolic pathways and their use in interpreting metabolic function from a pathway-oriented perspective. , 2000, Journal of theoretical biology.

[55]  Jeffrey D Orth,et al.  What is flux balance analysis? , 2010, Nature Biotechnology.

[56]  Thomas Hinze,et al.  Toward Organization-Oriented Chemical Programming: A Case Study with the Maximal Independent Set Problem , 2007, Advances in Biologically Inspired Information Systems.

[57]  Angel Rubio,et al.  Computing the shortest elementary flux modes in genome-scale metabolic networks , 2009, Bioinform..

[58]  Sharene D. Bungay,et al.  A mathematical model of lipid-mediated thrombin generation. , 2003, Mathematical medicine and biology : a journal of the IMA.

[59]  Gert Vriend,et al.  Correcting ligands, metabolites, and pathways , 2006, BMC Bioinformatics.

[60]  D. Kell,et al.  The Kyoto Encyclopedia of Genes and Genomes—KEGG , 2000, Yeast.

[61]  B. Palsson,et al.  Metabolic Flux Balancing: Basic Concepts, Scientific and Practical Use , 1994, Bio/Technology.

[62]  Wolfgang Marwan,et al.  Theory of time-resolved somatic complementation and its use to explore the sporulation control network in Physarum polycephalum. , 2003, Genetics.

[63]  Elhanan Borenstein,et al.  Topological Signatures of Species Interactions in Metabolic Networks , 2009, J. Comput. Biol..

[64]  Stephen R. Heller,et al.  InChI - the worldwide chemical structure identifier standard , 2013, Journal of Cheminformatics.