Meneco, a Topology-Based Gap-Filling Tool Applicable to Degraded Genome-Wide Metabolic Networks

Increasing amounts of sequence data are becoming available for a wide range of non-model organisms. Investigating and modelling the metabolic behaviour of those organisms is highly relevant to understanding their biology and ecology. As sequences are often incomplete and poorly annotated, draft networks of their metabolism are largely incomplete. Appropriate gap-filling methods to identify and add missing reactions are therefore required. However, current tools rely on phenotypic or taxonomic information, or are very sensitive to the stoichiometric balance of metabolic reactions, especially with respect to co-factors. This type of information is often unavailable, or at least prone to errors, for newly explored organisms. Here we introduce Meneco, a tool dedicated to the topological gap-filling of genome-scale draft metabolic networks. Meneco reformulates gap-filling as a qualitative combinatorial optimization problem, omitting the stoichiometric constraints considered in other methods, and solves this problem using Answer Set Programming. Run on several artificial test sets comprising 10,800 degraded Escherichia coli networks, Meneco efficiently identified essential reactions missing from networks at high degradation rates, outperforming the stoichiometry-based tools in scalability. To demonstrate the utility of Meneco, we applied it to two case studies. First, its application to recent metabolic networks reconstructed for the brown algal model Ectocarpus siliculosus and an associated bacterium, Candidatus Phaeomarinobacter ectocarpi, revealed several candidate metabolic pathways for algal-bacterial interactions. Second, Meneco was used to reconstruct, from transcriptomic and metabolomic data, the first metabolic network for the microalga Euglena mutabilis. These two case studies show that Meneco is a versatile tool for completing draft genome-scale metabolic networks produced from heterogeneous data, and for suggesting relevant reactions that explain the metabolic capacity of a biological system.
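
The qualitative formulation described above can be illustrated with a minimal Python sketch. It is not Meneco's implementation, which relies on Answer Set Programming; it is a brute-force stand-in, assuming that a metabolite counts as producible when it is a seed or the product of a reaction whose reactants are all producible, and that the goal is a smallest set of repair-database reactions making every target producible. The function names, reaction identifiers and metabolite names are hypothetical.

```python
from itertools import combinations


def scope(reactions, seeds):
    """Return every metabolite topologically producible from the seeds.

    `reactions` maps a reaction id to a (reactants, products) pair of sets.
    Stoichiometric coefficients are deliberately ignored.
    """
    reachable = set(seeds)
    changed = True
    while changed:
        changed = False
        for reactants, products in reactions.values():
            # A reaction fires once all of its reactants are reachable.
            if reactants <= reachable and not products <= reachable:
                reachable |= products
                changed = True
    return reachable


def minimal_completions(draft, repair_db, seeds, targets):
    """Enumerate the smallest sets of repair reactions restoring all targets.

    Exhaustive search by increasing size: only practical for toy inputs.
    """
    candidates = sorted(set(repair_db) - set(draft))
    for size in range(len(candidates) + 1):
        found = []
        for combo in combinations(candidates, size):
            network = dict(draft)
            network.update({r: repair_db[r] for r in combo})
            if targets <= scope(network, seeds):
                found.append(set(combo))
        if found:
            return found
    return []


# Hypothetical toy data: one draft reaction and three candidate repairs.
draft = {"r1": (frozenset({"glc"}), frozenset({"g6p"}))}
repair_db = {
    "r2": (frozenset({"g6p"}), frozenset({"f6p"})),
    "r3": (frozenset({"f6p"}), frozenset({"pyr"})),
    "r4": (frozenset({"x"}), frozenset({"pyr"})),
}
result = minimal_completions(draft, repair_db, {"glc"}, {"pyr"})
print(result)
```

In this toy case the only minimal completion adds `r2` and `r3`; `r4` is never selected because its reactant `x` cannot be produced from the seed. Exhaustive enumeration grows exponentially with the size of the repair database, which is why a dedicated combinatorial solver is needed at genome scale.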
