Enhancing genetic algorithm-based genome-scale metabolic network curation efficiency

Genome-scale metabolic modeling using constraint-based analysis is a powerful modeling paradigm for simulating metabolic networks. Models are generated via inference from genome annotations. However, errors in the annotation or the identity of a gene's function could lead to "metabolic inconsistency" rendering simulations infeasible. Uncovering the source of metabolic inconsistency is non-trivial due to network size and complexity. Recently published work uses genetic algorithms for curation by generating pools of models with randomly relaxed mass balance constraints. Models are evolved that allow feasible simulation while minimizing the number of constraints relaxed. Relaxed constraints represent metabolites likely to be the root of metabolic inconsistency. Although effective, the approach can result in numerous false positives. Here we present a strategy, MassChecker, which evaluates all of the relaxed mass balance constraints in each generation prior to the next round of evolution to determine if they had become consistent due to recombination/mutation. If so, these constraints are enforced. This approach was applied to the development of genome-scale metabolic model of B. anthracis. The model consisted of 1,049 reactions and 1,003 metabolites. The result was a 60% reduction in the number of relaxed mass balance constraints, significantly speeding up the curation process.

[1]  U. Sauer,et al.  Stoichiometric growth model for riboflavin-producing Bacillus subtilis. , 2001, Biotechnology and bioengineering.

[2]  D. Missiakas,et al.  Synthesis of Lipoteichoic Acids in Bacillus anthracis , 2012, Journal of bacteriology.

[3]  Petrica C. Pop,et al.  Generalized network design problems : modeling and optimization , 2012 .

[4]  G. Clark,et al.  Reference , 2008 .

[5]  R. Losick,et al.  Bacillus subtilis and Its Closest Relatives , 2002 .

[6]  U. Sauer,et al.  Physiology and metabolic fluxes of wild-type and riboflavin-producing Bacillus subtilis , 1996, Applied and environmental microbiology.

[7]  A. Casadevall,et al.  Bacillus anthracis produces membrane-derived vesicles containing biologically active toxins , 2010, Proceedings of the National Academy of Sciences.

[8]  Ranjan Srivastava,et al.  Semi-automated Curation of Metabolic Models via Flux Balance Analysis: A Case Study with Mycoplasma gallisepticum , 2013, PLoS Comput. Biol..

[9]  D. Lundgren,et al.  Lipid Composition of Bacillus cereus During Growth and Sporulation , 1970, Journal of bacteriology.

[10]  J. Stuy THE NUCLEIC ACIDS OF BACILLUS CEREUS , 1958, Journal of bacteriology.

[11]  P. Hanna,et al.  Bacillus anthracis requires siderophore biosynthesis for growth in macrophages and mouse virulence , 2004, Molecular microbiology.

[12]  B. Palsson,et al.  Genome-scale Reconstruction of Metabolic Network in Bacillus subtilis Based on High-throughput Phenotyping and Gene Essentiality Data* , 2007, Journal of Biological Chemistry.

[13]  Kaisa Miettinen,et al.  On initial populations of a genetic algorithm for continuous optimization problems , 2007, J. Glob. Optim..

[14]  K. Miettinen,et al.  Quasi-random initial population for genetic algorithms , 2004 .