Estimating the size of the solution space of metabolic networks

BackgroundCellular metabolism is one of the most investigated system of biological interactions. While the topological nature of individual reactions and pathways in the network is quite well understood there is still a lack of comprehension regarding the global functional behavior of the system. In the last few years flux-balance analysis (FBA) has been the most successful and widely used technique for studying metabolism at system level. This method strongly relies on the hypothesis that the organism maximizes an objective function. However only under very specific biological conditions (e.g. maximization of biomass for E. coli in reach nutrient medium) the cell seems to obey such optimization law. A more refined analysis not assuming extremization remains an elusive task for large metabolic systems due to algorithmic limitations.ResultsIn this work we propose a novel algorithmic strategy that provides an efficient characterization of the whole set of stable fluxes compatible with the metabolic constraints. Using a technique derived from the fields of statistical physics and information theory we designed a message-passing algorithm to estimate the size of the affine space containing all possible steady-state flux distributions of metabolic networks. The algorithm, based on the well known Bethe approximation, can be used to approximately compute the volume of a non full-dimensional convex polytope in high dimensions. We first compare the accuracy of the predictions with an exact algorithm on small random metabolic networks. We also verify that the predictions of the algorithm match closely those of Monte Carlo based methods in the case of the Red Blood Cell metabolic network. Then we test the effect of gene knock-outs on the size of the solution space in the case of E. coli central metabolism. Finally we analyze the statistical properties of the average fluxes of the reactions in the E. coli metabolic network.ConclusionWe propose a novel efficient distributed algorithmic strategy to estimate the size and shape of the affine space of a non full-dimensional convex polytope in high dimensions. The method is shown to obtain, quantitatively and qualitatively compatible results with the ones of standard algorithms (where this comparison is possible) being still efficient on the analysis of large biological systems, where exact deterministic methods experience an explosion in algorithmic time. The algorithm we propose can be considered as an alternative to Monte Carlo sampling methods.

[1]  David Avis,et al.  A pivoting algorithm for convex hulls and vertex enumeration of arrangements and polyhedra , 1991, SCG '91.

[2]  H. J. Greenberg,et al.  Monte Carlo sampling can be used to determine the size and shape of the steady-state flux space. , 2004, Journal of theoretical biology.

[3]  M. Mézard,et al.  Analytic and Algorithmic Solution of Random Satisfiability Problems , 2002, Science.

[4]  B. Palsson,et al.  Metabolic capabilities of Escherichia coli: I. synthesis of biosynthetic precursors and cofactors. , 1993, Journal of theoretical biology.

[5]  William T. Freeman,et al.  Correctness of Belief Propagation in Gaussian Graphical Models of Arbitrary Topology , 1999, Neural Computation.

[6]  Dongxiao Zhu,et al.  BMC Bioinformatics BioMed Central , 2005 .

[7]  B. Palsson,et al.  Candidate Metabolic Network States in Human Mitochondria , 2005, Journal of Biological Chemistry.

[8]  X. Jin Factor graphs and the Sum-Product Algorithm , 2002 .

[9]  M. .. Moore Exactly Solved Models in Statistical Mechanics , 1983 .

[10]  Riccardo Zecchina,et al.  Survey propagation: An algorithm for satisfiability , 2002, Random Struct. Algorithms.

[11]  A. Barabasi,et al.  Global organization of metabolic fluxes in the bacterium Escherichia coli , 2004, Nature.

[12]  B. Palsson,et al.  An expanded genome-scale model of Escherichia coli K-12 (iJR904 GSM/GPR) , 2003, Genome Biology.

[13]  M. Mézard,et al.  The Bethe lattice spin glass revisited , 2000, cond-mat/0009418.

[14]  B. Palsson,et al.  Uniform sampling of steady-state flux spaces: means to design experiments and to interpret enzymopathies. , 2004, Biophysical journal.

[15]  B. Palsson,et al.  Escherichia coli K-12 undergoes adaptive evolution to achieve in silico predicted optimal growth , 2002, Nature.

[16]  R Monasson,et al.  Von Neumann’s expanding model on random graphs , 2007, cond-mat/0703343.

[17]  W. Freeman,et al.  Generalized Belief Propagation , 2000, NIPS.

[18]  G. Church,et al.  Analysis of optimality in natural and perturbed metabolic networks , 2002 .

[19]  Komei Fukuda,et al.  Exact volume computation for polytopes: a practical study , 1996 .

[20]  Riccardo Zecchina,et al.  Coloring random graphs , 2002, Physical review letters.

[21]  Riccardo Zecchina,et al.  Viable flux distribution in metabolic networks , 2008, Networks Heterog. Media.

[22]  H. Qian,et al.  Thermodynamic constraints for biochemical networks. , 2004, Journal of theoretical biology.

[23]  Martin E. Dyer,et al.  On the Complexity of Computing the Volume of a Polyhedron , 1988, SIAM J. Comput..

[24]  L. Khachiyan Complexity of Polytope Volume Computation , 1993 .

[25]  Bernhard Ø Palsson,et al.  Candidate states of Helicobacter pylori's genome-scale metabolic network upon application of "loop law" thermodynamic constraints. , 2006, Biophysical journal.

[26]  R. Albert,et al.  The large-scale organization of metabolic networks , 2000, Nature.

[27]  Brendan J. Frey,et al.  Factor graphs and the sum-product algorithm , 2001, IEEE Trans. Inf. Theory.

[28]  David J. C. MacKay,et al.  Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[29]  D. Fell,et al.  The small world of metabolism , 2000, Nature Biotechnology.

[30]  Rüdiger L. Urbanke,et al.  The capacity of low-density parity-check codes under message-passing decoding , 2001, IEEE Trans. Inf. Theory.

[31]  Miklós Simonovits,et al.  How to compute the volume in high dimension? , 2003, Math. Program..

[32]  Kiyoko F. Aoki-Kinoshita,et al.  From genomics to chemical genomics: new developments in KEGG , 2005, Nucleic Acids Res..

[33]  M. Mézard,et al.  The Cavity Method at Zero Temperature , 2002, cond-mat/0207121.