Principal elementary mode analysis (PEMA).

Principal component analysis (PCA) has been widely applied in fluxomics to compress data into a few latent structures in order to simplify the identification of metabolic patterns. These latent structures lack a direct biological interpretation due to the intrinsic constraints associated with a PCA model. Here we introduce a new method that significantly improves the interpretability of the principal components with a direct link to metabolic pathways. This method, called principal elementary mode analysis (PEMA), establishes a bridge between a PCA-like model, aimed at explaining the maximum variance in flux data, and the set of elementary modes (EMs) of a metabolic network. It provides an easy way to identify metabolic patterns in large fluxomics datasets in terms of the simplest pathways of the organism metabolism. The results using a real metabolic model of Escherichia coli show the ability of PEMA to identify the EMs that generated the different simulated flux distributions. Actual flux data of E. coli and Pichia pastoris cultures confirm the results observed in the simulated study, providing a biologically meaningful model to explain flux data of both organisms in terms of the EM activation. The PEMA toolbox is freely available for non-commercial purposes on http://mseg.webs.upv.es.

[1]  Johannes Stadlmann,et al.  A multi-level study of recombinant Pichia pastoris in different oxygen conditions , 2010, BMC Systems Biology.

[2]  José Camacho,et al.  Cross-validation in PCA models with the element-wise k-fold (ekf) algorithm: Practical aspects , 2014 .

[3]  J. Stelling,et al.  Combinatorial Complexity of Pathway Analysis in Metabolic Networks , 2004, Molecular Biology Reports.

[4]  Annik Nanchen,et al.  Nonlinear Dependency of Intracellular Fluxes on Growth Rate in Miniaturized Continuous Cultures of Escherichia coli , 2006, Applied and Environmental Microbiology.

[5]  Berna Sarıyar,et al.  Monte Carlo sampling and principal component analysis of flux distributions yield topological and modular information on metabolic networks. , 2006, Journal of theoretical biology.

[6]  D. Ramkrishna,et al.  Reduction of a set of elementary modes using yield analysis , 2009, Biotechnology and bioengineering.

[7]  Age K. Smilde,et al.  Principal Component Analysis , 2003, Encyclopedia of Machine Learning.

[8]  Pei Yee Ho,et al.  Multiple High-Throughput Analyses Monitor the Response of E. coli to Perturbations , 2007, Science.

[9]  Jesús Picó,et al.  Validation of a constraint-based model of Pichia pastoris metabolism under data scarcity , 2010, BMC Systems Biology.

[10]  D. Fell,et al.  A general definition of metabolic pathways useful for systematic organization and analysis of complex metabolic networks , 2000, Nature Biotechnology.

[11]  José Camacho,et al.  Cross‐validation in PCA models with the element‐wise k‐fold (ekf) algorithm: theoretical aspects , 2012 .

[12]  Michael Sauer,et al.  The effect of temperature on the proteome of recombinant Pichia pastoris. , 2009, Journal of proteome research.

[13]  Jungoh Ahn,et al.  Genome-scale metabolic reconstruction and in silico analysis of methylotrophic yeast Pichia pastoris for strain improvement , 2010, Microbial cell factories.

[14]  D. Fell,et al.  Reaction routes in biochemical reaction systems: Algebraic properties, validated calculation procedure and example from nucleotide metabolism , 2002, Journal of mathematical biology.

[15]  Minoru Kanehisa,et al.  A quadratic programming approach for decomposing steady-state metabolic flux distributions onto elementary modes , 2005, ECCB/JBI.

[16]  Abel Folch-Fortuny,et al.  Metabolic Flux Understanding of Pichia pastoris Grown on Heterogenous , 2014 .

[17]  F. Llaneras,et al.  Stoichiometric modelling of cell metabolism. , 2008, Journal of bioscience and bioengineering.

[18]  B. Palsson,et al.  Reconstructing metabolic flux vectors from extreme pathways: defining the α-spectrum , 2003 .

[19]  Romà Tauler,et al.  MCR-ALS GUI 2.0: New features and applications , 2015 .

[20]  U. Sauer,et al.  Cyclic AMP-Dependent Catabolite Repression Is the Dominant Control Mechanism of Metabolic Fluxes under Glucose Limitation in Escherichia coli , 2008, Journal of bacteriology.

[21]  Lake-Ee Quek,et al.  A depth-first search algorithm to compute elementary flux modes by linear programming , 2014, BMC Systems Biology.

[22]  Bernhard O. Palsson,et al.  Decomposing complex reaction networks using random sampling, principal component analysis and basis rotation , 2009, BMC Systems Biology.

[23]  R. Carlson,et al.  Fundamental Escherichia coli biochemical pathways for biomass and energy production: Identification of reactions , 2004, Biotechnology and bioengineering.

[24]  Ryo Tsuboi,et al.  Complementary elementary modes for fast and efficient analysis of metabolic networks , 2014 .

[25]  Jesús Picó,et al.  MCR-ALS on metabolic networks: Obtaining more meaningful pathways , 2015 .