MCR-ALS on metabolic networks: Obtaining more meaningful pathways

Abstract With the aim of understanding the flux distributions across a metabolic network, i.e. within living cells, Principal Component Analysis (PCA) has been proposed to obtain a set of orthogonal components (pathways) capturing most of the variance in the flux data. The problems with this method are (i) that no additional information can be included in the model, and (ii) that orthogonality imposes a hard constraint, not always reasonably. To overcome these drawbacks, here we propose to use a more flexible approach such as Multivariate Curve Resolution-Alternating Least Squares (MCR-ALS) to obtain this set of biological pathways through the network. By using this method, different constraints can be included in the model, and the same source of variability can be present in different pathways, which is reasonable from a biological standpoint. This work follows a methodology developed for Pichia pastoris cultures grown on different carbon sources, lately presented in Gonzalez-Martinez et al. (2014). In this paper a different grey modelling approach, which aims to incorporate a priori knowledge through constraints on the modelling algorithms, is applied to the same case of study. The results of both models are compared to show their strengths and weaknesses.

[1]  Dong Xu,et al.  Identification of transcription factor's targets using tissue-specific transcriptomic data in Arabidopsis thaliana , 2010, BMC Systems Biology.

[2]  Julian Morris,et al.  Curve resolution for multivariate images with applications to TOF-SIMS and Raman , 2004 .

[3]  Romà Tauler,et al.  Chemometrics applied to unravel multicomponent processes and mixtures: Revisiting latest trends in multivariate resolution , 2003 .

[4]  C. Wandrey,et al.  Recombinant Protein Production with Pichia pastoris in Continuous Fermentation – Kinetic Analysis of Growth and Product Formation , 2001 .

[5]  B. Kowalski,et al.  Selectivity, local rank, three‐way data analysis and ambiguity in multivariate curve resolution , 1995 .

[6]  Age K. Smilde,et al.  Improved monitoring of batch processes by incorporating external information , 2002 .

[7]  K. Bellgardt,et al.  Macrokinetic model for methylotrophic Pichia pastoris based on stoichiometric balance. , 2003, Journal of biotechnology.

[8]  Berna Sarıyar,et al.  Monte Carlo sampling and principal component analysis of flux distributions yield topological and modular information on metabolic networks. , 2006, Journal of theoretical biology.

[9]  J. Nielsen,et al.  Industrial systems biology. , 2010, Biotechnology and bioengineering.

[10]  Ronan M. T. Fleming,et al.  Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox v2.0 , 2007, Nature Protocols.

[11]  Gabriel Potvin,et al.  Bioprocess engineering aspects of heterologous protein production in Pichia pastoris: A review , 2012 .

[12]  Romà Tauler,et al.  A graphical user-friendly interface for MCR-ALS: a new tool for multivariate curve resolution in MATLAB , 2005 .

[13]  Wolfgang Wiechert,et al.  Determination of flux directions by thermodynamic network analysis: Computing informative metabolite pools , 2011, Math. Comput. Simul..

[14]  R. Tauler,et al.  Application of multivariate curve resolution alternating least squares (MCR-ALS) to the quantitative analysis of pharmaceutical and agricultural samples. , 2008, Talanta.

[15]  Wenhui Zhang,et al.  Optimization of cell density and dilution rate in Pichia pastoris continuous fermentations for production of recombinant proteins , 2004, Journal of Industrial Microbiology and Biotechnology.

[16]  Markus J. Herrgård,et al.  Integrating high-throughput and computational data elucidates bacterial networks , 2004, Nature.

[17]  B. Schilling,et al.  Scale‐Up of a High Cell Density Continuous Culture with Pichiapastoris X‐33 for the Constitutive Expression of rh‐Chitinase , 2001, Biotechnology progress (Print).

[18]  F. Llaneras,et al.  Stoichiometric modelling of cell metabolism. , 2008, Journal of bioscience and bioengineering.

[19]  R. Tauler Multivariate curve resolution applied to second order data , 1995 .

[20]  Bernhard O. Palsson,et al.  Decomposing complex reaction networks using random sampling, principal component analysis and basis rotation , 2009, BMC Systems Biology.

[21]  Jean-Pierre A. Kocher,et al.  A new statistic for identifying batch effects in high-throughput genomic data that uses guided principal component analysis , 2013, Bioinform..

[22]  L. Zadeh Fuzzy sets as a basis for a theory of possibility , 1999 .

[23]  Riccardo Leardi,et al.  Nature-Inspired Methods in Chemometrics: Genetic Algorithms and Artificial Neural Networks , 2005 .

[24]  A. Daugulis,et al.  A rational approach to improving productivity in recombinant Pichia pastoris fermentation. , 2001, Biotechnology and bioengineering.

[25]  Itziar Ruisánchez,et al.  MCR-ALS for sequential estimation of FTIR-ATR spectra to resolve a curing process using global phase angle convergence criterion. , 2009, Analytica chimica acta.

[26]  Abel Folch-Fortuny,et al.  Metabolic Flux Understanding of Pichia pastoris Grown on Heterogenous , 2014 .

[27]  Kenneth J. Kauffman,et al.  Advances in flux balance analysis. , 2003, Current opinion in biotechnology.

[28]  J. E. Jackson A User's Guide to Principal Components , 1991 .

[29]  Michael Sauer,et al.  The effect of temperature on the proteome of recombinant Pichia pastoris. , 2009, Journal of proteome research.

[30]  Eytan Ruppin,et al.  Flux balance analysis accounting for metabolite dilution , 2010, Genome Biology.

[31]  Volker Hecht,et al.  Metabolic flux analysis of Escherichia coli in glucose-limited continuous culture. I. Growth-rate-dependent metabolic efficiency at steady state. , 2005, Microbiology.

[32]  José Manuel Amigo,et al.  ChroMATHography: solving chromatographic issues with mathematical models and intuitive graphics. , 2010, Chemical reviews.

[33]  S. Van Dien,et al.  Theoretical analysis of amino acid-producing Escherichia coli using a stoichiometric model and multivariate linear regression. , 2006, Journal of bioscience and bioengineering.

[34]  A. Ferrer,et al.  Prostate functional magnetic resonance image analysis using multivariate curve resolution methods , 2014 .

[35]  Tieliu Shi,et al.  A comparison of batch effect removal methods for enhancement of prediction performance using MAQC-II microarray gene expression data , 2010, The Pharmacogenomics Journal.

[36]  Rui Oliveira,et al.  Design of pathway-level bioprocess monitoring and control strategies supported by metabolic networks. , 2013, Advances in biochemical engineering/biotechnology.

[37]  Francisco Llaneras Estrada,et al.  Interval and Possibilistic Methods for Constraint-Based Metabolic Models , 2011 .

[38]  S. Feyo de Azevedo,et al.  Hybrid modelling of biochemical processes: A comparison with the conventional approach , 1997 .

[39]  Urs von Stockar,et al.  Mixed feeds of glycerol and methanol can improve the performance of Pichia pastoris cultures: A quantitative study based on concentration gradients in transient continuous cultures. , 2007, Journal of biotechnology.

[40]  Meng Wang,et al.  Context based mixture model for cell phase identification in automated fluorescence microscopy , 2007, BMC Bioinformatics.

[41]  W. Windig,et al.  Interactive self-modeling mixture analysis , 1991 .

[42]  J. Nielsen,et al.  Integration of gene expression data into genome-scale metabolic models. , 2004, Metabolic engineering.

[43]  Holger Fröhlich,et al.  Modeling ERBB receptor-regulated G1/S transition to find novel targets for de novo trastuzumab resistance , 2009, BMC Systems Biology.

[44]  Udo Reichl,et al.  How can measurement, monitoring, modeling and control advance cell culture in industrial biotechnology? , 2012, Biotechnology journal.

[45]  M. Mesarovic,et al.  Search for organising principles: understanding in systems biology. , 2004, Systems biology.

[46]  Jason A. Papin,et al.  Genome-scale microbial in silico models: the constraints-based approach. , 2003, Trends in biotechnology.

[47]  Isabel Rocha,et al.  Exploring the gap between dynamic and constraint-based models of metabolism. , 2012, Metabolic engineering.

[48]  Aina Solà i Rodrigo Estudi del metabolisme central del carboni de Pichia pastoris , 2006 .

[49]  Alberto Ferrer,et al.  MIA and NIR Chemical Imaging for pharmaceutical product characterization , 2012 .

[50]  Romà Tauler,et al.  Multivariate Curve Resolution (MCR) from 2000: Progress in Concepts and Applications , 2006 .

[51]  Joel S. Parker,et al.  Adjustment of systematic microarray data biases , 2004, Bioinform..

[52]  J. Cregg,et al.  Heterologous protein expression in the methylotrophic yeast Pichia pastoris. , 2000, FEMS microbiology reviews.

[53]  Thomas E Marlin,et al.  Process Control , 1995 .

[54]  B. Palsson,et al.  Regulation of gene expression in flux balance models of metabolism. , 2001, Journal of theoretical biology.

[55]  Cheng Li,et al.  Adjusting batch effects in microarray expression data using empirical Bayes methods. , 2007, Biostatistics.

[56]  Romà Tauler,et al.  Exploratory data analysis of DNA microarrays by multivariate curve resolution. , 2006, Analytical biochemistry.

[57]  W. Windig,et al.  Two-Way Data Analysis: Detection of Purest Variables , 2020, Comprehensive Chemometrics.

[58]  B. Palsson Systems Biology: Properties of Reconstructed Networks , 2006 .

[59]  D. Bertrand,et al.  Feature selection by a genetic algorithm. Application to seed discrimination by artificial vision , 1998 .

[60]  Thomas Szyperski,et al.  Metabolic flux profiling of Pichia pastoris grown on glycerol/methanol mixtures in chemostat cultures at low and high dilution rates. , 2007, Microbiology.

[61]  Jesús Picó,et al.  Validation of a constraint-based model of Pichia pastoris metabolism under data scarcity , 2010, BMC Systems Biology.

[62]  Jesús Picó,et al.  Estimation of recombinant protein production in Pichia pastoris based on a constraint-based model , 2012 .

[63]  Antonio Sala,et al.  A possibilistic framework for constraint-based metabolic flux analysis , 2009, BMC Systems Biology.

[64]  D. Massart,et al.  Orthogonal projection approach applied to peak purity assessment. , 1996, Analytical chemistry.

[65]  G. Stephanopoulos,et al.  Metabolic Engineering: Principles And Methodologies , 1998 .