Context-Specific Metabolic Model Extraction Based on Regularized Least Squares Optimization

Genome-scale metabolic models have proven highly valuable in investigating cell physiology. Recent advances include the development of methods to extract context-specific models capable of describing metabolism under more specific scenarios (e.g., cell types). Yet, none of the existing computational approaches allows for a fully automated model extraction and determination of a flux distribution independent of user-defined parameters. Here we present RegrEx, a fully automated approach that relies solely on context-specific data and ℓ1-norm regularization to extract a context-specific model and to provide a flux distribution that maximizes its correlation to data. Moreover, the publically available implementation of RegrEx was used to extract 11 context-specific human models using publicly available RNAseq expression profiles, Recon1 and also Recon2, the most recent human metabolic model. The comparison of the performance of RegrEx and its contending alternatives demonstrates that the proposed method extracts models for which both the structure, i.e., reactions included, and the flux distributions are in concordance with the employed data. These findings are supported by validation and comparison of method performance on additional data not used in context-specific model extraction. Therefore, our study sets the ground for applications of other regularization techniques in large-scale metabolic modeling.

[1]  R. Mahadevan,et al.  The effects of alternate optimal solutions in constraint-based genome-scale metabolic models. , 2003, Metabolic engineering.

[2]  L. Lash Role of glutathione transport processes in kidney function. , 2005, Toxicology and applied pharmacology.

[3]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[4]  Barbara M. Bakker,et al.  Unraveling the complexity of flux regulation: A new method demonstrated for nutrient starvation in Saccharomyces cerevisiae , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[5]  P. Arner,et al.  Fatty acid metabolism in adipose tissue, muscle and liver in health and disease. , 2006, Essays in biochemistry.

[6]  Terence Tao,et al.  The Dantzig selector: Statistical estimation when P is much larger than n , 2005, math/0506081.

[7]  Monica L. Mo,et al.  Global reconstruction of the human metabolic network based on genomic and bibliomic data , 2007, Proceedings of the National Academy of Sciences.

[8]  Bernhard O. Palsson,et al.  Context-Specific Metabolic Networks Are Consistent with Experiments , 2008, PLoS Comput. Biol..

[9]  Markus J. Herrgård,et al.  Network-based prediction of human tissue-specific metabolism , 2008, Nature Biotechnology.

[10]  T. Hesterberg,et al.  Least angle and ℓ1 penalized regression: A review , 2008, 0802.0964.

[11]  Michael C. Jewett,et al.  Linking high-resolution metabolic flux phenotypes and transcriptional regulation in yeast modulated by the global regulator Gcn4p , 2009, Proceedings of the National Academy of Sciences.

[12]  Bernhard O. Palsson,et al.  BiGG: a Biochemical Genetic and Genomic knowledgebase of large scale metabolic reconstructions , 2010, BMC Bioinformatics.

[13]  Eytan Ruppin,et al.  iMAT: an integrative metabolic analysis tool , 2010, Bioinform..

[14]  L. Quek,et al.  C4GEM, a Genome-Scale Metabolic Model to Study C4 Plant Metabolism1[W][OA] , 2010, Plant Physiology.

[15]  Jeffrey D Orth,et al.  What is flux balance analysis? , 2010, Nature Biotechnology.

[16]  E. Ruppin,et al.  Computational reconstruction of tissue-specific metabolic models: application to human liver metabolism , 2010, Molecular systems biology.

[17]  Patrick F Suthers,et al.  Construction of an E. Coli genome‐scale atom mapping model for MFA calculations , 2011, Biotechnology and bioengineering.

[18]  Ronan M. T. Fleming,et al.  COBRA Toolbox 2.0 , 2011 .

[19]  Jason A. Papin,et al.  TIGER: Toolbox for integrating genome-scale metabolic models, expression data, and transcriptional regulatory networks , 2011, BMC Systems Biology.

[20]  Jamey D. Young,et al.  Mapping photoautotrophic metabolism with isotopically nonstationary (13)C flux analysis. , 2011, Metabolic engineering.

[21]  R. Tibshirani,et al.  Regression shrinkage and selection via the lasso: a retrospective , 2011 .

[22]  Nathan D. Price,et al.  Reconstruction of genome-scale metabolic models for 126 human tissues using mCADRE , 2012, BMC Systems Biology.

[23]  Neil Swainston,et al.  Improving metabolic flux predictions using absolute gene expression data , 2012, BMC Systems Biology.

[24]  Jason A. Papin,et al.  Integration of expression data in genome-scale metabolic network reconstructions , 2012, Front. Physio..

[25]  Gregory Stephanopoulos,et al.  CorridendumCorrigendum to “Mapping photoautotrophic metabolism with isotopically nonstationary 13C flux analysis” [Metab. Eng. 13 (2011) 656–665] , 2012 .

[26]  Ugur Sahin,et al.  RNA-Seq Atlas - a reference database for gene expression profiling in normal tissue by next-generation sequencing , 2012, Bioinform..

[27]  S. K. Masakapalli,et al.  Strategies for investigating the plant metabolic network with steady-state metabolic flux analysis: lessons from an Arabidopsis cell culture and other systems. , 2012, Journal of experimental botany.

[28]  B. Palsson,et al.  Constraining the metabolic genotype–phenotype relationship using a phylogeny of in silico methods , 2012, Nature Reviews Microbiology.

[29]  Natapol Pornputtapong,et al.  Reconstruction of Genome-Scale Active Metabolic Networks for 69 Human Cell Types and 16 Cancer Types Using INIT , 2012, PLoS Comput. Biol..

[30]  Guy-Bart Stan,et al.  Reconstruction of arbitrary biochemical reaction networks: A compressive sensing approach , 2012, 2012 IEEE 51st IEEE Conference on Decision and Control (CDC).

[31]  Nathan E Lewis,et al.  Analysis of omics data with genome-scale models of metabolism. , 2013, Molecular bioSystems.

[32]  T. Fearn Ridge Regression , 2013 .

[33]  M. Bhushan,et al.  A Compressed Sensing Based Basis-pursuit Formulation of the Room Algorithm , 2013 .

[34]  Bernhard O. Palsson,et al.  GIM3E: condition-specific models of cellular metabolism developed from metabolomics and expression data , 2013, Bioinform..

[35]  Concha Bielza,et al.  A Survey of L1 Regression , 2013 .

[36]  Ronan M. T. Fleming,et al.  A community-driven global reconstruction of human metabolism , 2013, Nature Biotechnology.

[37]  Björn H. Junker,et al.  Multiscale Metabolic Modeling: Dynamic Flux Balance Analysis on a Whole-Plant Scale1[W][OPEN] , 2013, Plant Physiology.

[38]  Daniel Machado,et al.  Systematic Evaluation of Methods for Integration of Transcriptomic Data into Constraint-Based Models of Metabolism , 2014, PLoS Comput. Biol..

[39]  Nikos Vlassis,et al.  Fast Reconstruction of Compact Context-Specific Metabolic Network Models , 2013, PLoS Comput. Biol..

[40]  Zoran Nikoloski,et al.  Generalized framework for context-specific metabolic model extraction methods , 2014, Front. Plant Sci..

[41]  J. Nielsen,et al.  Identification of anticancer drugs for hepatocellular carcinoma through personalized genome‐scale metabolic modeling , 2014, Molecular systems biology.

[42]  G. von Heijne,et al.  Tissue-based map of the human proteome , 2015, Science.

[43]  Mathias Uhlén,et al.  Charting the human proteome: Understanding disease using a tissue-based atlas , 2015 .