StanDep: Capturing transcriptomic variability improves context-specific metabolic models

Diverse algorithms can integrate transcriptomics with genome-scale metabolic models (GEMs) to build context-specific metabolic models. These algorithms require identification of a list of high confidence (core) reactions from transcriptomics, but parameters related to identification of core reactions, such as thresholding of expression profiles, can significantly change model content. Importantly, current thresholding approaches are burdened with setting singular arbitrary thresholds for all genes; thus, resulting in removal of enzymes needed in small amounts and even many housekeeping genes. Here, we describe StanDep, a novel heuristic method for using transcriptomics to identify core reactions prior to building context-specific metabolic models. StanDep clusters gene expression data based on their expression pattern across different contexts and determines thresholds for each cluster using data-dependent statistics, specifically standard deviation and mean. To demonstrate the use of StanDep, we built hundreds of models for the NCI-60 cancer cell lines. These models successfully increased the inclusion of housekeeping reactions, which are often lost in models built using standard thresholding approaches. Further, StanDep also provided a transcriptomic explanation for inclusion of lowly expressed reactions that were otherwise only supported by model extraction methods. Our study also provides novel insights into how cells may deal with context-specific and ubiquitous functions. StanDep, as a MATLAB toolbox, is available at https://github.com/LewisLabUCSD/StanDep

[1]  Min Kyung Kim,et al.  Methods for integration of transcriptomic data in genome-scale metabolic models , 2014, Computational and structural biotechnology journal.

[2]  Nathan D. Price,et al.  Reconstruction of genome-scale metabolic models for 126 human tissues using mCADRE , 2012, BMC Systems Biology.

[3]  T. Golub,et al.  Genomic Copy Number Dictates a Gene-Independent Cell Response to CRISPR/Cas9 Targeting. , 2016, Cancer discovery.

[4]  B. Palsson,et al.  Large-scale in silico modeling of metabolic interactions between cell types in the human brain , 2010, Nature Biotechnology.

[5]  A. Barabasi,et al.  Blueprint for antimicrobial hit discovery targeting metabolic networks , 2010, Proceedings of the National Academy of Sciences.

[6]  B. Wolf Biotinidase deficiency: “if you have to have an inherited metabolic disease, this is the one to have” , 2012, Genetics in Medicine.

[7]  Thomas Sauter,et al.  Benchmarking Procedures for High-Throughput Context Specific Reconstruction Algorithms , 2016, Front. Physiol..

[8]  Mark D. Biggin,et al.  Statistics requantitates the central dogma , 2015, Science.

[9]  Dmitri D. Pervouchine,et al.  Gene-specific patterns of expression variation across organs and species , 2016, Genome Biology.

[10]  Natapol Pornputtapong,et al.  Reconstruction of Genome-Scale Active Metabolic Networks for 69 Human Cell Types and 16 Cancer Types Using INIT , 2012, PLoS Comput. Biol..

[11]  E. Ruppin,et al.  Computational reconstruction of tissue-specific metabolic models: application to human liver metabolism , 2010, Molecular systems biology.

[12]  Christian Jungreuthmayer,et al.  Elementary flux modes in a nutshell: properties, calculation and applications. , 2013, Biotechnology journal.

[13]  Anne Richelle,et al.  A Systematic Evaluation of Methods for Tailoring Genome-Scale Metabolic Models. , 2017, Cell systems.

[14]  D. Kell,et al.  GeneGini: Assessment via the Gini Coefficient of Reference “Housekeeping” Genes and Diverse Human Transporter Expression Profiles , 2018, Cell systems.

[15]  Nathan E Lewis,et al.  Analysis of omics data with genome-scale models of metabolism. , 2013, Molecular bioSystems.

[16]  D. Kell,et al.  The role and robustness of the Gini coefficient as an unbiased tool for the selection of Gini genes for normalising expression profiling data , 2019, Scientific Reports.

[17]  Ann E. Sizemore,et al.  Computational correction of copy-number effect improves specificity of CRISPR-Cas9 essentiality screens in cancer cells , 2017, Nature Genetics.

[18]  M. Uhlén,et al.  Genome-scale metabolic modelling of hepatocytes reveals serine deficiency in patients with non-alcoholic fatty liver disease , 2014, Nature Communications.

[19]  Eytan Ruppin,et al.  iMAT: an integrative metabolic analysis tool , 2010, Bioinform..

[20]  G. Netto,et al.  Profiling the expression pattern of GPI transamidase complex subunits in human cancer , 2008, Modern Pathology.

[21]  D. Kilburn,et al.  The energetics of mammalian cell growth. , 1969, Journal of cell science.

[22]  P. Abraham,et al.  An update on diagnostic value of biotinidase: From liver damage tocancer: Minireview. , 2013 .

[23]  K. Kohn,et al.  CellMiner: a web-based suite of genomic and pharmacologic tools to explore transcript and drug patterns in the NCI-60 cell line set. , 2012, Cancer research.

[24]  Andrew C. Adey,et al.  Single-Cell Transcriptional Profiling of a Multicellular Organism , 2017 .

[25]  V. Mootha,et al.  Metabolite Profiling Identifies a Key Role for Glycine in Rapid Cancer Cell Proliferation , 2012, Science.

[26]  E. Levanon,et al.  Human housekeeping genes, revisited. , 2013, Trends in genetics : TIG.

[27]  William C Reinhold,et al.  CellMiner: a relational database and query tool for the NCI-60 cancer cell lines , 2009, BMC Genomics.

[28]  Neil Swainston,et al.  Recon 2.2: from reconstruction to model of human metabolism , 2016, Metabolomics.

[29]  Jens Nielsen,et al.  Global transcriptional response of Saccharomyces cerevisiae to the deletion of SDH3 , 2009, BMC Systems Biology.

[30]  William S York,et al.  Regulation of Glycan Structures in Animal Tissues , 2008, Journal of Biological Chemistry.

[31]  Amina A. Qutub,et al.  Reconstruction of Tissue-Specific Metabolic Networks Using CORDA , 2016, PLoS Comput. Biol..

[32]  S. Orkin,et al.  Mapping the Mouse Cell Atlas by Microwell-Seq , 2018, Cell.

[33]  Thomas D. Wu,et al.  A comprehensive transcriptional portrait of human cancer cell lines , 2014, Nature Biotechnology.

[34]  Anne Richelle,et al.  Assessing key decisions for transcriptomic data integration in biochemical networks , 2019, PLoS Comput. Biol..

[35]  Naoyuki Taniguchi,et al.  Glycans and cancer: role of N-glycans in cancer biomarker, progression and metastasis, and therapeutics. , 2015, Advances in cancer research.

[36]  G. von Heijne,et al.  Tissue-based map of the human proteome , 2015, Science.

[37]  Nathan E. Lewis,et al.  The evolution of genome-scale models of cancer metabolism , 2013, Front. Physiol..

[38]  D. Noh,et al.  Differential profiling of breast cancer plasma proteome by isotope-coded affinity tagging method reveals biotinidase as a breast cancer biomarker , 2010, BMC Cancer.

[39]  P. Reddien,et al.  Fundamentals of planarian regeneration. , 2004, Annual review of cell and developmental biology.

[40]  Genetic deficiency in neuronal peroxisomal fatty acid β-oxidation causes the interruption of dauer development in Caenorhabditis elegans , 2017, Scientific Reports.

[41]  C. Ponting,et al.  Elevated rates of protein secretion, evolution, and disease among tissue-specific genes. , 2003, Genome research.

[42]  Daniel Machado,et al.  Systematic Evaluation of Methods for Integration of Transcriptomic Data into Constraint-Based Models of Metabolism , 2014, PLoS Comput. Biol..

[43]  Anne Richelle,et al.  Increasing consensus of context-specific metabolic models by integrating data-inferred cell functions , 2018, bioRxiv.

[44]  Nikos Vlassis,et al.  Fast Reconstruction of Compact Context-Specific Metabolic Network Models , 2013, PLoS Comput. Biol..

[45]  R. Ralhan,et al.  Biotinidase is a Novel Marker for Papillary Thyroid Cancer Aggressiveness , 2012, PloS one.

[46]  E. Levanon,et al.  Human housekeeping genes are compact. , 2003, Trends in genetics : TIG.

[47]  Bernhard O. Palsson,et al.  Context-Specific Metabolic Networks Are Consistent with Experiments , 2008, PLoS Comput. Biol..

[48]  Dong-Yan Jin,et al.  N-linked glycosylation is required for optimal proteolytic activation of membrane-bound transcription factor CREB-H , 2010, Journal of Cell Science.

[49]  R. Dwek,et al.  Biological importance of glycosylation. , 1998, Developments in biological standardization.

[50]  Michael D. Wilson,et al.  The Evolutionary Landscape of Alternative Splicing in Vertebrate Species , 2012, Science.

[51]  Zoran Nikoloski,et al.  Generalized framework for context-specific metabolic model extraction methods , 2014, Front. Plant Sci..

[52]  Anne Richelle,et al.  Assessing key decisions for transcriptomic data integration in biochemical networks , 2018, bioRxiv.

[53]  Y. Dong,et al.  Systematic functional analysis of the Caenorhabditis elegans genome using RNAi , 2003, Nature.

[54]  Xin Huang,et al.  Overexpression of glycosylphosphatidylinositol (GPI) transamidase subunits phosphatidylinositol glycan class T and/or GPI anchor attachment 1 induces tumorigenesis and contributes to invasion in human breast cancer. , 2006, Cancer research.

[55]  Albertha J. M. Walhout,et al.  A Caenorhabditis elegans Genome-Scale Metabolic Network Model. , 2016, Cell systems.

[56]  Anne Richelle,et al.  Increasing consensus of context-specific metabolic models by integrating data-inferred cell functions , 2018, bioRxiv.

[57]  Meagan E. Sullender,et al.  Optimized sgRNA design to maximize activity and minimize off-target effects of CRISPR-Cas9 , 2015, Nature Biotechnology.

[58]  Francisco J. Planes,et al.  Assessment of FBA Based Gene Essentiality Analysis in Cancer with a Fast Context-Specific Network Reconstruction Method , 2016, PloS one.

[59]  T. Conway,et al.  Quantitative bacterial transcriptomics with RNA-seq. , 2015, Current opinion in microbiology.