What Can Causal Networks Tell Us about Metabolic Pathways?

Graphical models describe the linear correlation structure of data and have been used to establish causal relationships among phenotypes in genetic mapping populations. Data are typically collected at a single point in time. Biological processes on the other hand are often non-linear and display time varying dynamics. The extent to which graphical models can recapitulate the architecture of an underlying biological processes is not well understood. We consider metabolic networks with known stoichiometry to address the fundamental question: “What can causal networks tell us about metabolic pathways?”. Using data from an Arabidopsis BaySha population and simulated data from dynamic models of pathway motifs, we assess our ability to reconstruct metabolic pathways using graphical models. Our results highlight the necessity of non-genetic residual biological variation for reliable inference. Recovery of the ordering within a pathway is possible, but should not be expected. Causal inference is sensitive to subtle patterns in the correlation structure that may be driven by a variety of factors, which may not emphasize the substrate-product relationship. We illustrate the effects of metabolic pathway architecture, epistasis and stochastic variation on correlation structure and graphical model-derived networks. We conclude that graphical models should be interpreted cautiously, especially if the implied causal relationships are to be used in the design of intervention strategies.

[1]  L. Avery,et al.  Ordering gene function: the interpretation of epistasis in regulatory hierarchies. , 1992, Trends in genetics : TIG.

[2]  J. Nielsen,et al.  Mathematical modelling of metabolism. , 2000, Current opinion in biotechnology.

[3]  Jingyuan Fu,et al.  Defining gene and QTL networks. , 2009, Current opinion in plant biology.

[4]  Barbara Ann Halkier,et al.  Biology and biochemistry of glucosinolates. , 2006, Annual review of plant biology.

[5]  Jonathan C. Mattingly,et al.  Long-Range Allosteric Interactions between the Folate and Methionine Cycles Stabilize DNA Methylation Reaction Rate , 2006, Epigenetics.

[6]  S. Abel,et al.  Glucosinolate metabolism and its control. , 2006, Trends in plant science.

[7]  Edward E Allen,et al.  Algebraic dependency models of protein signal transduction networks from time-series data. , 2006, Journal of theoretical biology.

[8]  Karl W. Broman,et al.  Comprar A Guide to QTL Mapping with R/qtl | Broman, Karl W. | 9780387921242 | Springer , 2009 .

[9]  Ron Korstanje,et al.  A Bayesian Framework for Inference of the Genotype–Phenotype Map for Segregating Populations , 2011, Genetics.

[10]  R. Doerge,et al.  Empirical threshold values for quantitative trait mapping. , 1994, Genetics.

[11]  K. Broman,et al.  A Guide to QTL Mapping with R/qtl , 2009 .

[12]  L. Solon,et al.  Cause and Effect in Biology Kinds of causes , predictability , and teleology are viewed by a practicing biologist , 2022 .

[13]  John D. Storey,et al.  Harnessing naturally randomized transcription to infer regulatory relationships among genes , 2007, Genome Biology.

[14]  D. Remington,et al.  Effects of Genetic and Environmental Factors on Trait Network Predictions From Quantitative Trait Locus Data , 2009, Genetics.

[15]  P. Swain,et al.  Gene Regulation at the Single-Cell Level , 2005, Science.

[16]  Bruce Tidor,et al.  Sloppy models, parameter uncertainty, and the role of experimental design. , 2010, Molecular bioSystems.

[17]  C. Moyes,et al.  The ecological genetics of aliphatic glucosinolates , 2001, Heredity.

[18]  Christopher R. Myers,et al.  Universally Sloppy Parameter Sensitivities in Systems Biology Models , 2007, PLoS Comput. Biol..

[19]  Yael Karshon,et al.  Exact Euler-Maclaurin formulas for simple lattice polytopes , 2007, Adv. Appl. Math..

[20]  Anna M. Lieb,et al.  The biological significance of substrate inhibition: A mechanism with diverse functions , 2010, BioEssays : news and reviews in molecular, cellular and developmental biology.

[21]  Daniel J. Kliebenstein,et al.  Linking Metabolic QTLs with Network and cis-eQTLs Controlling Biosynthetic Pathways , 2007, PLoS genetics.

[22]  Yang Li,et al.  Critical reasoning on causal inference in genome-wide linkage and association studies. , 2010, Trends in genetics : TIG.

[23]  Keith Shockley,et al.  Structural Model Analysis of Multiple Quantitative Traits , 2006, PLoS genetics.

[24]  B. Yandell,et al.  CAUSAL GRAPHICAL MODELS IN SYSTEMS GENETICS: A UNIFIED FRAMEWORK FOR JOINT INFERENCE OF CAUSAL NETWORK AND GENETIC ARCHITECTURE FOR CORRELATED PHENOTYPES. , 2010, The annals of applied statistics.

[25]  Moisés Santillán,et al.  On the Use of the Hill Functions in Mathematical Models of Gene Regulatory Networks , 2008 .

[26]  M. Rockman,et al.  Reverse engineering the genotype–phenotype map with natural genetic variation , 2008, Nature.

[27]  Rory A. Fisher,et al.  The Arrangement of Field Experiments , 1992 .

[28]  J. G. Salway,et al.  Metabolism at a glance , 1993 .

[29]  Liu Yang,et al.  Quantitative Epistasis Analysis and Pathway Inference from Genetic Interaction Data , 2011, PLoS Comput. Biol..

[30]  Stefan Schuster,et al.  Commemorating the 1913 Michaelis–Menten paper Die Kinetik der Invertinwirkung: three perspectives , 2014, The FEBS journal.

[31]  D. Fell Understanding the Control of Metabolism , 1996 .

[32]  Christopher R Myers,et al.  Extracting Falsifiable Predictions from Sloppy Models , 2007, Annals of the New York Academy of Sciences.

[33]  David F Anderson,et al.  Propagation of Fluctuations in Biochemical Systems, I: Linear SSC Networks , 2007, Bulletin of mathematical biology.

[34]  Desmond J. Higham,et al.  An Algorithmic Introduction to Numerical Simulation of Stochastic Differential Equations , 2001, SIAM Rev..

[35]  J. Finley,et al.  Cruciferous Vegetables: Cancer Protective Mechanisms of Glucosinolate Hydrolysis Products and Selenium , 2004, Integrative cancer therapies.

[36]  R. Laubenbacher,et al.  A computational algebra approach to the reverse engineering of gene regulatory networks. , 2003, Journal of theoretical biology.

[37]  Bjarne Gram Hansen,et al.  Biochemical Networks and Epistasis Shape the Arabidopsis thaliana Metabolome[W] , 2008, The Plant Cell Online.

[38]  Bill Shipley,et al.  Cause and Correlation in Biology: A User''s Guide to Path Analysis , 2016 .

[39]  B. Yandell,et al.  Inferring Causal Phenotype Networks From Segregating Populations , 2008, Genetics.

[40]  A. Keller,et al.  Model genetic circuits encoding autoregulatory transcription factors. , 1995, Journal of theoretical biology.

[41]  D A Day,et al.  Regulation of alternative pathway activity in plant mitochondria: nonlinear relationship between electron flux and the redox poise of the quinone pool. , 1989, Archives of biochemistry and biophysics.

[42]  Jason A. Papin,et al.  Determination of redundancy and systems properties of the metabolic network of Helicobacter pylori using genome-scale extreme pathway analysis. , 2002, Genome research.

[43]  Ronan M. T. Fleming,et al.  Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox v2.0 , 2007, Nature Protocols.

[44]  P. Schulze-Lefert,et al.  A Glucosinolate Metabolism Pathway in Living Plant Cells Mediates Broad-Spectrum Antifungal Defense , 2009, Science.

[45]  J. Castle,et al.  An integrative genomics approach to infer causal associations between gene expression and disease , 2005, Nature Genetics.

[46]  Jonathan C. Mattingly,et al.  Propagation of fluctuations in biochemical systems, II: Nonlinear chains. , 2007, IET systems biology.

[47]  Ritsert C. Jansen,et al.  Studying complex biological systems using multifactorial perturbation , 2003, Nature Reviews Genetics.

[48]  Abdul Salam Jarrah,et al.  Reverse-engineering of polynomial dynamical systems , 2007, Adv. Appl. Math..

[49]  R Laubenbacher,et al.  Reverse Engineering of Dynamic Networks , 2007, Annals of the New York Academy of Sciences.

[50]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[51]  David Kulp,et al.  Causal Inference of Regulator-Target Pairs by Gene Mapping of Expression Phenotypes , 2005, Systems Biology and Regulatory Genomics.

[52]  D Calvetti,et al.  Dynamic Bayesian sensitivity analysis of a myocardial metabolic model. , 2008, Mathematical biosciences.

[53]  Kamil Erguler,et al.  Practical limits for reverse engineering of dynamical systems: a statistical analysis of sensitivity and parameter inferability in systems biology models. , 2011, Molecular bioSystems.

[54]  Ronan M. T. Fleming,et al.  Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox v2.0 , 2007, Nature Protocols.