Multiplex methods provide effective integration of multi-omic data in genome-scale models

BackgroundGenomic, transcriptomic, and metabolic variations shape the complex adaptation landscape of bacteria to varying environmental conditions. Elucidating the genotype-phenotype relation paves the way for the prediction of such effects, but methods for characterizing the relationship between multiple environmental factors are still lacking. Here, we tackle the problem of extracting network-level information from collections of environmental conditions, by integrating the multiple omic levels at which the bacterial response is measured.ResultsTo this end, we model a large compendium of growth conditions as a multiplex network consisting of transcriptomic and fluxomic layers, and we propose a multi-omic network approach to infer similarity of growth conditions by integrating layers of the multiplex network. Each node of the network represents a single condition, while edges are similarities between conditions, as measured by phenotypic and transcriptomic properties on different layers of the network. We then fuse these layers into one network, therefore capturing a global network of conditions and the associated similarities across two omic levels. We apply this multi-omic fusion to an updated genome-scale reconstruction of Escherichia coli that includes underground metabolism and new gene-protein-reaction associations.ConclusionsOur method can be readily used to evaluate and cross-compare different collections of conditions among different species. Acquiring multi-omic information on the topology of the space of experimental conditions makes it possible to infer the position and to build condition-specific models of untested or incomplete profiles for which experimental data is not available. Our weighted network fusion method for genome-scale models is freely available at https://github.com/maxconway/SNFtool.

[1]  Richard A. Notebaart,et al.  Network-level architecture and the evolutionary potential of underground metabolism , 2014, Proceedings of the National Academy of Sciences.

[2]  Susanna Manrubia,et al.  toyLIFE: a computational framework to study the multi-level organisation of the genotype-phenotype map , 2014, Scientific Reports.

[3]  John Quackenbush,et al.  Variance of Gene Expression Identifies Altered Network Constraints in Neurological Disease , 2011, PLoS genetics.

[4]  E. Nishida,et al.  The MAP kinase cascade is essential for diverse signal transduction pathways. , 1993, Trends in biochemical sciences.

[5]  Claudio Angione,et al.  Predictive analytics of environmental adaptability in multi-omic network models , 2015, Scientific Reports.

[6]  Nathan Intrator,et al.  Systems biology and brain activity in neuronal pathways by smart device and advanced signal processing , 2014, Front. Genet..

[7]  Marc Barthelemy,et al.  Growing multiplex networks , 2013, Physical review letters.

[8]  Miguel Rocha,et al.  Transcript level and sequence determinants of protein abundance and noise in Escherichia coli , 2014, Nucleic acids research.

[9]  Daniel Machado,et al.  Systematic Evaluation of Methods for Integration of Transcriptomic Data into Constraint-Based Models of Metabolism , 2014, PLoS Comput. Biol..

[10]  R. Macfarlane An Enzyme Cascade in the Blood Clotting Mechanism, and its Function as a Biochemical Amplifier , 1964, Nature.

[11]  C. Maranas,et al.  Recent advances in the reconstruction of metabolic models and integration of omics data. , 2014, Current opinion in biotechnology.

[12]  B. Palsson Systems Biology: Constraint-based Reconstruction and Analysis , 2015 .

[13]  Pietro Liò,et al.  Network-based analysis of comorbidities risk during an infection: SARS and HIV case studies , 2014, BMC Bioinformatics.

[14]  V. Schachter,et al.  Genome-scale models of bacterial metabolism: reconstruction and applications , 2008, FEMS microbiology reviews.

[15]  Kazuyuki Shimizu,et al.  Metabolic flux analysis based on 13C-labeling experiments and integration of the information with gene and protein expression patterns. , 2004, Advances in biochemical engineering/biotechnology.

[16]  Kathleen Marchal,et al.  COLOMBOS v2.0: an ever expanding collection of bacterial expression compendia , 2013, Nucleic Acids Res..

[17]  K. Shimizu,et al.  Global metabolic regulation analysis for Escherichia coli K12 based on protein expression by 2-dimensional electrophoresis and enzyme activity measurement , 2003, Applied Microbiology and Biotechnology.

[18]  B. Palsson,et al.  Parallel adaptive evolution cultures of Escherichia coli lead to convergent growth phenotypes with different gene expression states. , 2005, Genome research.

[19]  Z. Wang,et al.  The structure and dynamics of multilayer networks , 2014, Physics Reports.

[20]  J. Collins,et al.  Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles , 2007, PLoS biology.

[21]  Elizabeth Brunk,et al.  Model-driven discovery of underground metabolic functions in Escherichia coli , 2015, Proceedings of the National Academy of Sciences.

[22]  James E. Ferrell,et al.  Feedback regulation of opposing enzymes generates robust, all-or-none bistable responses , 2008, Current Biology.

[23]  Naruemon Pratanwanich,et al.  A Hybrid of Metabolic Flux Analysis and Bayesian Factor Modeling for Multiomic Temporal Pathway Activation. , 2015, ACS synthetic biology.

[24]  Sabin Tabirca,et al.  Logarithmic Growth in Biological Processes , 2010, 2010 12th International Conference on Computer Modelling and Simulation.

[25]  Rainer Breitling,et al.  MultiMetEval: Comparative and Multi-Objective Analysis of Genome-Scale Metabolic Models , 2012, PloS one.

[26]  A. E. Hirsh,et al.  Evolutionary Rate in the Protein Interaction Network , 2002, Science.

[27]  Zachary A. King,et al.  Constraint-based models predict metabolic and associated cellular functions , 2014, Nature Reviews Genetics.

[28]  Adam B. Olshen,et al.  Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis , 2009, Bioinform..

[29]  Pedro Mendes,et al.  An in vivo control map for the eukaryotic mRNA translation machinery , 2013, Molecular systems biology.

[30]  Adam M. Feist,et al.  A comprehensive genome-scale reconstruction of Escherichia coli metabolism—2011 , 2011, Molecular systems biology.

[31]  Bonnie Berger,et al.  An exact arithmetic toolbox for a consistent and reproducible structural analysis of metabolic network models , 2014, Nature Communications.

[32]  Pietro Liò,et al.  Pareto Optimality in Organelle Energy Metabolism Analysis , 2013, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[33]  Ernesto Estrada,et al.  Communicability reveals a transition to coordinated behavior in multiplex networks , 2013, Physical review. E, Statistical, nonlinear, and soft matter physics.

[34]  B. Fridley,et al.  Integrative clustering methods for high-dimensional molecular data. , 2014, Translational cancer research.

[35]  B. Ripley,et al.  Recursive Partitioning and Regression Trees , 2015 .

[36]  Giuseppe Nicosia,et al.  A design automation framework for computational bioenergetics in biological networks. , 2013, Molecular bioSystems.

[37]  Zhuowen Tu,et al.  Similarity network fusion for aggregating data types on a genomic scale , 2014, Nature Methods.

[38]  U. Sauer,et al.  Multidimensional Optimality of Microbial Metabolism , 2012, Science.

[39]  Pietro Liò,et al.  Analysis and design of molecular machines , 2015, Theor. Comput. Sci..

[40]  Madeleine Udell,et al.  Incorporation of flexible objectives and time-linked simulation with flux balance analysis. , 2014, Journal of theoretical biology.

[41]  C. Pál,et al.  Highly expressed genes in yeast evolve slowly. , 2001, Genetics.

[42]  David W. Erickson,et al.  Quantitative proteomic analysis reveals a simple strategy of global resource allocation in bacteria , 2015, Molecular systems biology.