TMEA: A Thermodynamically Motivated Framework for Functional Characterization of Biological Responses to System Acclimation

The objective of gene set enrichment analysis (GSEA) in modern biological studies is to identify functional profiles in huge sets of biomolecules generated by high-throughput measurements of genes, transcripts, metabolites, and proteins. GSEA is based on a two-stage process using classical statistical analysis to score the input data and subsequent testing for overrepresentation of the enrichment score within a given functional coherent set. However, enrichment scores computed by different methods are merely statistically motivated and often elusive to direct biological interpretation. Here, we propose a novel approach, called Thermodynamically Motivated Enrichment Analysis (TMEA), to account for the energy investment in biological relevant processes. Therefore, TMEA is based on surprisal analysis, which offers a thermodynamic-free energy-based representation of the biological steady state and of the biological change. The contribution of each biomolecule underlying the changes in free energy is used in a Monte Carlo resampling procedure resulting in a functional characterization directly coupled to the thermodynamic characterization of biological responses to system perturbations. To illustrate the utility of our method on real experimental data, we benchmark our approach on plant acclimation to high light and compare the performance of TMEA with the most frequently used method for GSEA.

[1]  F. Remacle,et al.  Metabolomic analysis of the green microalga Chlamydomonas reinhardtii cultivated under day/night conditions. , 2015, Journal of biotechnology.

[2]  Itamar Procaccia,et al.  Potential work: A statistical‐mechanical approach for systems in disequilibrium , 1976 .

[3]  Marcel J. T. Reinders,et al.  Fewer permutations, more accurate P-values , 2009, Bioinform..

[4]  R. Mittler,et al.  Identification and characterization of a core set of ROS wave‐associated transcripts involved in the systemic acquired acclimation response of Arabidopsis to excess light , 2019, The Plant journal : for cell and molecular biology.

[5]  Stanley N Cohen,et al.  Effects of threshold choice on biological conclusions reached during analysis of gene expression by DNA microarrays. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Timo Mühlhaus,et al.  Systems-Wide Analysis of Acclimation Responses to Long-Term Heat Stress and Recovery in the Photosynthetic Model Organism Chlamydomonas reinhardtii[W][OPEN] , 2014, Plant Cell.

[7]  David Haussler,et al.  Inference of patient-specific pathway activities from multi-dimensional cancer genomics data using PARADIGM , 2010, Bioinform..

[8]  Magdalena Trojak,et al.  Role of anthocyanins in highlight stress response , 2017 .

[9]  Chris T. A. Evelo,et al.  WikiPathways: building research communities on biological pathways , 2011, Nucleic Acids Res..

[10]  N. Slavov,et al.  Ribosome Stoichiometry: From Form to Function. , 2018, Trends in biochemical sciences.

[11]  Roberto Romero,et al.  A Comparison of Gene Set Analysis Methods in Terms of Sensitivity, Prioritization and Specificity , 2013, PloS one.

[12]  May D. Wang,et al.  GoMiner: a resource for biological interpretation of genomic and proteomic data , 2003, Genome Biology.

[13]  C. Chapple,et al.  The Phenylpropanoid Pathway in Arabidopsis , 2011, The arabidopsis book.

[14]  Andrew Young,et al.  OntologyTraverser: an R package for GO analysis , 2005, Bioinform..

[15]  S. Iuchi,et al.  The Responses of Arabidopsis Early Light-Induced Protein2 to Ultraviolet B, High Light, and Cold Stress Are Regulated by a Transcriptional Regulatory Unit Composed of Two Elements1[OPEN] , 2015, Plant Physiology.

[16]  Nandini Raghavan,et al.  On Methods for Gene Function Scoring as a Means of Facilitating the Interpretation of Microarray Results , 2006, J. Comput. Biol..

[17]  R. Levine,et al.  Surprisal Analysis of Transcripts Expression Levels in the Presence of Noise: A Reliable Determination of the Onset of a Tumor Phenotype , 2013, PloS one.

[18]  Léon Personnaz,et al.  Enrichment or depletion of a GO category within a class of genes: which test? , 2007, Bioinform..

[19]  Vassilios Ioannidis,et al.  Avoiding the pitfalls of gene set enrichment analysis with SetRank , 2017, BMC Bioinformatics.

[20]  W. Chow,et al.  Protective effect of supplemental anthocyanins on Arabidopsis leaves under high light. , 2010, Physiologia plantarum.

[21]  D. Hincha,et al.  Plant Temperature Acclimation and Growth Rely on Cytosolic Ribosome Biogenesis Factor Homologs1[OPEN] , 2018, Plant Physiology.

[22]  W. Wong,et al.  GoSurfer: a graphical interactive tool for comparative analysis of large gene sets in Gene Ontology space. , 2004, Applied bioinformatics.

[23]  F. Remacle,et al.  Information-theoretic analysis of phenotype changes in early stages of carcinogenesis , 2010, Proceedings of the National Academy of Sciences.

[24]  Michael E Phelps,et al.  Surprisal analysis characterizes the free energy time course of cancer cells undergoing epithelial-to-mesenchymal transition , 2014, Proceedings of the National Academy of Sciences.

[25]  Min-Sung Kim,et al.  COFECO: composite function annotation enriched by protein complex data , 2009, Nucleic Acids Res..

[26]  M. West,et al.  Bayesian Modeling for Biological Pathway Annotation of Genomic Signatures , 2008 .

[27]  I. Simon,et al.  A probabilistic generative model for GO enrichment analysis , 2008, Nucleic acids research.

[28]  Mark Stitt,et al.  Nitrogen-Sparing Mechanisms in Chlamydomonas Affect the Transcriptome, the Proteome, and Photosynthetic Metabolism[W] , 2014, Plant Cell.

[29]  R. Jennings,et al.  Suppression of Both ELIP1 and ELIP2 in Arabidopsis Does Not Affect Tolerance to Photoinhibition and Photooxidative Stress1 , 2006, Plant Physiology.

[30]  Bing Zhang,et al.  GOTree Machine (GOTM): a web-based platform for interpreting sets of interesting genes using Gene Ontology hierarchies , 2004, BMC Bioinformatics.

[31]  M. Eisenstein From form to function. , 2015, Nature methods.

[32]  K. Kloppstech,et al.  The protective functions of carotenoid and flavonoid pigments against excess visible radiation at chilling temperature investigated in Arabidopsisnpq and tt mutants , 2001, Planta.

[33]  Nicolae Moise,et al.  Early light-induced proteins protect Arabidopsis from photooxidative stress , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[34]  Qi Liu,et al.  Improving gene set analysis of microarray data by SAM-GS , 2007, BMC Bioinformatics.

[35]  Melvin E Andersen,et al.  Adaptive Posttranslational Control in Cellular Stress Response Pathways and Its Relationship to Toxicity Testing and Safety Assessment. , 2015, Toxicological sciences : an official journal of the Society of Toxicology.

[36]  P. Giavalisco,et al.  Molecular signatures associated with increased freezing tolerance due to low temperature memory in Arabidopsis. , 2018, Plant, cell & environment.

[37]  Zhen Su,et al.  EasyGO: Gene Ontology-based annotation and functional enrichment analysis tool for agronomical species , 2007, BMC Genomics.

[38]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[39]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[40]  Anthony M. Bolger,et al.  Metabolic Adaptation, a Specialized Leaf Organ Structure and Vascular Responses to Diurnal N2 Fixation by Nostoc azollae Sustain the Astonishing Productivity of Azolla Ferns without Nitrogen Fertilizer , 2017, Front. Plant Sci..

[41]  Martin Kuiper,et al.  BiNGO: a Cytoscape plugin to assess overrepresentation of Gene Ontology categories in Biological Networks , 2005, Bioinform..

[42]  Françoise Remacle,et al.  Multivariate Surprisal Analysis of Gene Expression Levels , 2016, Entropy.

[43]  Yixiong Tang,et al.  LNK1 and LNK2 Corepressors Interact with the MYB3 Transcription Factor in Phenylpropanoid Biosynthesis1 , 2017, Plant Physiology.

[44]  Yan Lv,et al.  Impact of Climate Change on Crops Adaptation and Strategies to Tackle Its Outcome: A Review , 2019, Plants.

[45]  M. Lehmann,et al.  Translational Components Contribute to Acclimation Responses to High Light, Heat, and Cold in Arabidopsis , 2020, iScience.

[46]  Peter N. Robinson,et al.  GOing Bayesian: model-based gene set analysis of genome-scale data , 2010, Nucleic acids research.

[47]  Jean-Daniel Zucker,et al.  FunNet: an integrative tool for exploring transcriptional interactions , 2008, Bioinform..

[48]  K. Paszkiewicz,et al.  The influence of ascorbate on anthocyanin accumulation during high light acclimation in Arabidopsis thaliana: further evidence for redox control of anthocyanin synthesis. , 2012, Plant, cell & environment.

[49]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[50]  S. Rhee,et al.  MAPMAN: a user-driven tool to display genomics data sets onto diagrams of metabolic pathways and other biological processes. , 2004, The Plant journal : for cell and molecular biology.

[51]  A Agresti,et al.  On Small‐Sample Confidence Intervals for Parameters in Discrete Distributions , 2001, Biometrics.

[52]  Varda Rotter,et al.  Convergence of Logic of Cellular Regulation in Different Premalignant Cells by an Information Theoretic Approach , 2011, BMC Systems Biology.

[53]  L. Nussaume,et al.  The light stress-induced protein ELIP2 is a regulator of chlorophyll synthesis in Arabidopsis thaliana. , 2007, The Plant journal : for cell and molecular biology.

[54]  Jason H. Moore,et al.  Spectral gene set enrichment (SGSE) , 2015, BMC Bioinformatics.

[55]  R. D. Levine,et al.  INFORMATION THEORY APPROACH TO MOLECULAR REACTION DYNAMICS , 1978 .

[56]  C. Ouzounis,et al.  Expansion of the BioCyc collection of pathway/genome databases to 160 genomes , 2005, Nucleic acids research.

[57]  Joaquín Dopazo,et al.  FatiGO: a web tool for finding significant associations of Gene Ontology terms with groups of genes , 2004, Bioinform..

[58]  Naser A. Anjum Plant acclimation to environmental stress: a critical appraisal , 2015, Front. Plant Sci..

[59]  F. Remacle,et al.  Metabolic, Physiological, and Transcriptomics Analysis of Batch Cultures of the Green Microalga Chlamydomonas Grown on Different Acetate Concentrations , 2019, Cells.

[60]  G. Coruzzi,et al.  Achieving the in Silico Plant. Systems Biology and the Future of Plant Biological Research , 2003, Plant Physiology.

[61]  S. Komatsu,et al.  Impact of Post-Translational Modifications of Crop Proteins under Abiotic Stress , 2016, Proteomes.

[62]  A. Weber,et al.  Transcriptional response of the extremophile red alga Cyanidioschyzon merolae to changes in CO2 concentrations. , 2017, Journal of plant physiology.

[63]  J B Harborne,et al.  Anthocyanins and other flavonoids. , 2001, Natural product reports.

[64]  C. Testerink,et al.  How Plants Sense and Respond to Stressful Environments1[OPEN] , 2020, Plant Physiology.

[65]  C. Brownlee,et al.  Communicating with Calcium , 1999, Plant Cell.

[66]  W. Weckwerth,et al.  Systemic Cold Stress Adaptation of Chlamydomonas reinhardtii* , 2013, Molecular & Cellular Proteomics.

[67]  R. Levine,et al.  Free energy rhythms in Saccharomyces cerevisiae: a dynamic perspective with implications for ribosomal biogenesis. , 2013, Biochemistry.

[68]  R. Levine,et al.  An Algorithm for Finding the Distribution of Maximal Entropy , 1979 .

[69]  Jonathan D. G. Jones,et al.  Role of plant hormones in plant defence responses , 2009, Plant Molecular Biology.

[70]  Steven R. Eichten,et al.  Excess Light Priming in Arabidopsis thaliana Genotypes with Altered DNA Methylomes , 2019, G3: Genes, Genomes, Genetics.

[71]  G. Krouk,et al.  Update on Systems View of Responses to Nutritional Cues in Arabidopsis A Systems View of Responses to Nutritional Cues in Arabidopsis: Toward a Paradigm Shift for Predictive Network Modeling , 2010 .

[72]  Björn Nilsson,et al.  Threshold-free high-power methods for the ontological analysis of genome-wide gene-expression studies , 2007, Genome Biology.

[73]  R. Levine,et al.  Surprisal analysis of genome-wide transcript profiling identifies differentially expressed genes and pathways associated with four growth conditions in the microalga Chlamydomonas , 2018, PloS one.

[74]  Pawel Herzyk,et al.  A UV-B-specific signaling component orchestrates plant UV protection. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[75]  H. Neufeld,et al.  Why some stems are red: cauline anthocyanins shield photosystem II against high light stress , 2010, Journal of experimental botany.

[76]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[77]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[78]  B. Mueller‐Roeber,et al.  Systems Analysis of the Response of Photosynthesis, Metabolism, and Growth to an Increase in Irradiance in the Photosynthetic Model Organism Chlamydomonas reinhardtii[C][W][OPEN] , 2014, Plant Cell.