Meta‐analysis and meta‐regression of transcriptomic responses to water stress in Arabidopsis

Summary The large amounts of transcriptome data available for Arabidopsis thaliana make a compelling case for the need to generalize results across studies and extract the most robust and meaningful information possible from them. The results of various studies seeking to identify water stress‐responsive genes only partially overlap. The aim of this work was to combine transcriptomic studies in a systematic way that identifies commonalities in response, taking into account variation among studies due to batch effects as well as sampling variation, while also identifying the effect of study‐specific variables, such as the method of applying water stress, and the part of the plant the mRNA was extracted from. We used meta‐analysis, the quantitative synthesis of independent research results, to summarize expression responses to water stress across studies, and meta‐regression to model the contribution of covariates that may affect gene expression. We found that some genes with small but consistent differential responses become evident only when results are synthesized across experiments, and are missed in individual studies. We also identified genes with expression responses that are attributable to use of different plant parts and alternative methods for inducing water stress. Our results indicate that meta‐analysis and meta‐regression provide a powerful approach for identifying a robust gene set that is less sensitive to idiosyncratic results and for quantifying study characteristics that result in contrasting gene expression responses across studies. Combining meta‐analysis with individual analyses may contribute to a richer understanding of the biology of water stress responses, and may prove valuable in other gene expression studies.

[1]  H. Ohta,et al.  Basic Helix-Loop-Helix Transcription Factors JASMONATE-ASSOCIATED MYC2-LIKE1 (JAM1), JAM2, and JAM3 Are Negative Regulators of Jasmonate Responses in Arabidopsis1[W][OPEN] , 2013, Plant Physiology.

[2]  S. P. Wright,et al.  Adjusted P-values for simultaneous inference , 1992 .

[3]  Rainer Breitling,et al.  RankProd: a bioconductor package for detecting differentially expressed genes in meta-analysis , 2006, Bioinform..

[4]  E. Bray Genes commonly regulated by water-deficit stress in Arabidopsis thaliana. , 2004, Journal of experimental botany.

[5]  Benjamin M. Bolstad,et al.  affy - analysis of Affymetrix GeneChip data at the probe level , 2004, Bioinform..

[6]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[7]  Kerrie Mengersen,et al.  Statistical Models and Approaches to Inference , 2013 .

[8]  E. Lander Array of hope , 1999, Nature Genetics.

[9]  Matthew J Christians,et al.  The eer5 mutation, which affects a novel proteasome-related subunit, indicates a prominent role for the COP9 signalosome in resetting the ethylene-signaling pathway in Arabidopsis. , 2008, The Plant journal : for cell and molecular biology.

[10]  Douglas G Altman,et al.  Key Issues in Conducting a Meta-Analysis of Gene Expression Microarray Datasets , 2008, PLoS medicine.

[11]  Wolfgang Viechtbauer,et al.  Conducting Meta-Analyses in R with the metafor Package , 2010 .

[12]  W. Ramakrishna,et al.  Genes and Co-Expression Modules Common to Drought and Bacterial Stress Responses in Arabidopsis and Rice , 2013, PloS one.

[13]  Christopher H. Schmid,et al.  Incorporating measurement error in the estimation of autoregressive models for longitudinal data , 1994 .

[14]  K. Akiyama,et al.  Monitoring the expression profiles of 7000 Arabidopsis genes under drought, cold and high-salinity stresses using a full-length cDNA microarray. , 2002, The Plant journal : for cell and molecular biology.

[15]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[16]  I. Holme Relationship between total mortality and cholesterol reduction as found by meta-regression analysis of randomized cholesterol-lowering trials. , 1996, Controlled clinical trials.

[17]  L. Hedges,et al.  Introduction to Meta‐Analysis , 2009, International Coaching Psychology Review.

[18]  Gordon K Smyth,et al.  Statistical Applications in Genetics and Molecular Biology Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments , 2011 .

[19]  A. Loraine,et al.  Identification of Cytokinin-Responsive Genes Using Microarray Meta-Analysis and RNA-Seq in Arabidopsis1[C][W][OA] , 2013, Plant Physiology.

[20]  Weida Tong,et al.  Reproducible and reliable microarray results through quality control: good laboratory proficiency and appropriate data analysis practices are essential. , 2008, Current opinion in biotechnology.

[21]  Zhou Du,et al.  agriGO: a GO analysis toolkit for the agricultural community , 2010, Nucleic Acids Res..

[22]  S. Thompson,et al.  How should meta‐regression analyses be undertaken and interpreted? , 2002, Statistics in medicine.

[23]  Tristan Mary-Huard,et al.  GEM2Net: from gene expression modeling to -omics networks, a new CATdb module to investigate Arabidopsis thaliana genes involved in stress response , 2014, Nucleic Acids Res..

[24]  Jessica Gurevitch,et al.  THE META‐ANALYSIS OF RESPONSE RATIOS IN EXPERIMENTAL ECOLOGY , 1999 .

[25]  Guowu Yu,et al.  Meta-analysis of Arabidopsis thaliana under abscisic acid and salt stress , 2011 .

[26]  L. Lai,et al.  Meta-Analysis of Gene Expression Signatures Reveals Hidden Links among Diverse Biological Processes in Arabidopsis , 2014, PloS one.

[27]  Gene E Robinson,et al.  Meta-analysis of genome-wide expression patterns associated with behavioral maturation in honey bees , 2008, BMC Genomics.

[28]  J. Sheen,et al.  Emerging connections in the ethylene signaling network. , 2009, Trends in plant science.

[29]  Gordon K. Smyth,et al.  limma: Linear Models for Microarray Data , 2005 .

[30]  Stephen W. Raudenbush,et al.  Analyzing effect sizes: Random-effects models. , 2009 .

[31]  R. Doerge,et al.  Statistical Design and Analysis of RNA Sequencing Data , 2010, Genetics.

[32]  Nicola J Cooper,et al.  A generalized weighting regression‐derived meta‐analysis estimator robust to small‐study effects and heterogeneity , 2012, Statistics in medicine.

[33]  Jessica Gurevitch,et al.  STATISTICAL ISSUES IN ECOLOGICAL META‐ANALYSES , 1999 .

[34]  G. Rücker,et al.  Heterogeneity and Meta-Regression , 2015 .

[35]  M. Salicrú,et al.  Statistical methods for the analysis of high-throughput data based on functional profiles derived from the Gene Ontology , 2007 .

[36]  N. Laird,et al.  Meta-analysis in clinical trials. , 1986, Controlled clinical trials.

[37]  Jessica Gurevitch,et al.  Effect sizes: Conventional choices and calculations , 2013 .

[38]  Miquel Salicru,et al.  Comparison of lists of genes based on functional profiles , 2011, BMC Bioinformatics.

[39]  Chunyu Liu,et al.  Removing Batch Effects in Analysis of Expression Microarray Data: An Evaluation of Six Batch Adjustment Methods , 2011, PloS one.

[40]  Xiao-Hua Zhou,et al.  Statistical Methods for Meta‐Analysis , 2008 .

[41]  K. Shinozaki,et al.  ABA-mediated transcriptional regulation in response to osmotic stress in plants , 2011, Journal of Plant Research.

[42]  O. Postnikova,et al.  Comparative analysis of microarray data in Arabidopsis transcriptome during compatible interactions with plant viruses , 2012, Virology Journal.

[43]  Nicholas J Provart,et al.  Genotype and time of day shape the Populus drought response. , 2009, The Plant journal : for cell and molecular biology.

[44]  John R. Stevens,et al.  Combining Affymetrix microarray results , 2005, BMC Bioinformatics.

[45]  P. Verslues,et al.  Unique Drought Resistance Functions of the Highly ABA-Induced Clade A Protein Phosphatase 2Cs1[W][OA] , 2012, Plant Physiology.

[46]  Sokbom Kang,et al.  Sentinel lymph node biopsy in endometrial cancer: meta-analysis of 26 studies. , 2011, Gynecologic oncology.

[47]  D. Schachtman,et al.  Chemical root to shoot signaling under drought. , 2008, Trends in plant science.

[48]  W. David Kelton,et al.  Statistical design and analysis , 1986, WSC '86.

[49]  Laura B. Sheard,et al.  Plant biology: Signal advance for abscisic acid , 2009, Nature.

[50]  J. Gurevitch,et al.  Research synthesis methods in ecology , 2015 .

[51]  Rafael A. Irizarry,et al.  A Model-Based Background Adjustment for Oligonucleotide Expression Arrays , 2004 .

[52]  G. Tseng,et al.  Comprehensive literature review and statistical considerations for microarray meta-analysis , 2012, Nucleic acids research.

[53]  G. Tseng,et al.  Comprehensive literature review and statistical considerations for GWAS meta-analysis , 2012, Nucleic acids research.

[54]  Sangsoo Kim,et al.  Combining multiple microarray studies and modeling interstudy variation , 2003, ISMB.

[55]  Elizabeth Tipton,et al.  Robust variance estimation in meta‐regression with dependent effect size estimates , 2010, Research synthesis methods.

[56]  Olivia W. Wilkins,et al.  Time of day shapes Arabidopsis drought transcriptomes. , 2010, The Plant journal : for cell and molecular biology.

[57]  Shinichi Nakagawa,et al.  Methodological issues and advances in biological meta-analysis , 2012, Evolutionary Ecology.

[58]  S. Elena,et al.  A Meta-Analysis Reveals the Commonalities and Differences in Arabidopsis thaliana Response to Different Viral Pathogens , 2012, PloS one.

[59]  M. Lajeunesse Bias and correction for the log response ratio in ecological meta-analysis. , 2015, Ecology.

[60]  Sung Chul Lee,et al.  Functional roles of the protein phosphatase 2C, AtAIP1, in abscisic acid signaling and sugar tolerance in Arabidopsis. , 2012, Plant science : an international journal of experimental plant biology.

[61]  Jessica Gurevitch,et al.  Uses and misuses of meta‐analysis in plant ecology , 2014 .

[62]  Rainer Breitling,et al.  A comparison of meta-analysis methods for detecting differentially expressed genes in microarray experiments , 2008, Bioinform..

[63]  Cheng Li,et al.  Adjusting batch effects in microarray expression data using empirical Bayes methods. , 2007, Biostatistics.