Bayesian models and meta analysis for multiple tissue gene expression data following corticosteroid administration

BackgroundThis paper addresses key biological problems and statistical issues in the analysis of large gene expression data sets that describe systemic temporal response cascades to therapeutic doses in multiple tissues such as liver, skeletal muscle, and kidney from the same animals. Affymetrix time course gene expression data U34A are obtained from three different tissues including kidney, liver and muscle. Our goal is not only to find the concordance of gene in different tissues, identify the common differentially expressed genes over time and also examine the reproducibility of the findings by integrating the results through meta analysis from multiple tissues in order to gain a significant increase in the power of detecting differentially expressed genes over time and to find the differential differences of three tissues responding to the drug.Results and conclusionBayesian categorical model for estimating the proportion of the 'call' are used for pre-screening genes. Hierarchical Bayesian Mixture Model is further developed for the identifications of differentially expressed genes across time and dynamic clusters. Deviance information criterion is applied to determine the number of components for model comparisons and selections. Bayesian mixture model produces the gene-specific posterior probability of differential/non-differential expression and the 95% credible interval, which is the basis for our further Bayesian meta-inference. Meta-analysis is performed in order to identify commonly expressed genes from multiple tissues that may serve as ideal targets for novel treatment strategies and to integrate the results across separate studies. We have found the common expressed genes in the three tissues. However, the up/down/no regulations of these common genes are different at different time points. Moreover, the most differentially expressed genes were found in the liver, then in kidney, and then in muscle.

[1]  D J Spiegelhalter,et al.  Bayesian approaches to random-effects meta-analysis: a comparative study. , 1995, Statistics in medicine.

[2]  Arpad Kelemen,et al.  Statistical advances and challenges for analyzing correlated high dimensional SNP data in genomic study for complex diseases , 2008, 0803.4065.

[3]  P. Müller,et al.  A Bayesian mixture model for differential gene expression , 2005 .

[4]  Wei Pan,et al.  A mixture model approach to detecting differentially expressed genes with microarray data , 2003, Functional & Integrative Genomics.

[5]  G. Smith,et al.  Meta-analysis: Potentials and promise , 1997, BMJ.

[6]  Alex Lewin,et al.  A mixture model-based strategy for selecting sets of genes in multiclass response microarray experiments , 2004, Bioinform..

[7]  Eric P Hoffman,et al.  In vivo multi-tissue corticosteroid microarray time series available online at Public Expression Profile Resource (PEPR). , 2003, Pharmacogenomics.

[8]  Debashis Ghosh,et al.  Statistical issues and methods for meta-analysis of microarray data: a case study in prostate cancer , 2003, Functional & Integrative Genomics.

[9]  Yulan Liang,et al.  Hierarchical Bayesian Neural Network for Gene Expression Temporal Patterns , 2004, Statistical applications in genetics and molecular biology.

[10]  Marina Vannucci,et al.  Variable selection in clustering via Dirichlet process mixture models , 2006 .

[11]  Göran Kauermann,et al.  Modeling Microarray Data Using a Threshold Mixture Model , 2004, Biometrics.

[12]  Andrew Thomas,et al.  WinBUGS - A Bayesian modelling framework: Concepts, structure, and extensibility , 2000, Stat. Comput..

[13]  Debashis Ghosh,et al.  Mixture models for assessing differential expression in complex tissues using microarray data , 2004, Bioinform..

[14]  Wenguang Sun,et al.  Oracle and Adaptive Compound Decision Rules for False Discovery Rate Control , 2007 .

[15]  Adrian E. Raftery,et al.  Model-Based Clustering, Discriminant Analysis, and Density Estimation , 2002 .

[16]  Anna Liu,et al.  Bayesian meta-analysis models for microarray data: a comparative study , 2007, BMC Bioinformatics.

[17]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[18]  Arpad Kelemen,et al.  Differential and trajectory methods for time course gene expression data , 2005, Bioinform..

[19]  Arpad Kelemen,et al.  Associating phenotypes with molecular events: recent statistical advances and challenges underpinning microarray experiments , 2005, Functional & Integrative Genomics.

[20]  Jeffrey E. Harris,et al.  Bayes Methods for Combining the Results of Cancer Studies in Humans and other Species , 1983 .

[21]  J. Chimka Categorical Data Analysis, Second Edition , 2003 .

[22]  Atul J. Butte,et al.  Reproducibility of gene expression across generations of Affymetrix microarrays , 2003, BMC Bioinformatics.

[23]  H. Zou,et al.  The doubly regularized support vector machine , 2006 .

[24]  George Davey Smith,et al.  Meta-analysis: Principles and procedures , 1997, BMJ.

[25]  A. Agresti Categorical data analysis , 1993 .

[26]  Andrew E. Teschendorff,et al.  A variational Bayesian mixture modelling framework for cluster analysis of gene-expression data , 2005, Bioinform..

[27]  Geoffrey J. McLachlan,et al.  A mixture model-based approach to the clustering of microarray expression data , 2002, Bioinform..

[28]  Wei Pan,et al.  A Parametric Joint Model of DNA-Protein Binding, Gene Expression and DNA Sequence Data to Detect Target Genes of a Transcription Factor , 2007, Pacific Symposium on Biocomputing.

[29]  Alan Agresti,et al.  Bayesian inference for categorical data analysis , 2005, Stat. Methods Appl..

[30]  Wenxuan Zhong,et al.  A data-driven clustering method for time course gene expression data , 2006, Nucleic acids research.

[31]  John D. Storey,et al.  Empirical Bayes Analysis of a Microarray Experiment , 2001 .

[32]  Jin Y. Jin,et al.  Modeling of Corticosteroid Pharmacogenomics in Rat Liver Using Gene Microarrays , 2003, Journal of Pharmacology and Experimental Therapeutics.

[33]  Weichung Joe Shih,et al.  A mixture model for estimating the local false discovery rate in DNA microarray analysis , 2004, Bioinform..

[34]  Bradley P. Carlin,et al.  Bayesian measures of model complexity and fit , 2002 .

[35]  J. Bailar The promise and problems of meta-analysis. , 1997, The New England journal of medicine.

[36]  Arpad Kelemen,et al.  Bayesian State Space Models for Inferring and Predicting Temporal Gene Expression Profiles , 2007, Biometrical journal. Biometrische Zeitschrift.

[37]  Hongzhe Li,et al.  Model-based methods for identifying periodically expressed genes based on time course microarray gene expression data , 2004, Bioinform..

[38]  Arpad Kelemen,et al.  Temporal gene expression classification with regularised neural network , 2005, Int. J. Bioinform. Res. Appl..

[39]  Mario Medvedovic,et al.  Bayesian infinite mixture model based clustering of gene expression profiles , 2002, Bioinform..

[40]  P. Brown,et al.  Large-scale meta-analysis of cancer microarray data identifies common transcriptional profiles of neoplastic transformation and progression. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[41]  Hongzhe Li,et al.  Clustering of time-course gene expression data using a mixed-effects model with B-splines , 2003, Bioinform..