Validation of differential gene expression algorithms: Application comparing fold-change estimation to hypothesis testing

BackgroundSustained research on the problem of determining which genes are differentially expressed on the basis of microarray data has yielded a plethora of statistical algorithms, each justified by theory, simulation, or ad hoc validation and yet differing in practical results from equally justified algorithms. Recently, a concordance method that measures agreement among gene lists have been introduced to assess various aspects of differential gene expression detection. This method has the advantage of basing its assessment solely on the results of real data analyses, but as it requires examining gene lists of given sizes, it may be unstable.ResultsTwo methodologies for assessing predictive error are described: a cross-validation method and a posterior predictive method. As a nonparametric method of estimating prediction error from observed expression levels, cross validation provides an empirical approach to assessing algorithms for detecting differential gene expression that is fully justified for large numbers of biological replicates. Because it leverages the knowledge that only a small portion of genes are differentially expressed, the posterior predictive method is expected to provide more reliable estimates of algorithm performance, allaying concerns about limited biological replication. In practice, the posterior predictive method can assess when its approximations are valid and when they are inaccurate. Under conditions in which its approximations are valid, it corroborates the results of cross validation. Both comparison methodologies are applicable to both single-channel and dual-channel microarrays. For the data sets considered, estimating prediction error by cross validation demonstrates that empirical Bayes methods based on hierarchical models tend to outperform algorithms based on selecting genes by their fold changes or by non-hierarchical model-selection criteria. (The latter two approaches have comparable performance.) The posterior predictive assessment corroborates these findings.ConclusionsAlgorithms for detecting differential gene expression may be compared by estimating each algorithm's error in predicting expression ratios, whether such ratios are defined across microarray channels or between two independent groups.According to two distinct estimators of prediction error, algorithms using hierarchical models outperform the other algorithms of the study. The fact that fold-change shrinkage performed as well as conventional model selection criteria calls for investigating algorithms that combine the strengths of significance testing and fold-change estimation.

[1]  M. J. van der Laan,et al.  A Comparison of Methods to Control Type I Errors in Microarray Studies , 2007, Statistical applications in genetics and molecular biology.

[2]  D. Bickel The Strength of Statistical Evidence for Composite Hypotheses with an Application to Multiple Comparisons , 2008 .

[3]  Stephen C. Harris,et al.  Rat toxicogenomic study reveals analytical consistency across microarray platforms , 2006, Nature Biotechnology.

[4]  David R. Bickel,et al.  Degrees of differential gene expression: detecting biologically significant expression differences and estimating their magnitudes , 2004, Bioinform..

[5]  S Richardson,et al.  Tail posterior probability for inference in pairwise and multiclass gene expression data. , 2007, Biometrics.

[6]  Jean-Jacques Daudin,et al.  Correction: Determination of the differentially expressed genes in microarray experiments using local FDR , 2005, BMC Bioinformatics.

[7]  David R. Bickel,et al.  On a fast, robust estimator of the mode: Comparisons to other robust estimators with applications , 2005, Comput. Stat. Data Anal..

[8]  James J. Chen,et al.  Reproducibility of microarray data: a further analysis of microarray quality control (MAQC) data , 2007, BMC Bioinformatics.

[9]  B. Efron Size, power and false discovery rates , 2007, 0710.2245.

[10]  Gordon K. Smyth,et al.  Testing significance relative to a fold-change threshold is a TREAT , 2009, Bioinform..

[11]  Jean-Jacques Daudin,et al.  Determination of the differentially expressed genes in microarray experiments using local FDR , 2004, BMC Bioinformatics.

[12]  D. Lindley,et al.  Bayesian statistics 7 : proceedings of the Seventh Valencia International Meeting, dedicated to Dennis V. Lindley, June 2-6, 2002 , 2003 .

[13]  B. Efron Large-Scale Simultaneous Hypothesis Testing , 2004 .

[14]  D. Allison,et al.  Microarray data analysis: from disarray to consolidation and consensus , 2006, Nature Reviews Genetics.

[15]  D. Parkinson,et al.  Bayesian Methods in Cosmology: Model selection and multi-model inference , 2009 .

[16]  L. Qin,et al.  Empirical evaluation of data transformations and ranking statistics for microarray analysis. , 2004, Nucleic acids research.

[17]  Geoffrey J. McLachlan,et al.  Using mixture models to detect differentially expressed genes , 2005 .

[18]  John D. Storey,et al.  Empirical Bayes Analysis of a Microarray Experiment , 2001 .

[19]  Gordon K Smyth,et al.  Statistical Applications in Genetics and Molecular Biology Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments , 2011 .

[20]  B. Efron The Estimation of Prediction Error , 2004 .

[21]  D. Bickel,et al.  Shrinkage Estimation of Expression Fold Change As an Alternative to Testing Hypotheses of Equivalent Expression , 2009 .

[22]  Yudi Pawitan,et al.  Multidimensional local false discovery rate for microarray studies , 2006, Bioinform..

[23]  Roger Sauter,et al.  In All Likelihood , 2002, Technometrics.

[24]  Bradley Efron,et al.  Scales of Evidence for Model Selection: Fisher versus Jeffreys , 2001 .

[25]  S. Dudoit,et al.  Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data , 2002 .

[26]  Pia Veldt Larsen,et al.  In All Likelihood: Statistical Modelling and Inference Using Likelihood , 2003 .

[27]  R. Irizarry,et al.  Consolidated strategy for the analysis of microarray spike-in data , 2008, Nucleic acids research.

[28]  Stan Pounds,et al.  Estimating the Occurrence of False Positives and False Negatives in Microarray Studies by Approximating and Partitioning the Empirical Distribution of P-values , 2003, Bioinform..

[29]  David R. Anderson,et al.  Bayesian Methods in Cosmology: Model selection and multi-model inference , 2009 .

[30]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[31]  Robert Nadon,et al.  Comparison of small n statistical tests of differential expression applied to microarrays , 2009, BMC Bioinformatics.

[32]  Gary M Hellmann,et al.  Confirming microarray data—is it really necessary? , 2003, Genomics.

[33]  G. Martin,et al.  Transcriptome and Selected Metabolite Analyses Reveal Multiple Points of Ethylene Control during Tomato Fruit Developmentw⃞ , 2005, The Plant Cell Online.

[34]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[35]  M. Schummer,et al.  Selecting Differentially Expressed Genes from Microarray Experiments , 2003, Biometrics.

[36]  J J Chen,et al.  Selection of differentially expressed genes in microarray data analysis , 2007, The Pharmacogenomics Journal.

[37]  J. Q. Smith,et al.  1. Bayesian Statistics 4 , 1993 .

[38]  D. Bickel Correcting the Estimated Level of Differential Expression for Gene Selection Bias: Application to a Microarray Study , 2008, Statistical applications in genetics and molecular biology.

[39]  Weichung Joe Shih,et al.  A mixture model for estimating the local false discovery rate in DNA microarray analysis , 2004, Bioinform..

[40]  S. Scheid,et al.  A stochastic downhill search algorithm for estimating the local false discovery rate , 2004, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[41]  Catalin C. Barbacioru,et al.  The balance of reproducibility, sensitivity, and specificity of lists of differentially expressed genes in microarray studies , 2008, BMC Bioinformatics.

[42]  Aaron Aragaki,et al.  Significance testing for small microarray experiments , 2005, Statistics in medicine.

[43]  D. Bickel HighProbability determines which alternative hypotheses are sufficiently probable: Genomic applications include detection of differential gene expression , 2004, q-bio/0402049.

[44]  David R Bickel,et al.  Error-Rate and Decision-Theoretic Methods of Multiple Testing: Which Genes Have High Objective Probabilities of Differential Expression? , 2002, Statistical applications in genetics and molecular biology.

[45]  Cajo J. F. ter Braak,et al.  A Markov Chain Monte Carlo version of the genetic algorithm Differential Evolution: easy Bayesian computing for real parameter spaces , 2006, Stat. Comput..

[46]  Rainer Spang,et al.  twilight; a Bioconductor package for estimating the local false discovery rate , 2005, Bioinform..

[47]  Clifford M. Hurvich,et al.  Regression and time series model selection in small samples , 1989 .

[48]  Ian B. Jeffery,et al.  Comparison and evaluation of methods for generating differentially expressed gene lists from microarray data , 2006, BMC Bioinformatics.

[49]  Geoffrey J. McLachlan,et al.  Mixture Models for Detecting Differentially Expressed Genes in Microarrays , 2006, Int. J. Neural Syst..

[50]  Peter Dalgaard,et al.  R Development Core Team (2010): R: A language and environment for statistical computing , 2010 .

[51]  S. Richardson,et al.  Bayesian Modeling of Differential Gene Expression , 2006, Biometrics.

[52]  Mark A van de Wiel,et al.  Estimating the False Discovery Rate Using Nonparametric Deconvolution , 2007, Biometrics.