Local-pooled-error test for identifying differentially expressed genes with a small number of replicated microarrays

MOTIVATION In microarray studies gene discovery based on fold-change values is often misleading because error variability for each gene is heterogeneous under different biological conditions and intensity ranges. Several statistical testing methods for differential gene expression have been suggested, but some of these approaches are underpowered and result in high false positive rates because within-gene variance estimates are based on a small number of replicated arrays. RESULTS We propose to use local-pooled-error (LPE) estimates and robust statistical tests for evaluating significance of each gene's differential expression. Our LPE estimation is based on pooling errors within genes and between replicate arrays for genes in which expression values are similar. We have applied our LPE method to compare gene expression in naïve and activated CD8+ T-cells. Our results show that the LPE method effectively identifies significant differential-expression patterns with a small number of replicated arrays. AVAILABILITY The methodology is implemented with S-PLUS and R functions available at http://hesweb1.med.virginia.edu/bioinformatics

[1]  Alexander Kamb,et al.  A simple method for statistical analysis of intensity differences in microarray-derived gene expression data , 2001, BMC biotechnology.

[2]  Xiaohong Huang,et al.  Comparing three methods for variance estimation with duplicated high density oligonucleotide arrays , 2002, Functional & Integrative Genomics.

[3]  S. Dudoit,et al.  STATISTICAL METHODS FOR IDENTIFYING DIFFERENTIALLY EXPRESSED GENES IN REPLICATED cDNA MICROARRAY EXPERIMENTS , 2002 .

[4]  Pierre Baldi,et al.  A Bayesian framework for the analysis of microarray expression data: regularized t -test and statistical inferences of gene changes , 2001, Bioinform..

[5]  S. S. Young,et al.  Resampling-Based Multiple Testing: Examples and Methods for p-Value Adjustment , 1993 .

[6]  Brian S. Yandell,et al.  Adaptive Gene Picking with Microarray Data: Detecting Important Low Abundance Signals , 2003 .

[7]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[8]  Douglas M. Hawkins,et al.  A variance-stabilizing transformation for gene-expression microarray data , 2002, ISMB.

[9]  J. Wolfowitz,et al.  Introduction to the Theory of Statistics. , 1951 .

[10]  Klaus Ley,et al.  Relevance of L-selectin Shedding for Leukocyte Rolling In Vivo , 1999, The Journal of experimental medicine.

[11]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Christina Kendziorski,et al.  On Differential Variability of Expression Ratios: Improving Statistical Inference about Gene Expression Changes from Microarray Data , 2001, J. Comput. Biol..

[13]  C. Li,et al.  Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Russell D. Wolfinger,et al.  The contributions of sex, genotype and age to transcriptional variance in Drosophila melanogaster , 2001, Nature Genetics.

[15]  Franklin A. Graybill,et al.  Introduction to the Theory of Statistics, 3rd ed. , 1974 .

[16]  G. Churchill,et al.  Statistical design and the analysis of gene expression microarray data. , 2001, Genetical research.

[17]  Ingrid Lönnstedt Replicated microarray data , 2001 .

[18]  John Law,et al.  Robust Statistics—The Approach Based on Influence Functions , 1986 .

[19]  Jae K. Lee,et al.  An S-PLUS Library for the Analysis and Visualization of Differential Expression , 2003 .

[20]  Yogendra P. Chaubey Resampling-Based Multiple Testing: Examples and Methods for p-Value Adjustment , 1993 .

[21]  Terence P. Speed,et al.  Comparison of Methods for Image Analysis on cDNA Microarray Data , 2002 .

[22]  Peter Ramm,et al.  Statistical inference methods for gene expression arrays , 2001, SPIE BiOS.

[23]  John D. Storey,et al.  SAM Thresholding and False Discovery Rates for Detecting Differential Gene Expression in DNA Microarrays , 2003 .

[24]  Y. Hochberg A sharper Bonferroni procedure for multiple tests of significance , 1988 .

[25]  J K Lee,et al.  Analysis issues for gene expression array data. , 2001, Clinical chemistry.