Practical Quality Assessment of Microarray Data by Simulation of Differential Gene Expression

There are many methods for assessing the quality of microarray data, but little guidance regarding what to do when defective data is identified. Depending on the scientific question asked, discarding flawed data from a small experiment may be detrimental. Here we describe a novel quality assessment method that is designed to identify chips that should be discarded from an experiment. This technique simulates a set of differentially expressed genes and then assesses whether discarding each chip enhances or obscures the recovery of this known set. We compare our method to expert annotations derived using popular quality diagnostics and show, with examples, that the decision to discard a chip depends on the details of the particular experiment.

[1]  J. Davis Bioinformatics and Computational Biology Solutions Using R and Bioconductor , 2007 .

[2]  Timothy Wilkes,et al.  Microarray data quality - review of current developments. , 2007, Omics : a journal of integrative biology.

[3]  Rickard Sandberg,et al.  Comparative microarray analysis. , 2006, Omics : a journal of integrative biology.

[4]  Beate Sick,et al.  Quality assessment of Affymetrix GeneChip data. , 2006, Omics : a journal of integrative biology.

[5]  Todd H. Stokes,et al.  chip artifact CORRECTion (caCORRECT): A Bioinformatics System for Quality Assurance of Genomics and Proteomics Array Data , 2007, Annals of Biomedical Engineering.

[6]  D. Grigoryev,et al.  Chronic intermittent hypoxia upregulates genes of lipid biosynthesis in obese mice. , 2005, Journal of applied physiology.

[7]  Gordon K. Smyth,et al.  Empirical array quality weights in the analysis of microarray data , 2006, BMC Bioinformatics.

[8]  Alex E. Lash,et al.  Gene Expression Omnibus: NCBI gene expression and hybridization array data repository , 2002, Nucleic Acids Res..

[9]  Yudi Pawitan,et al.  False discovery rate, sensitivity and sample size for microarray studies , 2005, Bioinform..

[10]  John N. Weinstein,et al.  Quality assessment of microarrays: Visualization of spatial artifacts and quantitation of regional biases , 2005, BMC Bioinformatics.

[11]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[12]  Kellie J Archer,et al.  Assessing quality of hybridized RNA in Affymetrix GeneChip experiments using mixed-effects models. , 2005, Biostatistics.

[13]  Gordon K Smyth,et al.  Statistical Applications in Genetics and Molecular Biology Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments , 2011 .

[14]  Pierre Baldi,et al.  A Bayesian framework for the analysis of microarray expression data: regularized t -test and statistical inferences of gene changes , 2001, Bioinform..

[15]  Beate Sick,et al.  Unsupervised assessment of microarray data quality using a Gaussian mixture model , 2009, BMC Bioinformatics.

[16]  Rafael A. Irizarry,et al.  Bioinformatics and Computational Biology Solutions using R and Bioconductor , 2005 .

[17]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[18]  T. Speed,et al.  Summaries of Affymetrix GeneChip probe level data. , 2003, Nucleic acids research.