Multiplicative background correction for spotted microarrays to improve reproducibility.

We propose a simple approach, multiplicative background correction, to a perplexing problem in spotted microarray data analysis: correcting foreground intensities for background noise, especially at spots whose genes are weakly expressed or not expressed at all. The conventional approach, additive background correction, subtracts the background intensities directly from the foreground intensities. When the foreground intensities only marginally exceed the background intensities, additive correction gives unreliable estimates of differential gene expression and typically produces M-A plots with fishtails or fans. Because additive correction is unreliable in this regime, analysts often prefer to ignore the background noise altogether, which may increase the number of false positives. Adopting a multiplicative assumption, which is more realistic than the conventional additive one, we propose to logarithmically transform the intensity readings before background correction; the logarithmic transformation also symmetrizes the skewed intensity readings. This approach not only precludes fishtails and fans in the M-A plots but also yields highly reproducible background-corrected intensities for both strongly and weakly expressed genes. The superiority of multiplicative background correction over both additive correction and no background correction is demonstrated on publicly available self-hybridization datasets.
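The contrast between the two corrections can be illustrated with a minimal sketch. It assumes that multiplicative correction amounts to subtracting the log background from the log foreground in each channel, which is one reading of "transform before correcting" and is not spelled out in the abstract. The function names, the base-2 logarithm, and the intensity values are hypothetical and chosen only for illustration.

```python
import math

def additive_ma(rf, rb, gf, gb):
    """Additive correction: subtract background on the raw scale, then take logs.

    Returns (M, A), or None when a corrected intensity is not positive,
    because the logarithm is then undefined.
    """
    r, g = rf - rb, gf - gb
    if r <= 0 or g <= 0:
        return None
    lr, lg = math.log2(r), math.log2(g)
    return lr - lg, (lr + lg) / 2

def multiplicative_ma(rf, rb, gf, gb):
    """Assumed multiplicative correction: take logs first, then subtract
    the log background from the log foreground in each channel."""
    lr = math.log2(rf) - math.log2(rb)
    lg = math.log2(gf) - math.log2(gb)
    return lr - lg, (lr + lg) / 2

# Hypothetical spots: (red foreground, red background, green foreground, green background)
spots = [
    (1000.0, 100.0, 900.0, 100.0),  # strongly expressed
    (120.0, 100.0, 110.0, 100.0),   # foreground barely above background
    (95.0, 100.0, 105.0, 100.0),    # red foreground below background
]

additive = [additive_ma(*s) for s in spots]
multiplicative = [multiplicative_ma(*s) for s in spots]

for s, a, m in zip(spots, additive, multiplicative):
    print(s, "additive:", a, "multiplicative:", m)
```

For the second spot, additive correction turns readings of 120 and 110 into corrected values of 20 and 10, an apparent twofold change, while the log-scale version keeps M close to zero. For the third spot, additive correction has no defined log ratio at all. This is the kind of instability at low intensities that shows up as fishtails in M-A plots.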
