Correction for pixel censoring in cDNA microarrays

cDNA microarrays are revolutionizing post-genomic biology, by monitoring the expression activity of thousands of genes simultaneously. The first stage in the analysis of microarray data is estimation of the level of gene expression from digital images obtained by laser scanning. However these data often include saturated pixel values set to the software limit of 65535. We consider three statistical models that correct for bias introduced by these saturation effects. Model 1 is applicable to the digital image of a microarray, and uses a linear model based on the principal components of uncensored spots, with parameters estimated by penalised least squares. Model 2 also uses pixel values, but we show that the ratio of expressions between pairs of samples can be estimated simply using the uncensored pixels associated with a spot. Model 3 is instead applicable to mean spot values from multiple laser scans at different settings. A functional regression model is used, based on a nonlinear relationship with both additive and multiplicative error terms, and fitted by robust methods. All three models are shown to be effective in correcting for the bias.

[1]  I. Helland Statistical Regression with Measurement Error. Chi‐Lun Cheng and John W. Van Ness, Kendall's Library of Statistics 6, Arnold, London, 1999. No. of pages: xiv+262. Price: £35. ISBN: 0‐340‐61461‐7 , 2000 .

[2]  John Aach,et al.  Measuring absolute expression with microarrays with a calibrated reference sample and an extended signal intensity range , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[3]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[4]  Chris A. Glasbey,et al.  Combinatorial image analysis of DNA microarray features , 2003, Bioinform..

[5]  David M. Rocke,et al.  A Model for Measurement Error for Gene Expression Arrays , 2001, J. Comput. Biol..

[6]  Sudhir Gupta,et al.  Statistical Regression With Measurement Error , 1999, Technometrics.

[7]  Chiara Romualdi,et al.  Improved detection of differentially expressed genes in microarray experiments through multiple scanning and image integration. , 2003, Nucleic acids research.

[8]  Claus Thorn Ekstrøm,et al.  Spot shape modelling and data transformations for microarrays , 2004, Bioinform..

[9]  Hans Lehrach,et al.  Automated image analysis for array hybridization experiments , 2001, Bioinform..

[10]  A. E. Hoerl,et al.  Ridge regression: biased estimation for nonorthogonal problems , 2000 .

[11]  Ernst Wit,et al.  Statistical Adjustment of Signal Censoring in Gene Expression Experiments , 2003, Bioinform..

[12]  Ralf Herwig,et al.  Simulation of DNA array hybridization experiments and evaluation of critical parameters during subsequent image and data analysis , 2002, BMC Bioinformatics.