Reducing the variability in cDNA microarray image processing by Bayesian inference

MOTIVATION Gene expression levels are obtained from microarray experiments through the extraction of pixel intensities from a scanned image of the slide. It is widely acknowledged that variabilities can occur in expression levels extracted from the same images by different users with the same software packages. These inconsistencies arise due to differences in the refinement of the placement of the microarray 'grids'. We introduce a novel automated approach to the refinement of grid placements that is based upon the use of Bayesian inference for determining the size, shape and positioning of the microarray 'spots', capturing uncertainty that can be passed to downstream analysis. RESULTS Our experiments demonstrate that variability between users can be significantly reduced using the approach. The automated nature of the approach also saves hours of researchers' time normally spent in refining the grid placement.

[1]  P. Brown,et al.  DNA arrays for analysis of gene expression. , 1999, Methods in enzymology.

[2]  D S Varnum,et al.  Aphakia, a new mutation in the mouse. , 1968, The Journal of heredity.

[3]  Terence P. Speed,et al.  Normalization for cDNA microarry data , 2001, SPIE BiOS.

[4]  Matthew J. Beal,et al.  Graphical Models and Variational Methods , 2001 .

[5]  Anne Lohrli Chapman and Hall , 1985 .

[6]  Steve R. Waterhouse,et al.  Bayesian Methods for Mixtures of Experts , 1995, NIPS.

[7]  Jason E. Stewart,et al.  Minimum information about a microarray experiment (MIAME)—toward standards for microarray data , 2001, Nature Genetics.

[8]  A. Brazma,et al.  Gene expression data analysis. , 2001, FEBS letters.

[9]  Michael I. Jordan,et al.  An Introduction to Variational Methods for Graphical Models , 1999, Machine Learning.

[10]  R. T. Cox Probability, frequency and reasonable expectation , 1990 .

[11]  Terry Speed,et al.  Normalization of cDNA microarray data. , 2003, Methods.

[12]  M. Ko,et al.  Genome-wide expression profiling of mid-gestation placenta and embryo using a 15,000 mouse developmental cDNA microarray. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Michael Isard,et al.  BraMBLe: a Bayesian multiple-blob tracker , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[14]  Peter Green,et al.  Markov chain Monte Carlo in Practice , 1996 .

[15]  Yudong D. He,et al.  Functional Discovery via a Compendium of Expression Profiles , 2000, Cell.

[16]  Timothy J. Robinson,et al.  Sequential Monte Carlo Methods in Practice , 2003 .

[17]  Neil D. Lawrence,et al.  Bayesian processing of microarray images , 2003, 2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718).

[18]  Terence P. Speed,et al.  Comparison of Methods for Image Analysis on cDNA Microarray Data , 2002 .

[19]  A. Brazma,et al.  Gene expression data analysis , 2000, FEBS letters.