Normalization of microarray data using a spatial mixed model analysis which includes splines

MOTIVATION Microarray experiments with thousands of genes on a slide and multiple slides used in any experimental set represent a large body of data with many sources of variation. The identification of such sources of variation within microarray experimental sets is critical for correct deciphering of desired gene expression differences. RESULTS We describe new methods for the normalization using spatial mixed models which include splines and analysis of two-colour spotted microarrays for within slide variation and for a series of slides. The model typically explains 45-85% of the variation on a slide with only approximately 1% of the total degrees of freedom. The results from our methods compare favourably with those from intensity dependent normalization loess methods where we accounted for twice as much uncontrolled and unwanted variation on the slides. We have also developed an index for each EST that combines the various measures of the differential response into a single value that researchers can use to rapidly assess the genes of interest.

[1]  Carl de Boor,et al.  A Practical Guide to Splines , 1978, Applied Mathematical Sciences.

[2]  B. Weir,et al.  A systematic statistical linear modeling approach to oligonucleotide array experiments. , 2002, Mathematical biosciences.

[3]  H. D. Patterson,et al.  Recovery of inter-block information when block sizes are unequal , 1971 .

[4]  Sue J. Welham,et al.  Likelihood Ratio Tests for Fixed Model Terms using Residual Maximum Likelihood , 1997 .

[5]  Brian R. Cullis,et al.  Spatial analysis of field experiments : an extension to two dimensions , 1991 .

[6]  M. Cox The Numerical Evaluation of B-Splines , 1972 .

[7]  Douglas M. Hawkins,et al.  A variance-stabilizing transformation for gene-expression microarray data , 2002, ISMB.

[8]  Robin Thompson,et al.  Average information REML: An efficient algorithm for variance parameter estimation in linear mixed models , 1995 .

[9]  David M. Rocke,et al.  Estimation of Transformation Parameters for Microarray Data , 2003, Bioinform..

[10]  Brian R. Cullis,et al.  Accounting for natural and extraneous variation in the analysis of field experiments , 1997 .

[11]  Emlyn Williams,et al.  ROW AND COLUMAN DESIGNS WITH CONTIGUOUS REPLICATES , 1986 .

[12]  W. Cleveland,et al.  Locally Weighted Regression: An Approach to Regression Analysis by Local Fitting , 1988 .

[13]  Terry Speed,et al.  Normalization of cDNA microarray data. , 2003, Methods.

[14]  Y. Chen,et al.  Ratio-based decisions and the quantitative analysis of cDNA microarray images. , 1997, Journal of biomedical optics.

[15]  B. Weir,et al.  Assessing sources of variability in microarray gene expression data. , 2002, BioTechniques.

[16]  Pierre R. Bushel,et al.  Assessing Gene Significance from cDNA Microarray Expression Data via Mixed Models , 2001, J. Comput. Biol..

[17]  E. Dougherty,et al.  Gene-expression profiles in hereditary breast cancer. , 2001, The New England journal of medicine.

[18]  M. Kenward,et al.  The Analysis of Designed Experiments and Longitudinal Data by Using Smoothing Splines , 1999 .

[19]  David R. Cox Planning of Experiments , 1958 .

[20]  Martin Vingron,et al.  Variance stabilization applied to microarray data calibration and to the quantification of differential expression , 2002, ISMB.

[21]  P. Brown,et al.  DNA arrays for analysis of gene expression. , 1999, Methods in enzymology.

[22]  Gary A. Churchill,et al.  Analysis of Variance for Gene Expression Microarray Data , 2000, J. Comput. Biol..

[23]  S. Dudoit,et al.  STATISTICAL METHODS FOR IDENTIFYING DIFFERENTIALLY EXPRESSED GENES IN REPLICATED cDNA MICROARRAY EXPERIMENTS , 2002 .

[24]  R. A. Fisher,et al.  Design of Experiments , 1936 .

[25]  H T Lynch,et al.  Hereditary breast cancer. , 1991, Annals of medicine.

[26]  J. Boyd,et al.  Hereditary breast cancer. , 2001, Current problems in surgery.