Efficient design and analysis of two colour factorial microarray experiments

Statistical microarray data analysis has evolved fast in the last 5 years. However, design issues related to gene expression profiling experiments using two-colour cDNA or oligonucleotide-based microarrays have been investigated only recently. Different approaches and criteria for the choice of efficient designs have been proposed, which apply to a wide range of experiments from simple to complex factorial experimental setups. Here, results from previous research are expanded by showing how contrast vectors and matrices describing the experimental questions under investigation can be used to tailor efficient and economic designs that take into account the estimability of the desired contrasts. The most efficient design among a set of candidate designs is found by applying the e-efficiency criterion from optimal design theory suitable for general factorial designs, which reduces to the comparison of variance factors in the case of vectorial contrasts. These results enable researchers to select efficient designs among several candidate designs for arbitrary complex factorial designs given a set of experimental questions. The corresponding algorithms are implemented in the open source R package daMA available at www.bioconductor.org. The electronic version of this paper can be found at www.sciencedirect.com.

[1]  M. Kathleen Kerr,et al.  Linear Models for Microarray Data Analysis: Hidden Similarities and Differences , 2003, J. Comput. Biol..

[2]  S. Dudoit,et al.  Multiple Hypothesis Testing in Microarray Experiments , 2003 .

[3]  M Kathleen Kerr,et al.  Design considerations for efficient and effective microarray studies. , 2003, Biometrics.

[4]  G F V Glonek,et al.  Factorial and time course designs for cDNA microarray experiments. , 2004, Biostatistics.

[5]  G A Whitmore,et al.  Models for microarray gene expression data , 2002, Journal of biopharmaceutical statistics.

[6]  Martin Vingron,et al.  Variance stabilization applied to microarray data calibration and to the quantification of differential expression , 2002, ISMB.

[7]  X. Cui,et al.  Statistical tests for differential expression in cDNA microarray experiments , 2003, Genome Biology.

[8]  S. Dudoit,et al.  Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation. , 2002, Nucleic acids research.

[9]  S. R. Searle Linear Models , 1971 .

[10]  Yudong D. He,et al.  Functional Discovery via a Compendium of Expression Profiles , 2000, Cell.

[11]  Pierre R. Bushel,et al.  Assessing Gene Significance from cDNA Microarray Expression Data via Mixed Models , 2001, J. Comput. Biol..

[12]  Gary A. Churchill,et al.  Sources of Variation in Microarray Experiments , 2003 .

[13]  Gary A. Churchill,et al.  Analysis of Variance for Gene Expression Microarray Data , 2000, J. Comput. Biol..

[14]  Kevin Dobbin,et al.  Comparison of microarray designs for class comparison and class discovery , 2002, Bioinform..

[15]  G. Churchill,et al.  Experimental design for gene expression microarrays. , 2001, Biostatistics.

[16]  T. Speed,et al.  Design issues for cDNA microarray experiments , 2002, Nature Reviews Genetics.

[17]  G. Churchill Fundamentals of experimental design for cDNA microarrays , 2002, Nature Genetics.

[18]  F. Pukelsheim Optimal Design of Experiments , 1993 .

[19]  Kevin Dobbin,et al.  Statistical Design of Reverse Dye Microarrays , 2003, Bioinform..

[20]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[21]  G. Churchill,et al.  Statistical design and the analysis of gene expression microarray data. , 2007, Genetical research.

[22]  Terence P. Speed,et al.  Normalization for cDNA microarry data , 2001, SPIE BiOS.

[23]  L. Penland,et al.  Use of a cDNA microarray to analyse gene expression patterns in human cancer , 1996, Nature Genetics.

[24]  Andreas D. Baxevanis,et al.  Chipping away at genomic medicine , 2002, Nature Genetics.