Optimal designs for two-color microarray experiments in multi-factorial models

Two-color microarray experiments form an important tool in gene expression analysis. They are often used to identify candidate genes that can be made accountable for the genesis of a certain disease. Due to the high costs of microarray experiments it is fundamental to design these experiments carefully and specifically give instructions, which samples should be allocated on the same microarray. Thereby, two samples are hybridized together on one array and the assignment of samples to arrays influences the precision of the results. Therefore, design issues for microarray experiments have been investigated intensively in the last years. However, only few authors, e.g., Stanzel [37], focused on more than one factor of interest. We extend Stanzel’s work and derive approximate optimal designs for estimating interactions in multi-factorial settings. Thereby, optimality of candidate designs is shown using equivalence theorems (Pukelsheim [33]). Another practical important but less studied topic is the derivation of exact optimal designs. Most research considers approximate designs or exact designs for special contrast sets and selected numbers of arrays. Therefore, we focus on exact designs and present a method to construct A-optimal microarray designs for arbitrary numbers of arrays and arbitrary contrast sets. This method is applied to derive optimal designs for estimating treatment-control comparisons, all-tonext contrasts, Helmert contrasts and all pairwise comparisons. Furthermore, we derive robust designs, which achieve efficient results even if observations are missing. Missing values are a crucial topic in the context of microarray experiments, since they often occur due to scratches on the slide or other damaging. In applications recommendations for the choice of efficient experimental layouts can be derived from our constructed designs.

[1]  S. Ghosh Robustness of BIBD against the unavailability of data , 1982 .

[2]  J. Kiefer Construction and optimality of generalized Youden designs II , 1975 .

[3]  Sorin Drăghici,et al.  Data Analysis Tools for DNA Microarrays , 2003 .

[4]  R. A. Bailey Designs for two‐colour microarray experiments , 2007 .

[5]  Kathleen F. Kerr,et al.  Efficient 2k Factorial Designs for Blocks of Size 2 with Microarray Applications , 2006 .

[6]  Ernst Wit,et al.  Statistics for microarrays , 2004 .

[7]  E. J. G. Moonen,et al.  Optimal Experimental Designs for DNA Microarray Experiments , 2006 .

[8]  T. Speed,et al.  Design issues for cDNA microarray experiments , 2002, Nature Reviews Genetics.

[9]  D. V. Gokhale,et al.  A Survey of Statistical Design and Linear Models. , 1976 .

[10]  Weiming Ke,et al.  The Optimal Design of Blocked and Split-Plot Experiments , 2005, Technometrics.

[11]  Kay Nieselt,et al.  cDNA microarray analysis reveals novel candidate genes expressed in human peripheral blood following exhaustive exercise. , 2005, Physiological genomics.

[12]  G. Churchill,et al.  Experimental design for gene expression microarrays. , 2001, Biostatistics.

[13]  T. Banerjee,et al.  Optimal factorial designs for cDNA microarray experiments , 2008, 0803.3911.

[14]  Friedrich Pukelsheim Optimum Experimental Designs, with SAS by Anthony Atkinson, Alexander Donev, Randall Tobias , 2007 .

[15]  William D. Marslen-Wilson,et al.  Cingulate control of fronto-temporal integration reflects linguistic demands: A three-way interaction in functional connectivity , 2005, NeuroImage.

[16]  Sudhir Gupta,et al.  Balanced Factorial Designs for cDNA Microarray Experiments , 2006 .

[17]  Xiaohui Liu,et al.  An experimental evaluation of a loop versus a reference design for two-channel microarrays , 2005, Bioinform..

[18]  G. Churchill Fundamentals of experimental design for cDNA microarrays , 2002, Nature Genetics.

[19]  Pierre R. Bushel,et al.  Assessing Gene Significance from cDNA Microarray Expression Data via Mixed Models , 2001, J. Comput. Biol..

[20]  K. Kerr Design and Analysis of Experiments, Vol. 2: Advanced Experimental Design , 2006 .

[21]  Peter J. Cameron,et al.  Combinatorics of optimal designs , 2008 .

[22]  G F V Glonek,et al.  Factorial and time course designs for cDNA microarray experiments. , 2004, Biostatistics.

[23]  H. Stefánsson,et al.  Genetics of gene expression and its effect on disease , 2008, Nature.

[24]  Michael Jackson,et al.  Optimal Design of Experiments , 1994 .

[25]  E. Wit Design and Analysis of DNA Microarray Investigations , 2004, Human Genomics.

[26]  Scott L. Zeger,et al.  The Analysis of Gene Expression Data: Methods and Software , 2013 .

[27]  Sven Stanzel Optimale statistische Versuchsplanung dreifaktorieller Zwei-Farben-cDNA-Microarray-Experimente , 2008 .

[28]  Tahsin Erkan Ture Optimal row-column designs for multiple comparisons with a control : a complete catalog , 1994 .

[29]  Kevin Dobbin,et al.  Statistical Design of Reverse Dye Microarrays , 2003, Bioinform..

[30]  Ching-Shui Cheng,et al.  Maximizing the total number of spanning trees in a graph: Two related problems in graph theory and optimum design theory , 1981, J. Comb. Theory B.

[31]  Gary A. Churchill,et al.  Analysis of Variance for Gene Expression Microarray Data , 2000, J. Comput. Biol..

[32]  Russ B. Altman,et al.  Missing value estimation methods for DNA microarrays , 2001, Bioinform..

[33]  Edgar Brunner,et al.  Robustness considerations in selecting efficient two-color microarray designs , 2009, Bioinform..

[34]  R. Hilgers,et al.  The Within-B-Swap (BS) Design is A- and D-optimal for Estimating the Linear Contrast for the Treatment Effect in 3-Factorial cDNA Microarray Experiments , 2007 .

[35]  N. Gaffke,et al.  D-optimal block designs with at most six varieties , 1982 .

[36]  M Kathleen Kerr,et al.  Design considerations for efficient and effective microarray studies. , 2003, Biometrics.

[37]  Anthony C. Atkinson,et al.  Optimum Experimental Designs, with SAS , 2007 .

[38]  Joachim Kunert,et al.  Optimal Designs for Treatment-Control Comparisons in Microarray Experiments , 2009 .

[39]  P. Druilhet,et al.  Information Matrices for Non Full Rank Subsystems , 2007 .

[40]  Jack A. Taylor,et al.  The role of N-acetylation polymorphisms in smoking-associated bladder cancer: evidence of a gene-gene-exposure three-way interaction. , 1998, Cancer research.

[41]  C A Ross,et al.  Decreased expression of striatal signaling genes in a mouse model of Huntington's disease. , 2000, Human molecular genetics.

[42]  F. Pukelsheim,et al.  Efficient rounding of approximate designs , 1992 .

[43]  Edgar Brunner,et al.  Efficient design and analysis of two colour factorial microarray experiments , 2006, Comput. Stat. Data Anal..

[44]  S. Dudoit,et al.  Microarray expression profiling identifies genes with altered expression in HDL-deficient mice. , 2000, Genome research.

[45]  A. Shapira,et al.  Extremal Graph Theory , 2013 .

[46]  Chen-Tuo Liao,et al.  Statistical designs for two-color microarray experiments involving technical replication , 2006, Comput. Stat. Data Anal..

[47]  R. Schwabe,et al.  The relationship between optimal designs for microarray and paired comparison experiments , 2007 .