BMC Bioinformatics BioMed Central Research article Sources of variation in Affymetrix microarray experiments

BackgroundA typical microarray experiment has many sources of variation which can be attributed to biological and technical causes. Identifying sources of variation and assessing their magnitude, among other factors, are important for optimal experimental design. The objectives of this study were: (1) to estimate relative magnitudes of different sources of variation and (2) to evaluate agreement between biological and technical replicates.ResultsWe performed a microarray experiment using a total of 24 Affymetrix GeneChip® arrays. The study included 4th mammary gland samples from eight 21-day-old Sprague Dawley CD female rats exposed to genistein (soy isoflavone). RNA samples from each rat were split to assess variation arising at labeling and hybridization steps. A general linear model was used to estimate variance components. Pearson correlations were computed to evaluate agreement between technical and biological replicates.ConclusionThe greatest source of variation was biological variation, followed by residual error, and finally variation due to labeling when *.cel files were processed with dChip and RMA image processing algorithms. When MAS 5.0 or GCRMA-EB were used, the greatest source of variation was residual error, followed by biology and labeling. Correlations between technical replicates were consistently higher than between biological replicates.

[1]  Wei-Min Liu,et al.  Robust estimators for expression analysis , 2002, Bioinform..

[2]  L. Qin,et al.  Empirical evaluation of data transformations and ranking statistics for microarray analysis. , 2004, Nucleic acids research.

[3]  Terence P. Speed,et al.  A comparison of normalization methods for high density oligonucleotide array data based on variance and bias , 2003, Bioinform..

[4]  Rafael A. Irizarry,et al.  A Model-Based Background Adjustment for Oligonucleotide Expression Arrays , 2004 .

[5]  Eric P. Hoffman,et al.  Sources of variability and effect of experimental approach on expression profiling data interpretation , 2002, BMC Bioinformatics.

[6]  Susan G Hilsenbeck,et al.  Reproducibility, sources of variability, pooling, and sample size: important considerations for the design of high-density oligonucleotide array experiments. , 2004, The journals of gerontology. Series A, Biological sciences and medical sciences.

[7]  Kenneth H Buetow,et al.  Interlaboratory comparability study of cancer gene expression analysis using oligonucleotide microarrays. , 2005, Clinical cancer research : an official journal of the American Association for Cancer Research.

[8]  Mark P Molloy,et al.  Overcoming technical variation and biological variation in quantitative proteomics , 2003, Proteomics.

[9]  K. Jain,et al.  Applications of biochips: from diagnostics to personalized medicine. , 2004, Current opinion in drug discovery & development.

[10]  Cheng Cheng,et al.  Statistical Development and Evaluation of Microarray Gene Expression Data Filters , 2005, J. Comput. Biol..

[11]  Members of the Complex Trait Consortium Standardizing global gene expression analysis between laboratories and across platforms , 2005 .

[12]  Nigel Walker,et al.  Hybridization buffer systems impact the quality of filter array data. , 2004, Journal of pharmacological and toxicological methods.

[13]  Matthias Nees,et al.  Impact of pre‐analytical handling on bone marrow mRNA gene expression , 2004, British journal of haematology.

[14]  C. Kendziorski,et al.  The efficiency of pooling mRNA in microarray experiments. , 2003, Biostatistics.

[15]  David B. Allison,et al.  Power and money: Designing statistically powerful studies while minimizing financial costs. , 1997 .

[16]  Richard Weindruch,et al.  A design and statistical perspective on microarray gene expression studies in nutrition: the need for playful creativity and scientific hard-mindedness. , 2003, Nutrition.

[17]  Aeilko H. Zwinderman,et al.  Comparing transformation methods for DNA microarray data , 2004, BMC Bioinformatics.

[18]  David M. Rocke,et al.  Transformation and normalization of oligonucleotide microarray data , 2003, Bioinform..

[19]  Rafael A Irizarry,et al.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data. , 2003, Biostatistics.

[20]  David B. Allison,et al.  A mixture model approach for the analysis of microarray gene expression data , 2002 .

[21]  G. Churchill,et al.  Variation in gene expression within and among natural populations , 2002, Nature Genetics.

[22]  Wei Chen,et al.  Comparison of seven methods for producing Affymetrix expression scores based on False Discovery Rates in disease profiling data , 2005, BMC Bioinformatics.

[23]  C. Li,et al.  Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[24]  X. Cui,et al.  Improved statistical tests for differential gene expression by shrinking variance components estimates. , 2005, Biostatistics.

[25]  John Quackenbush,et al.  Multiple-laboratory comparison of microarray platforms , 2005, Nature Methods.

[26]  T. Speed,et al.  Design issues for cDNA microarray experiments , 2002, Nature Reviews Genetics.

[27]  D. Kuhn,et al.  Quantification of sources of variation and accuracy of sequence discrimination in a replicated microarray experiment. , 2004, BioTechniques.

[28]  Kathleen F. Kerr,et al.  Standardizing global gene expression analysis between laboratories and across platforms , 2005, Nature Methods.

[29]  A. Cossins,et al.  Application of microarray technology in environmental and comparative physiology. , 2003, Annual review of physiology.

[30]  G. Churchill,et al.  A comparison of cDNA, oligonucleotide, and Affymetrix GeneChip gene expression microarray platforms. , 2004, Journal of biomolecular techniques : JBT.

[31]  Bryan Frank,et al.  Independence and reproducibility across microarray platforms , 2005, Nature Methods.

[32]  Lei Liu,et al.  A study of inter-lab and inter-platform agreement of DNA microarray data , 2005, BMC Genomics.

[33]  G. Churchill,et al.  Experimental design for gene expression microarrays. , 2001, Biostatistics.

[34]  Ash A. Alizadeh,et al.  Individuality and variation in gene expression patterns in human blood , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[35]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[36]  B. Weir,et al.  Assessing sources of variability in microarray gene expression data. , 2002, BioTechniques.

[37]  Kellie J Archer,et al.  Evaluation of quality-control criteria for microarray gene expression analysis. , 2004, Clinical chemistry.

[38]  G. Churchill Fundamentals of experimental design for cDNA microarrays , 2002, Nature Genetics.

[39]  Gary A. Churchill,et al.  Analysis of Variance for Gene Expression Microarray Data , 2000, J. Comput. Biol..

[40]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.