Identifying and Quantifying Sources of Variation in Microarray Data Using High-Density cDNA Membrane Arrays

Microarray experiments involve many steps, including spotting cDNA, extracting RNA, labeling targets, hybridizing, scanning, and analyzing images. Each step introduces variability, confounding our ability to obtain accurate estimates of the biological differences between samples. We ran repeated experiments using high-density cDNA microarray membranes (Research Genetics Human GeneFilters Microarrays Version I) and 33P-labeled targets. Total RNA was extracted from a Burkitt lymphoma cell line (GA-10). We estimated the components of variation coming from: (1) image analysis, (2) exposure time to PhosphorImager screens, (3) differences in membranes, (4) reuse of membranes, and (5) differences in targets prepared from two independent RNA extractions. Variation was assessed qualitatively using a clustering algorithm and quantitatively using a version of ANOVA adapted to multivariate microarray data. The largest contribution to variation came from reusing membranes, which contributed 38% of the total variation. Differences in membranes and in exposure time each contributed about 10%. Differences in target preparations contributed less than 5%. The effect of image quantification was negligible. Much of the effect from reusing membranes was attributable to increasing levels of background radiation and can be reduced by using membranes at most four times. The effects of exposure time, which were partly attributable to variation in the scanning process, can be minimized by using the same exposure time for all experiments.

[1]  S. Granjeaud,et al.  From hybridization image to numerical values: a practical, high throughput quantification system for high density filter hybridizations. , 1996, Genetic analysis : biomolecular engineering.

[2]  N. Lee,et al.  A concise guide to cDNA microarray analysis. , 2000, BioTechniques.

[3]  J Walker,et al.  Gene expression profiling in human peripheral blood mononuclear cells using high-density filter-based cDNA microarrays. , 2000, Journal of immunological methods.

[4]  S. Dudoit,et al.  STATISTICAL METHODS FOR IDENTIFYING DIFFERENTIALLY EXPRESSED GENES IN REPLICATED cDNA MICROARRAY EXPERIMENTS , 2002 .

[5]  G E Archer,et al.  Maximization of signal derived from cDNA microarrays. , 2001, BioTechniques.

[6]  Pierre Baldi,et al.  A Bayesian framework for the analysis of microarray expression data: regularized t -test and statistical inferences of gene changes , 2001, Bioinform..

[7]  U. Alon,et al.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[8]  F. Bertucci,et al.  Sensitivity issues in DNA array-based expression measurements and performance of nylon microarrays for small samples. , 1999, Human molecular genetics.

[9]  G. W. Hatfield,et al.  Global gene expression profiling in Escherichia coli K12. The effects of integration host factor. , 2000, The Journal of biological chemistry.

[10]  G. Churchill,et al.  Experimental design for gene expression microarrays. , 2001, Biostatistics.

[11]  William N. Venables,et al.  Modern Applied Statistics with S-Plus. , 1996 .

[12]  J. Claverie Computational methods for the identification of differential and coordinated gene expression. , 1999, Human molecular genetics.

[13]  J. Hudson,et al.  Development of a prostate cDNA microarray and statistical gene expression analysis package , 2000, Molecular carcinogenesis.

[14]  Ingrid Lönnstedt Replicated microarray data , 2001 .

[15]  D. Botstein,et al.  The transcriptional program in the response of human fibroblasts to serum. , 1999, Science.

[16]  A Buckpitt,et al.  Development of a toxicological gene array and quantitative assessment of this technology. , 2000, Archives of biochemistry and biophysics.

[17]  Christina Kendziorski,et al.  On Differential Variability of Expression Ratios: Improving Statistical Inference about Gene Expression Changes from Microarray Data , 2001, J. Comput. Biol..

[18]  Dennis E. Grawoig,et al.  Statistics, a foundation for analysis , 1972, The Mathematical Gazette.

[19]  Ash A. Alizadeh,et al.  Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling , 2000, Nature.

[20]  A D Long,et al.  Improved Statistical Inference from DNA Microarray Data Using Analysis of Variance and A Bayesian Statistical Framework , 2001, The Journal of Biological Chemistry.

[21]  Gary A. Churchill,et al.  Analysis of Variance for Gene Expression Microarray Data , 2000, J. Comput. Biol..

[22]  E. Dougherty,et al.  Gene-expression profiles in hereditary breast cancer. , 2001, The New England journal of medicine.

[23]  E. Wolski,et al.  Normalization strategies for cDNA microarrays. , 2000, Nucleic acids research.

[24]  G. A. Whitmore,et al.  Importance of replication in microarray gene expression studies: statistical methods and evidence from repetitive cDNA hybridizations. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[25]  Yudong D. He,et al.  Functional Discovery via a Compendium of Expression Profiles , 2000, Cell.

[26]  Doron Lancet,et al.  GeneNote: whole genome expression profiles in normal human tissues. , 2003, Comptes rendus biologies.

[27]  Kevin R. Coombes,et al.  Identifying Differentially Expressed Genes in cDNA Microarray Experiments , 2001, J. Comput. Biol..

[28]  C. K. Lee,et al.  Gene expression profile of aging and its retardation by caloric restriction. , 1999, Science.

[29]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Christian A. Rees,et al.  Systematic variation in gene expression patterns in human cancer cell lines , 2000, Nature Genetics.

[31]  Christian A. Rees,et al.  Molecular portraits of human breast tumours , 2000, Nature.

[32]  Terence P. Speed,et al.  Normalization for cDNA microarry data , 2001, SPIE BiOS.

[33]  Pierre R. Bushel,et al.  STATISTICAL ANALYSIS OF A GENE EXPRESSION MICROARRAY EXPERIMENT WITH REPLICATION , 2002 .

[34]  N. Sampas,et al.  Molecular classification of cutaneous malignant melanoma by gene expression profiling , 2000, Nature.

[35]  A. Brazma,et al.  Gene expression data analysis , 2000, FEBS letters.

[36]  Wayne W. Daniel,et al.  Biostatistics: A Foundation for Analysis in the Health Sciences , 1974 .

[37]  E. Dougherty,et al.  Multivariate measurement of gene expression relationships. , 2000, Genomics.

[38]  T. Hughes,et al.  Signaling and circuitry of multiple MAPK pathways revealed by a matrix of global gene expression profiles. , 2000, Science.