Sources of variability and effect of experimental approach on expression profiling data interpretation

BackgroundWe provide a systematic study of the sources of variability in expression profiling data using 56 RNAs isolated from human muscle biopsies (34 Affymetrix MuscleChip arrays), and 36 murine cell culture and tissue RNAs (42 Affymetrix U74Av2 arrays).ResultsWe studied muscle biopsies from 28 human subjects as well as murine myogenic cell cultures, muscle, and spleens. Human MuscleChip arrays (4,601 probe sets) and murine U74Av2 Affymetrix microarrays were used for expression profiling. RNAs were profiled both singly, and as mixed groups. Variables studied included tissue heterogeneity, cRNA probe production, patient diagnosis, and GeneChip hybridizations. We found that the greatest source of variability was often different regions of the same patient muscle biopsy, reflecting variation in cell type content even in a relatively homogeneous tissue such as muscle. Inter-patient variation was also very high (SNP noise). Experimental variation (RNA, cDNA, cRNA, or GeneChip) was minor. Pre-profile mixing of patient cRNA samples effectively normalized both intra- and inter-patient sources of variation, while retaining a high degree of specificity of the individual profiles (86% of statistically significant differences detected by absolute analysis; and 85% by a 4-pairwise comparison survival method).ConclusionsUsing unsupervised cluster analysis and correlation coefficients of 92 RNA samples on 76 oligonucleotide microarrays, we found that experimental error was not a significant source of unwanted variability in expression profiling experiments. Major sources of variability were from use of small tissue biopsies, particularly in humans where there is substantial inter-patient variability (SNP noise).

[1]  B. Yandell,et al.  The expression of adipogenic genes is decreased in obesity and diabetes mellitus. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[2]  D. Hartl,et al.  Manifold anomalies in gene expression in a vineyard isolate of Saccharomyces cerevisiae revealed by DNA microarray analysis. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[3]  B. Williams,et al.  Identification of genes differentially regulated by interferon α, β, or γ using oligonucleotide arrays , 1998 .

[4]  R. Tibshirani,et al.  Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[5]  G. Mcgall,et al.  Light-directed synthesis of high-density oligonucleotide arrays using semiconductor photoresists. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[6]  David G. Morris,et al.  Global analysis of gene expression in pulmonary fibrosis reveals distinct programs regulating lung inflammation and fibrosis. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[7]  J. Claverie Computational methods for the identification of differential and coordinated gene expression. , 1999, Human molecular genetics.

[8]  M. Morley,et al.  Making and reading microarrays , 1999, Nature Genetics.

[9]  Alexander Kamb,et al.  A simple method for statistical analysis of intensity differences in microarray-derived gene expression data , 2001, BMC biotechnology.

[10]  C. K. Lee,et al.  Gene expression profile of aging and its retardation by caloric restriction. , 1999, Science.

[11]  Daniel R. Richards,et al.  Direct allelic variation scanning of the yeast genome. , 1998, Science.

[12]  J. Mills,et al.  A new approach for filtering noise from high-density oligonucleotide microarray datasets. , 2001, Nucleic acids research.

[13]  N. Sampas,et al.  Molecular classification of cutaneous malignant melanoma by gene expression profiling , 2000, Nature.

[14]  Christian A. Rees,et al.  Distinctive gene expression patterns in human mammary epithelial cells and breast cancers. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[15]  B. Williams,et al.  Identification of genes differentially regulated by interferon alpha, beta, or gamma using oligonucleotide arrays. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[16]  P. Brown,et al.  New components of a system for phosphate accumulation and polyphosphate metabolism in Saccharomyces cerevisiae revealed by genomic expression analysis. , 2000, Molecular biology of the cell.

[17]  P. Brown,et al.  Whole-genome expression analysis of snf/swi mutants of Saccharomyces cerevisiae. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[18]  Eric P. Hoffman,et al.  Expression Profiling in the Muscular Dystrophies Identification of Novel Aspects of Molecular Pathophysiology , 2000 .

[19]  D. Lockhart,et al.  Expression monitoring by hybridization to high-density oligonucleotide arrays , 1996, Nature Biotechnology.

[20]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[21]  J. Gordon,et al.  Molecular analysis of commensal host-microbial relationships in the intestine. , 2001, Science.

[22]  U. Alon,et al.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[23]  S. P. Fodor,et al.  High density synthetic oligonucleotide arrays , 1999, Nature Genetics.

[24]  Michael N Liebman,et al.  Characterization of adjacent breast tumors using oligonucleotide microarrays , 2001, Breast Cancer Research.

[25]  M. Bittner,et al.  Expression profiling using cDNA microarrays , 1999, Nature Genetics.

[26]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[27]  D. Botstein,et al.  Genomic expression programs in the response of yeast cells to environmental changes. , 2000, Molecular biology of the cell.

[28]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.