Validation of oligonucleotide microarray data using microfluidic low-density arrays: a new statistical method to normalize real-time RT-PCR data.

Profiling studies using microarrays to measure messenger RNA (mRNA) expression frequently identify long lists of differentially expressed genes. Differential expression is often validated using real-time reverse transcription PCR (RT-PCR) assays. In conventional real-time RT-PCR assays, expression is normalized to a control, or housekeeping gene. However, no single housekeeping gene can be used for all studies. We used TaqMan Low-Density Arrays, a medium-throughput method for real-time RT-PCR using microfluidics to simultaneously assay the expression of 96 genes in nine samples of chronic lymphocytic leukemia (CLL). We developed a novel statistical method, based on linear mixed-effects models, to analyze the data. This method automatically identifies the genes whose expression does not vary significantly over the samples, allowing them to be used to normalize the remaining genes. We compared the normalized real-time RT-PCR values with results obtained from Affymetrix Hu133A GeneChip oligonucleotide microarrays. We found that real-time RT-PCR using TaqMan Low-Density Arrays yielded reproducible measurements over seven orders of magnitude. Our model identified numerous genes that were expressed at nearly constant levels, including the housekeeping genes PGK1, GAPD, GUSB, TFRC, and 18S rRNA. After normalizing to the geometric mean of the unvarying genes, the correlation between real-time RT-PCR and microarrays was high for genes that were moderately expressed and varied across samples.

[1]  Thomas D. Schmittgen,et al.  Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. , 2001, Methods.

[2]  K. Aldape,et al.  A model of molecular interactions on short oligonucleotide microarrays , 2003, Nature Biotechnology.

[3]  S. Engeli,et al.  Validation of Endogenous Controls for Gene Expression Studies in Human Adipocytes and Preadipocytes , 2001, Hormone and metabolic research = Hormon- und Stoffwechselforschung = Hormones et metabolisme.

[4]  Kevin R Coombes,et al.  High expression of activation-induced cytidine deaminase (AID) and splice variants is a distinctive feature of poor-prognosis chronic lymphocytic leukemia. , 2003, Blood.

[5]  Claus Lindbjerg Andersen,et al.  Normalization of Real-Time Quantitative Reverse Transcription-PCR Data: A Model-Based Variance Estimation Approach to Identify Genes Suited for Normalization, Applied to Bladder and Colon Cancer Data Sets , 2004, Cancer Research.

[6]  D. Bates,et al.  Mixed-Effects Models in S and S-PLUS , 2001 .

[7]  T. Hudson,et al.  Control genes and variability: absence of ubiquitous reference transcripts in diverse mammalian expression studies. , 2002, Genome research.

[8]  Steven L. Allen,et al.  Ig V gene mutation status and CD38 expression as novel prognostic indicators in chronic lymphocytic leukemia. , 1999, Blood.

[9]  T. Speed,et al.  Summaries of Affymetrix GeneChip probe level data. , 2003, Nucleic acids research.

[10]  A. Siegbahn,et al.  A quantitative real-time PCR method for tissue factor mRNA. , 2003, Thrombosis research.

[11]  Thomas D. Schmittgen,et al.  Effect of experimental treatment on housekeeping gene expression: validation by real-time, quantitative RT-PCR. , 2000, Journal of biochemical and biophysical methods.

[12]  K. Coombes,et al.  A comparative analysis of data generated using two different target preparation methods for hybridization to high-density oligonucleotide microarrays , 2004, BMC Genomics.

[13]  R. Fisher FREQUENCY DISTRIBUTION OF THE VALUES OF THE CORRELATION COEFFIENTS IN SAMPLES FROM AN INDEFINITELY LARGE POPU;ATION , 1915 .

[14]  N. Chiorazzi,et al.  Ig V gene mutation status and CD38 expression as novel prognostic indicators in chronic lymphocytic leukemia. , 1999, Blood.

[15]  Mario Pazzagli,et al.  Quantitative real-time reverse transcription polymerase chain reaction: normalization to rRNA or single housekeeping genes is inappropriate for human tissue biopsies. , 2002, Analytical biochemistry.

[16]  W. Schaper,et al.  Differential expression of GAPDH and β-actin in growing collateral arteries , 2002, Molecular and Cellular Biochemistry.

[17]  F. Speleman,et al.  Accurate normalization of real-time quantitative RT-PCR data by geometric averaging of multiple internal control genes , 2002, Genome Biology.

[18]  C. Cohen,et al.  Validation of endogenous controls for gene expression analysis in microdissected human renal biopsies. , 2003, Kidney international.

[19]  Charles M Perou,et al.  Statistical modeling for selecting housekeeper genes , 2004, Genome Biology.

[20]  J. Aerts,et al.  Selection of appropriate control genes to assess expression of tumor antigens using real-time RT-PCR. , 2004, BioTechniques.

[21]  Juan Cabrera-Luque,et al.  Absolute quantitation of normal and ROS-induced patterns of gene expression: an in vivo real-time PCR study in mice. , 2003, Gene expression.

[22]  P. J. Higgins,et al.  Control selection for RNA quantitation. , 2000, BioTechniques.