Agreement in breast cancer classification between microarray and quantitative reverse transcription PCR from fresh-frozen and formalin-fixed, paraffin-embedded tissues.

BACKGROUND Microarray studies have identified different molecular subtypes of breast cancer with prognostic significance. To transition these classifications into the clinical laboratory, we have developed a real-time quantitative reverse transcription (qRT)-PCR assay to diagnose the biological subtypes of breast cancer from fresh-frozen (FF) and formalin-fixed, paraffin-embedded (FFPE) tissues. METHODS We used microarray data from 124 breast samples as a training set for classifying tumors into 4 previously defined molecular subtypes: Luminal, HER2(+)/ER(-), basal-like, and normal-like. We used the training set data in 2 different centroid-based algorithms to predict sample class on 35 breast tumors (test set) procured as FF and FFPE tissues (70 samples). We classified samples on the basis of large and minimized gene sets. We used the minimized gene set in a real-time qRT-PCR assay to predict sample subtype from the FF and FFPE tissues. We evaluated primer set performance between procurement methods by use of several measures of agreement. RESULTS The centroid-based algorithms were in complete agreement in classification from FFPE tissues by use of qRT-PCR and the minimized "intrinsic" gene set (40 classifiers). There was 94% (33 of 35) concordance between the diagnostic algorithms when comparing subtype classification from FF tissue by use of microarray (large and minimized gene set) and qRT-PCR data. We found that the ratio of the diagonal SD to the dynamic range was the best method for assessing agreement on a gene-by-gene basis. CONCLUSIONS Centroid-based algorithms are robust classifiers for breast cancer subtype assignment across platforms and procurement conditions.

[1]  Zhiyuan Hu,et al.  Classification and risk stratification of invasive breast carcinomas using a real-time quantitative RT-PCR assay , 2006, Breast Cancer Research.

[2]  C. Perou,et al.  High reproducibility using sodium hydroxide-stripped long oligonucleotide DNA microarrays. , 2005, BioTechniques.

[3]  Charles M Perou,et al.  Statistical modeling for selecting housekeeper genes , 2004, Genome Biology.

[4]  R. Tibshirani,et al.  Use of gene-expression profiling to identify prognostic subclasses in adult acute myeloid leukemia. , 2004, The New England journal of medicine.

[5]  A. Nobel,et al.  The molecular portraits of breast tumors are conserved across microarray platforms , 2006, BMC Genomics.

[6]  S. Dudoit,et al.  A prediction-based resampling method for estimating the number of clusters in a dataset , 2002, Genome Biology.

[7]  Joel S. Parker,et al.  Adjustment of systematic microarray data biases , 2004, Bioinform..

[8]  Debjani Dutta,et al.  Measurement of gene expression in archival paraffin-embedded tissues: development and performance of a 92-gene reverse transcriptase-polymerase chain reaction assay. , 2004, The American journal of pathology.

[9]  R. Tibshirani,et al.  Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Christian A. Rees,et al.  Molecular portraits of human breast tumours , 2000, Nature.

[11]  Alan R. Dabney BIOINFORMATICS Classification of Microarrays to Nearest Centroids , 2022 .

[12]  R. Tibshirani,et al.  Semi-Supervised Methods to Predict Patient Survival from Gene Expression Data , 2004, PLoS biology.

[13]  M. Cronin,et al.  A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer. , 2004, The New England journal of medicine.

[14]  S. Dudoit,et al.  Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data , 2002 .

[15]  Yudong D. He,et al.  Gene expression profiling predicts clinical outcome of breast cancer , 2002, Nature.

[16]  Van,et al.  A gene-expression signature as a predictor of survival in breast cancer. , 2002, The New England journal of medicine.

[17]  S. Dudoit,et al.  Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation. , 2002, Nucleic acids research.

[18]  L. Lin,et al.  A concordance correlation coefficient to evaluate reproducibility. , 1989, Biometrics.

[19]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[20]  A. Nobel,et al.  Concordance among Gene-Expression – Based Predictors for Breast Cancer , 2011 .

[21]  D. Seligson,et al.  Clinical Chemistry , 1965, Bulletin de la Societe de chimie biologique.

[22]  R. Tibshirani,et al.  Repeated observation of breast tumor subtypes in independent gene expression data sets , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[23]  R. Tibshirani,et al.  Diagnosis of multiple cancer types by shrunken centroids of gene expression , 2002, Proceedings of the National Academy of Sciences of the United States of America.