Tumor classification by tissue microarray profiling: random forest clustering applied to renal cell carcinoma

We describe a novel strategy (random forest clustering) for tumor profiling based on tissue microarray data. Random forest clustering is attractive for tissue microarray and other immunohistochemistry data since it handles highly skewed tumor marker expressions well and weighs the contribution of each marker according to its relatedness with other tumor markers. This is the first tumor class discovery analysis of renal cell carcinoma patients based on protein expression profiles. The tissue array data contained at least three tumor samples from each of 366 renal cell carcinoma patients. The eight tumor markers explore tumor proliferation, cell cycle abnormalities, cell mobility, and the hypoxia pathway. Since the procedure is unsupervised, no clinicopathological data or traditional classifications are used a priori. To explore whether the tissue microarray data can be used to identify fundamental subtypes of renal cell carcinoma patients, we first carried out random forest clustering of all 366 patients. By analyzing the tumor markers simultaneously, the procedure automatically detected classes that correspond to clear- vs non-clear cell tumors (demonstration of proof-of-principle). The resulting molecular grouping provides better prediction of survival (logrank P=0.000090) than this classical pathological grouping (logrank P=0.023). We then sought to extend the class discovery by searching for finer subclasses of clear cell patients. The procedure automatically discovered: (a) two classes corresponding to low- and high-grade patients (demonstration of proof-of-principle); (b) a subgroup of long-surviving clear cell patients with a distinct molecular profile and (c) two novel tumor subclasses in low-grade clear cell patients that could not be explained by any clinicopathological variables (demonstration of discovery).

[1]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[2]  L. Sobin,et al.  TNM classification of malignant tumors, fifth edition (1997) , 1997, Cancer.

[3]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[4]  G. Riethmüller,et al.  Randomised trial of monoclonal antibody for adjuvant therapy of resected Dukes' C colorectal carcinoma , 1994, The Lancet.

[5]  C. Rubio,et al.  The tumor-associated antigens BR55-2, GA73-3 and GICA 19-9 in normal and corresponding neoplastic human tissues, especially gastrointestinal tissues. , 1989, Anticancer research.

[6]  E. McFadden,et al.  Toxicity and response criteria of the Eastern Cooperative Oncology Group , 1982, American journal of clinical oncology.

[7]  S. Fuhrman,et al.  Prognostic significance of morphologic parameters in renal cell carcinoma , 1982, The American journal of surgical pathology.

[8]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[9]  N. Rioux-Leclercq,et al.  Value of immunohistochemical Ki-67 and p53 determinations as predictive factors of outcome in renal cell carcinoma. , 2000, Urology.

[10]  B. Delahunt,et al.  Intragenic PTEN/MMAC1 Loss of Heterozygosity in Conventional (Clear-Cell) Renal Cell Carcinoma is Associated with Poor Patient Prognosis , 2002, Modern Pathology.

[11]  O. Nativ,et al.  The role of vimentin expression in predicting the long-term outcome of patients with localized renal cell carcinoma. , 1997, British journal of urology.

[12]  J. Kononen,et al.  Tissue microarrays for high-throughput molecular profiling of tumor specimens , 1998, Nature Medicine.

[13]  P. Rehak,et al.  Expression of MUC1 (EMA) and E-cadherin in renal cell carcinoma: a systematic immunohistochemical analysis of 188 cases , 2004, Modern Pathology.

[14]  S. Horvath,et al.  Using Protein Expressions to Predict Survival in Clear Cell Renal Carcinoma , 2004, Clinical Cancer Research.

[15]  S. Nordling,et al.  Expression of the free β‐subunit of human chorionic gonadotropin in renal cell carcinoma: Prognostic study on tissue and serum , 2003, International journal of cancer.

[16]  P. Rehak,et al.  Steroid hormone receptor expression in renal cell carcinoma: an immunohistochemical analysis of 182 tumors. , 2004, The Journal of urology.

[17]  G. Landberg,et al.  Expression of vascular endothelial growth factor protein in human renal cell carcinoma , 2004, BJU international.

[18]  F. Marshall,et al.  Expression profiling of renal epithelial neoplasms: a method for tumor classification and discovery of diagnostic molecular markers. , 2001, The American journal of pathology.

[19]  Steve Horvath,et al.  Carbonic anhydrase IX is an independent predictor of survival in advanced renal clear cell carcinoma: implications for prognosis and therapy. , 2003, Clinical cancer research : an official journal of the American Association for Cancer Research.

[20]  E. Zabarovsky,et al.  Somatic mutation and homozygous deletion of PTEN/MMAC1 gene of 10q23 in renal cell carcinoma. , 1999, Anticancer research.

[21]  D. Sugarbaker,et al.  Cell motility as a prognostic factor in Stage I nonsmall cell lung carcinoma , 1999, Cancer.

[22]  J. Cheville,et al.  Comparisons of Outcome and Prognostic Features Among Histologic Subtypes of Renal Cell Carcinoma , 2003, The American journal of surgical pathology.

[23]  H. Moch,et al.  High-throughput tissue microarray analysis to evaluate genes uncovered by cDNA microarray screening in renal cell carcinoma. , 1999, The American journal of pathology.

[24]  M Vingron,et al.  Identification and Classification of Differentially Expressed Genes in Renal Cell Carcinoma by Expression Profiling on a Global Human 31 , 500-Element cDNA Array , 2001 .

[25]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[26]  Ximing J. Yang,et al.  Molecular subclassification of kidney tumors and the discovery of new diagnostic markers , 2003, Oncogene.

[27]  K. Furge,et al.  Gene expression profiling of clear cell renal cell carcinoma: Gene identification and prognostic classification , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[28]  Alicia Samuels,et al.  Cancer Statistics, 2003 , 2003, CA: a cancer journal for clinicians.

[29]  W. K. Alfred Yung,et al.  Identification of a candidate tumour suppressor gene, MMAC1, at chromosome 10q23.3 that is mutated in multiple advanced cancers , 1997, Nature Genetics.

[30]  G. Landberg,et al.  Expression of cyclin D1, D3, E, and p27 in human renal cell carcinoma analysed by tissue microarray , 2003, British Journal of Cancer.