An algorithm for expanding the TNM staging system.

AIM We describe a new method to expand the tumor, lymph node, metastasis (TNM) staging system using a clustering algorithm. Cases of breast cancer were used for demonstration. MATERIALS & METHODS An unsupervised ensemble-learning algorithm was used to create dendrograms. Cutting the dendrograms produced prognostic systems. RESULTS Prognostic systems contained groups of patients with similar outcomes. The prognostic systems based on tumor size and lymph node status recapitulated the general structure of the TNM for breast cancer. The prognostic systems based on tumor size, lymph node status, histologic grade and estrogen receptor status revealed a more detailed stratification of patients when grade and estrogen receptor status were added. CONCLUSION Prognostic systems from cutting the dendrogram have the potential to improve and expand the TNM.

[1]  David Elder,et al.  Thin primary cutaneous malignant melanoma: a prognostic tree for 10-year metastasis is more accurate than American Joint Committee on Cancer staging. , 2004, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[2]  L S Freedman,et al.  Relationship among outcome, stage of disease, and histologic grade for 22,616 cases of breast cancer. The basis for a prognostic index , 1991, Cancer.

[3]  J. Klein,et al.  Survival Analysis: Techniques for Censored and Truncated Data , 1997 .

[4]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[5]  E. Kaplan,et al.  Nonparametric Estimation from Incomplete Observations , 1958 .

[6]  M. Kattan,et al.  A postoperative prognostic nomogram for renal cell carcinoma. , 2001, The Journal of urology.

[7]  B. Everitt,et al.  Cluster Analysis: Everitt/Cluster Analysis , 2011 .

[8]  Li Zhang,et al.  Development and validation of a nomogram for predicting survival in patients with resected non-small-cell lung cancer. , 2015, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[9]  H. Burke,et al.  Criteria for prognostic factors and for an enhanced prognostic system , 1993, Cancer.

[10]  Xiuzhen Cheng,et al.  Developing Prognostic Systems of Cancer Patients by Ensemble Clustering , 2009, Journal of biomedicine & biotechnology.

[11]  Ash A. Alizadeh,et al.  Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling , 2000, Nature.

[12]  Dengyuan Wu,et al.  An examination of TNM staging of melanoma by a machine learning algorithm , 2012, 2012 International Conference on Computerized Healthcare (ICCH).

[13]  Li Sheng,et al.  On an ensemble algorithm for clustering cancer patient data , 2013, BMC Systems Biology.

[14]  D. Henson,et al.  Histologic grade remains a prognostic factor for breast cancer regardless of the number of positive lymph nodes and tumor size: a study of 161 708 cases of breast cancer from the SEER Program. , 2014, Archives of pathology & laboratory medicine.

[15]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[16]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Martin Radespiel-Tröger,et al.  Improved prediction of recurrence after curative resection of colon carcinoma using tree‐based risk stratification , 2004, Cancer.

[18]  Dengyuan Wu,et al.  The clinical implications of integrating additional prognostic factors into the TNM , 2014, Journal of surgical oncology.