Prediction of axillary lymph node metastasis in primary breast cancer patients using a decision tree-based model

BackgroundThe aim of this study was to develop a new data-mining model to predict axillary lymph node (AxLN) metastasis in primary breast cancer. To achieve this, we used a decision tree-based prediction method—the alternating decision tree (ADTree).MethodsClinical datasets for primary breast cancer patients who underwent sentinel lymph node biopsy or AxLN dissection without prior treatment were collected from three institutes (institute A, n = 148; institute B, n = 143; institute C, n = 174) and were used for variable selection, model training and external validation, respectively. The models were evaluated using area under the receiver operating characteristics (ROC) curve analysis to discriminate node-positive patients from node-negative patients.ResultsThe ADTree model selected 15 of 24 clinicopathological variables in the variable selection dataset. The resulting area under the ROC curve values were 0.770 [95% confidence interval (CI), 0.689–0.850] for the model training dataset and 0.772 (95% CI: 0.689–0.856) for the validation dataset, demonstrating high accuracy and generalization ability of the model. The bootstrap value of the validation dataset was 0.768 (95% CI: 0.763–0.774).ConclusionsOur prediction model showed high accuracy for predicting nodal metastasis in patients with breast cancer using commonly recorded clinical variables. Therefore, our model might help oncologists in the decision-making process for primary breast cancer patients before starting treatment.

[1]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[2]  T. Julian,et al.  Technical outcomes of sentinel-lymph-node resection and conventional axillary-lymph-node dissection in patients with clinically node-negative breast cancer: results from the NSABP B-32 randomised phase III trial. , 2007, The Lancet. Oncology.

[3]  Shoji Natsugoe,et al.  Ultrasound examination is useful for prediction of histologic type in invasive ductal carcinoma of the breast. , 2008, Ultrasound in medicine & biology.

[4]  Gary H Lyman,et al.  Lymphatic mapping and sentinel lymph node biopsy in early‐stage breast carcinoma , 2006, Cancer.

[5]  L. Belghiti,et al.  Prognostic factors in breast cancer , 2002 .

[6]  Xiaolin Zhou,et al.  Machine learning methods for anticipating the psychological distress in patients with alzheimer’s disease , 2006, Australasian Physics & Engineering Sciences in Medicine.

[7]  P. Langenberg,et al.  Breast Imaging Reporting and Data System: inter- and intraobserver variability in feature analysis and final assessment. , 2000, AJR. American journal of roentgenology.

[8]  Michael W Kattan,et al.  Doctor, what are my chances of having a positive sentinel node? A validated nomogram for risk estimation. , 2007, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[9]  Patrick Tighe,et al.  Use of machine learning theory to predict the need for femoral nerve block following ACL repair. , 2011, Pain medicine.

[10]  Solomon Henry,et al.  New models and online calculator for predicting non-sentinel lymph node status in sentinel lymph node positive breast cancer patients , 2008, BMC Cancer.

[11]  N. Natarajan,et al.  Breast cancer in the medial half. Results of 1978 national survey of the American College of surgeons , 1983, Cancer.

[12]  C. Redmond,et al.  Relation of number of positive axillary nodes to the prognosis of patients with primary breast cancer. An NSABP update , 1983, Cancer.

[13]  S Hellman,et al.  Pathological prognostic factors in stage I (T1N0M0) and stage II (T1N1M0) breast carcinoma: a study of 644 patients with median follow-up of 18 years. , 1989, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[14]  Kelly K. Hunt,et al.  Clinicopathologic Factors Predicting Involvement of Nonsentinel Axillary Nodes in Women With Breast Cancer , 2003, Annals of Surgical Oncology.

[15]  Marko Hočevar,et al.  Ljubljana nomograms for predicting the likelihood of non-sentinel lymph node metastases in breast cancer patients with a positive sentinel lymph node , 2009, Breast Cancer Research and Treatment.

[16]  Hiram S. Cody,et al.  A Nomogram for Predicting the Likelihood of Additional Nodal Metastases in Breast Cancer Patients With a Positive Sentinel Node Biopsy , 2003, Annals of Surgical Oncology.

[17]  Robert C. G. Martin,et al.  Prediction of sentinel lymph node-only disease in women with invasive breast cancer. , 2006, American journal of surgery.

[18]  S. Schnitt,et al.  Assessment of pathologic prognostic factors in breast core needle biopsies. , 1999, Modern pathology : an official journal of the United States and Canadian Academy of Pathology, Inc.

[19]  S. Edge,et al.  Prognostic factors in breast cancer , 2005 .

[20]  Masaru Tomita,et al.  Predictive value of CD24 and CD44 for neoadjuvant chemotherapy response and prognosis in primary breast cancer patients. , 2010, Journal of medical and dental sciences.

[21]  E Mjolsness,et al.  Machine learning for science: state of the art and future prospects. , 2001, Science.

[22]  Masakazu Toi,et al.  A Novel Method for Sentinel Lymph Node Biopsy by Indocyanine Green Fluorescence Technique in Breast Cancer , 2010, Cancers.

[23]  Yoav Freund,et al.  The Alternating Decision Tree Learning Algorithm , 1999, ICML.

[24]  Roman Rouzier,et al.  The Molecular Subtype Classification Is a Determinant of Sentinel Node Positivity in Early Breast Carcinoma , 2011, PloS one.

[25]  H. Cody,et al.  State-of-the-art approaches to sentinel node biopsy for breast cancer: study design, patient selection, technique, and quality control at Memorial Sloan-Kettering Cancer Center. , 1999, Surgical oncology.

[26]  Jean-Marie Aerts,et al.  Computerized prediction of intensive care unit discharge after cardiac surgery: development and validation of a Gaussian processes model , 2011, BMC Medical Informatics Decis. Mak..

[27]  橋本 秀行 Quantitative Ultrasound as a Predictor of Node Metastases and Prognosis in Patients with Breast Cancer , 2000 .

[28]  Khaled Rasheed,et al.  Decision tree and ensemble learning algorithms with their applications in bioinformatics. , 2011, Advances in experimental medicine and biology.

[29]  Roman Rouzier,et al.  Development and validation of nomograms for predicting residual tumor size and the probability of successful conservative surgery with neoadjuvant chemotherapy for breast cancer , 2006, Cancer.

[30]  A. Carmichael,et al.  A clinicopathological scoring system to select breast cancer patients for sentinel node biopsy. , 2006, European journal of surgical oncology : the journal of the European Society of Surgical Oncology and the British Association of Surgical Oncology.

[31]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[32]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[33]  Woo Kyung Moon,et al.  A Scoring System to Predict Nonsentinel Lymph Node Status in Breast Cancer Patients with Metastatic Sentinel Lymph Nodes: A Comparison with Other Scoring Systems , 2008, Annals of Surgical Oncology.

[34]  Guilherme Del Fiol,et al.  Classification models for the prediction of clinicians' information needs , 2009, J. Biomed. Informatics.

[35]  Antoine Flahault,et al.  An axilla scoring system to predict non-sentinel lymph node status in breast cancer patients with sentinel lymph node involvement , 2005, Breast Cancer Research and Treatment.

[36]  L. Liberman,et al.  Breast imaging reporting and data system (BI-RADS). , 2002, Radiologic clinics of North America.

[37]  Marko Hocevar,et al.  5120 Predicting the likelihood of non-sentinel lymph node metastases in breast cancer patients by three nomograms suitable for different institutions , 2009 .

[38]  A. Purushotham,et al.  A model for predicting non‐sentinel lymph node metastatic disease when the sentinel lymph node is positive , 2008, The British journal of surgery.

[39]  K Vajda,et al.  [Prognostic factors in breast cancer]. , 1998, Orvosi hetilap.

[40]  Tanya Hoskin,et al.  Nonsentinel node metastasis in breast cancer patients: assessment of an existing and a new predictive nomogram. , 2005, American journal of surgery.