A Comparative Machine Learning Algorithm to Predict the Bone Metastasis Cervical Cancer with Imbalance Data Problem

This paper attempted to develop and validate a tool to predict the immediate results of radiation on bone metastasis in cervical cancer cases. Cases of bone metastasis in cervical cancer are based on radiation treatment data, which is imbalanced. This imbalanced data is a challenge among the researchers in data mining, called class imbalance learning (CIL) and has lead to difficulties in machine learning and a reduction in the classifier performance. In this paper, we compared several algorithms to deal with the data imbalance classification problem using the synthetic minority over-sampling technique (SMOTE) used to drive classification models: Ant-Miner, RIPPER, Ridor, PART, ADTree, C4.5, ELM and Weighted ELM using Accuracy, G-mean and F-measure to evaluate performance. The results of this paper show that the RIPPER algorithm outperformed the other algorithms in Accuracy and F-measure, but weighted ELM outperformed other algorithms by G-mean. This may be useful when evaluating clinical assessments.

[1]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[2]  Yoav Freund,et al.  The Alternating Decision Tree Learning Algorithm , 1999, ICML.

[3]  Stan Matwin,et al.  Addressing the Curse of Imbalanced Training Sets: One-Sided Selection , 1997, ICML.

[4]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[5]  Brian R. Gaines,et al.  Induction of ripple-down rules applied to modeling large databases , 1995, Journal of Intelligent Information Systems.

[6]  K. Murase,et al.  Survival prediction using artificial neural networks in patients with uterine cervical cancer treated by radiation therapy alone , 2002, International Journal of Clinical Oncology.

[7]  U. Udomsubpayakul,et al.  Bone Metastasis in Cervical Cancer Patients Over a 10-Year Period , 2010, International Journal of Gynecologic Cancer.

[8]  D. Thanapprapasr,et al.  Comparison of Outcomes for Patients With Cervical Cancer Who Developed Bone Metastasis After the Primary Treatment With Concurrent Chemoradiation Versus Radiation Therapy Alone , 2010, International Journal of Gynecologic Cancer.

[9]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[10]  Yiqiang Chen,et al.  Weighted extreme learning machine for imbalance learning , 2013, Neurocomputing.

[11]  Dianhui Wang,et al.  Extreme learning machines: a survey , 2011, Int. J. Mach. Learn. Cybern..

[12]  Alex Alves Freitas,et al.  Data mining with an ant colony optimization algorithm , 2002, IEEE Trans. Evol. Comput..

[13]  Vorachai Tangvoraphonkchai,et al.  Model for Cervical Cancer Result Prediction using Artificial Neural Network , 2013 .

[14]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[15]  Nitesh V. Chawla,et al.  Editorial: special issue on learning from imbalanced data sets , 2004, SKDD.

[16]  Ian H. Witten,et al.  Generating Accurate Rule Sets Without Global Optimization , 1998, ICML.