Comparative study on data mining classification methods for cervical cancer prediction using pap smear results

The number of woman with cervical cancer in Indonesia is getting higher. Indonesia becomes the country with the highest number of women with cervical cancer in the world. Cervical cancer became the highest cause of cancer deaths in women globally. There has been a lot of research using data mining techniques with variety of different data mining models that can be used for analyzing cervical cancer. In this research, data that be used were obtained from the medical records of the Pap smear test results. There are 38 symptoms and 7 classes. Naïve Bayes, Support Vector Machines (SVM), and Random Forest Tree was used to evaluate the performance of the classifier. The performance matric that used in this study are accuracy, recall, precision, and ROC curve. Based on the performance matric, Random Forest Tree is the best classifier among other classifiers to classify Pap smear results.

[1]  Upi Rianantika Implementasi Metode Similarity Untuk Pendukung Keputusan Diagnosis Kanker Serviks , 2013 .

[3]  S. Beaulah Advances in Natural and Applied Sciences , 2014 .

[4]  Yunqian Ma,et al.  Imbalanced Learning: Foundations, Algorithms, and Applications , 2013 .

[5]  Chintan Shah,et al.  Comparison of data mining classification algorithms for breast cancer prediction , 2013, 2013 Fourth International Conference on Computing, Communications and Networking Technologies (ICCCNT).

[6]  Christophoros Nikou,et al.  Automated Detection of Cell Nuclei in Pap Smear Images Using Morphological Reconstruction and Clustering , 2011, IEEE Transactions on Information Technology in Biomedicine.

[7]  Goutam Saha,et al.  Identification of Genetic Pathway for Cervical Cancer Development Using Rough and Bayesian Theory , 2014, 2014 Fourth International Conference of Emerging Applications of Information Technology.

[9]  Guangming Li,et al.  Multi-classes Imbalanced Dataset Classification Based on Sample Information , 2015, 2015 IEEE 17th International Conference on High Performance Computing and Communications, 2015 IEEE 7th International Symposium on Cyberspace Safety and Security, and 2015 IEEE 12th International Conference on Embedded Software and Systems.

[10]  Chi-Jie Lu,et al.  Prediction of Recurrence in Patients with Cervical Cancer Using MARS and Classification , 2022 .

[11]  Sharhabeel H. Alnabelsi,et al.  CERVICAL CANCER DIAGNOSTIC SYSTEM USING ADAPTIVE FUZZY MOVING K-MEANS ALGORITHM AND FUZZY MIN-MAX NEURAL NETWORK , 2013 .

[12]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[13]  Lotfi Nabli,et al.  The contribution of artificial intelligence tools in screening for cancer of the cervix , 2011, 2011 International Conference on Communications, Computing and Control Applications (CCCA).

[14]  Sansanee Auephanwiriyakul,et al.  Automatic cervical cell segmentation and classification in Pap smears , 2014, Comput. Methods Programs Biomed..