An Enhanced Ensemble Diagnosis of Cervical Cancer: A Pursuit of Machine Intelligence Towards Sustainable Health

Cervical cancer is a potentially life-threatening disease marked by health practitioners. The late diagnosis and treatment, being quite challenging, stake the precious lives of patients. In both developed and undeveloped states, the formal screening for disease identification suffers due to its medical cost, unavailable health facilities, society norms, and late appearance of symptoms. Machine intelligence is cost-effective, computationally inexpensive, and early diagnosis of several types of diseases, including cervical cancer. The patients are not required to pass through contemporary and tedious medical procedures, and early diagnosis of cervical cancer is quite handy with machine-intelligent solutions. The problem with the current machine classification methods for disease identification is the reliance on a single classifier’s prediction accuracy. The adoption of single classification methods doesn’t ensure the optimum prediction due to bias, over-fitting, mishandling of noisy data, and outliers. This research study proposes an Ensemble classification method based on majority voting for an accurate diagnosis addressing the patient’s medical conditions or symptoms. The study experiments a wide range of available classifiers, namely Decision Tree (DT), Support Vector Machine (SVM), Random Forest (RF), K-Nearest Neighbor (KNN), Naive Bayes (NB), Multiple Perceptron (MP), J48 Trees, and Logistic Regression (LR) classifiers. The study records a significant enhancement in prediction accuracy of 94% that outperforms the prediction accuracies of single classification methods tested on the same benchmarked datasets. Thus, the proposed model bestows a second opinion to health practitioners for disease identification and timely treatment.

[1]  C. Meijer,et al.  Clinical utility of HPV genotyping. , 2006, Gynecologic oncology.

[2]  Frank Speleman,et al.  Real-time quantitative PCR as an alternative to Southern blot or fluorescence in situ hybridization for detection of gene copy number changes. , 2007, Methods in molecular biology.

[3]  L. G. Koss,et al.  Cervical Cancer , 1981, Current Topics in Pathology.

[4]  J. A. Ware,et al.  A review of image analysis and machine learning techniques for automated cervical cancer screening from pap-smear images , 2018, Comput. Methods Programs Biomed..

[5]  Yanxi Liu,et al.  Cervical Cancer Detection Using SVM Based Feature Screening , 2004, MICCAI.

[6]  Cong Wang,et al.  Analysis of Risk Factors for Cervical Cancer Based on Machine Learning Methods , 2018, 2018 5th IEEE International Conference on Cloud Computing and Intelligence Systems (CCIS).

[7]  Georgios Dounias,et al.  Pap-Smear Classification Using Efficient Second Order Neural Network Training Algorithms , 2004, SETN.

[8]  R. Geetha,et al.  Cervical Cancer Identification with Synthetic Minority Oversampling Technique and PCA Analysis using Random Forest Classifier , 2019, Journal of Medical Systems.

[9]  B. Nithya,et al.  Evaluation of machine learning based optimized feature selection approaches and classification methods for cervical cancer prediction , 2019, SN Applied Sciences.

[10]  Gustavo Carneiro,et al.  Automated Nucleus and Cytoplasm Segmentation of Overlapping Cervical Cells , 2013, MICCAI.

[11]  Nestor J. Hernandez Marcano,et al.  Using Machine Learning Methods to Forecast if Solar Flares Will Be Associated with CMEs and SEPs , 2018, The Astrophysical Journal.

[12]  Hansang Lee,et al.  Segmentation of Overlapping Cervical Cells in Microscopic Images with Superpixel Partitioning and Cell-Wise Contour Refinement , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[13]  Guanglu Sun,et al.  Cervical Cancer Diagnosis based on Random Forest , 2017 .

[14]  Prateek Agrawal,et al.  Classification of Clinical Dataset of Cervical Cancer using KNN , 2016 .

[15]  Yen-Ping Chu,et al.  Edge Enhancement Nucleus and Cytoplast Contour Detector of Cervical Smear Images , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[16]  Q. Zou,et al.  Cancer Diagnosis Through IsomiR Expression with Machine Learning Method , 2016 .

[17]  Vinesh Thiruchelvam,et al.  Exploratory Data Analysis and ETL with SAS on Hadoop Eco-system with Cervical Cancer Dataset , 2020 .

[18]  Christophoros Nikou,et al.  Combining shape, texture and intensity features for cell nuclei extraction in Pap smear images , 2011, Pattern Recognit. Lett..

[19]  Selim Aksoy,et al.  Segmentation of Cervical Cell Images , 2010, 2010 20th International Conference on Pattern Recognition.

[20]  Jianping Yin,et al.  Cytoplasm and nucleus segmentation in cervical smear images using Radiating GVF Snake , 2012, Pattern Recognit..

[21]  Eileen M. Burd,et al.  Human Papillomavirus and Cervical Cancer , 1988, The Lancet.

[22]  J. Peto,et al.  The clinical effectiveness and cost-effectiveness of primary human papillomavirus cervical screening in England: extended follow-up of the ARTISTIC randomised trial cohort through three screening rounds. , 2014, Health technology assessment.

[23]  Xudong Jiang,et al.  Accurate Cervical Cell Segmentation from Overlapping Clumps in Pap Smear Images , 2017, IEEE Trans. Medical Imaging.

[24]  Chih-Jen Tseng,et al.  Application of machine learning to predict the recurrence-proneness for cervical cancer , 2013, Neural Computing and Applications.

[25]  Muhammad Atif,et al.  Cervical Cancer Prediction through Different Screening Methods using Data Mining , 2019, International Journal of Advanced Computer Science and Applications.

[26]  Bai Ying Lei,et al.  Accurate Segmentation of Cervical Cytoplasm and Nuclei Based on Multiscale Convolutional Network and Graph Partitioning , 2015, IEEE Transactions on Biomedical Engineering.

[27]  M. Follen,et al.  Comparing the Performance of Hybrid Capture II and Polymerase Chain Reaction (PCR) for the Identification of Cervical Dysplasia in the Screening and Diagnostic Settings , 2013, Clinical Medicine Insights. Oncology.

[28]  Konstantinos I. Diamantaras,et al.  Airfare prices prediction using machine learning techniques , 2017, 2017 25th European Signal Processing Conference (EUSIPCO).

[29]  I. Indrayanti,et al.  Automated cervical cell nuclei segmentation using morphological operation and watershed transformation , 2012, 2012 IEEE International Conference on Computational Intelligence and Cybernetics (CyberneticsCom).

[30]  M. Anousouya Devi,et al.  Classification of Cervical Cancer Using Artificial Neural Networks , 2016 .

[31]  Huiru Zheng,et al.  Machine Learning for Medical Applications , 2015, TheScientificWorldJournal.

[32]  T. Kessler,et al.  Cervical Cancer: Prevention and Early Detection. , 2017, Seminars in oncology nursing.

[33]  José Manuel Benítez,et al.  Segmentation of cervical cell nuclei in high-resolution microscopic images: A new algorithm and a web-based software framework , 2012, Comput. Methods Programs Biomed..

[34]  Hwa Jen Yap,et al.  Integrative machine learning analysis of multiple gene expression profiles in cervical cancer , 2018, PeerJ.

[35]  Wim Schoutens,et al.  Machine learning for quantitative finance: fast derivative pricing, hedging and fitting , 2018, Quantitative Finance.

[36]  Dimitrios I. Fotiadis,et al.  Machine learning applications in cancer prognosis and prediction , 2014, Computational and structural biotechnology journal.

[37]  Sansanee Auephanwiriyakul,et al.  Automatic cervical cell segmentation and classification in Pap smears , 2014, Comput. Methods Programs Biomed..

[38]  Zeyu Wang,et al.  A review of artificial intelligence based building energy use prediction: Contrasting the capabilities of single and ensemble prediction models , 2017 .

[39]  Jie Su,et al.  Automatic Detection of Cervical Cancer Cells by a Two-Level Cascade Classification System , 2016, Analytical cellular pathology.

[40]  Adi Wijaya,et al.  Behavior Determinant Based Cervical Cancer Early Detection with Machine Learning Algorithm , 2016 .

[41]  M. DeRosa,et al.  Recent advances in cancer early detection and diagnosis: Role of nucleic acid based aptasensors , 2020 .

[42]  Md. Rakibul Hoque,et al.  An empirical investigation of the relationship between e-government development and the digital economy: the case of Asian countries , 2018, J. Knowl. Manag..

[43]  D. Millar,et al.  Comparison of a novel HPV test with the Hybrid Capture II (hcII) and a reference PCR method shows high specificity and positive predictive value for 13 high-risk human papillomavirus infections. , 2008, Journal of clinical virology : the official publication of the Pan American Society for Clinical Virology.

[44]  M. Uda,et al.  Human Papillomavirus E6 biosensing: Current progression on early detection strategies for cervical Cancer. , 2019, International journal of biological macromolecules.

[45]  Septimiu E. Salcudean,et al.  A new era: artificial intelligence and machine learning in prostate cancer , 2019, Nature Reviews Urology.

[46]  H. Nalwa,et al.  A review on graphene-based nanocomposites for electrochemical and fluorescent biosensors , 2019, RSC advances.

[47]  Laurent Schwartz,et al.  The Warburg Effect and the Hallmarks of Cancer. , 2017, Anti-cancer agents in medicinal chemistry.

[48]  Dimitrios I. Fotiadis,et al.  Automated Detection of Cell Nuclei in PAP stained cervical smear images using Fuzzy Clustering , 2009 .

[49]  Selim Aksoy,et al.  Unsupervised segmentation and classification of cervical cell images , 2012, Pattern Recognit..

[50]  Georgios Dounias,et al.  Pap smear diagnosis using a hybrid intelligent scheme focusing on genetic algorithm based feature selection and nearest neighbor classification , 2009, Comput. Biol. Medicine.

[51]  Meng-Hsiun Tsai,et al.  Nucleus and cytoplast contour detector of cervical smear image , 2008, Pattern Recognit. Lett..

[52]  In Lee,et al.  Machine learning for enterprises: Applications, algorithm selection, and challenges , 2020 .

[53]  Ling Zhang,et al.  A deep learning based framework for accurate segmentation of cervical cytoplasm and nuclei , 2014, 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[54]  N. Zari,et al.  Label-Free DNA Biosensor for Electrochemical Detection of Short DNA Sequences Related to Human Papilloma Virus , 2009 .

[55]  Lawrence B. Holder,et al.  Machine learning for epigenetics and future medical applications , 2017, Epigenetics.

[56]  Søren Brunak,et al.  Finding Cervical Cancer Symptoms in Swedish Clinical Text using a Machine Learning Approach and NegEx , 2015, AMIA.

[57]  Xuebing Li,et al.  Application of cyclodextrins in cancer treatment , 2017, Journal of Inclusion Phenomena and Macrocyclic Chemistry.

[58]  Stephen Yip,et al.  Machine learning classifies cancer , 2018, Nature.

[59]  Nishtha Hooda,et al.  Predicting risk of Cervical Cancer : A case study of machine learning , 2019, Journal of Statistics and Management Systems.

[60]  Chun-Chieh Chen,et al.  Detection and segmentation of cervical cell cytoplast and nucleus , 2009, Int. J. Imaging Syst. Technol..

[61]  M. Yapar,et al.  Efficiency of MY09/11 consensus PCR in the detection of multiple HPV infections. , 2014, Diagnostic microbiology and infectious disease.

[62]  Dr.P. Aruna,et al.  Comparison of Feature selection methods for diagnosis of cervical cancer using SVM classifier , 2016 .

[63]  T. A. Brown,et al.  Southern Blotting and Related DNA Detection Techniques , 2001 .

[64]  M. von Knebel Doeberitz,et al.  Biomarkers in Cervical Cancer Screening , 2007, Disease markers.

[65]  Tianfu Wang,et al.  A Practical Segmentation Method for Automated Screening of Cervical Cytology , 2011, 2011 International Conference on Intelligent Computation and Bio-Medical Instrumentation.

[66]  Hui Kong,et al.  Partitioning Histopathological Images: An Integrated Framework for Supervised Color-Texture Segmentation and Cell Splitting , 2011, IEEE Transactions on Medical Imaging.

[67]  R. Kumar,et al.  Detection and Classification of Cancer from Microscopic Biopsy Images Using Clinically Significant and Biologically Interpretable Features , 2015, Journal of medical engineering.

[68]  Muhammad Attique,et al.  Data-Driven Cervical Cancer Prediction Model with Outlier Detection and Over-Sampling Methods , 2020, Sensors.

[69]  Gustavo Carneiro,et al.  An Improved Joint Optimization of Multiple Level Set Functions for the Segmentation of Overlapping Cervical Cells , 2015, IEEE Transactions on Image Processing.

[70]  Nor Ashidi Mat Isa,et al.  Overlapping cells separation method for cervical cell images , 2010, 2010 10th International Conference on Intelligent Systems Design and Applications.

[71]  P. Mullen,et al.  Comparison of the accuracy of Hybrid Capture II and polymerase chain reaction in detecting clinically important cervical dysplasia: a systematic review and meta-analysis , 2013, Cancer medicine.

[72]  Bin Zhang,et al.  Hybridization biosensor based on the covalent immobilization of probe DNA on chitosan-mutiwalled carbon nanotubes nanocomposite by using glutaraldehyde as an arm linker , 2011 .

[73]  Shilin Wang,et al.  A Bayesian Possibilistic C-Means clustering approach for cervical cancer screening , 2019, Inf. Sci..

[74]  Jaime S. Cardoso,et al.  Transfer Learning with Partial Observability Applied to Cervical Cancer Screening , 2017, IbPRIA.

[75]  Ahmed Ghoneim,et al.  Machine learning for assisting cervical cancer diagnosis: An ensemble approach , 2020, Future Gener. Comput. Syst..

[76]  P. Svenmarck,et al.  Possibilities and Challenges for Artificial Intelligence in Military Applications , 2018 .