Naïve Bayes Model Based Improved K-Nearest Neighbor Classifier for Breast Cancer Prediction

Breast cancer is one of the most common cancers among women all over the world. However, the cancer is curable and can be prevented if it is detected in its early stages. In medical science, many strategies have been developed to detect and diagnose cancer patients. Data mining techniques are not far behind and are widely used to extract information from large databases of cancer patients and to discover patterns that support decision making. Classification is one of the data mining techniques that can be used to assign the data to one of two classes, i.e., benign or malignant. This paper presents the Naive Bayes improved K-Nearest Neighbor method (NBKNN) for breast cancer prediction and compares its results with those of traditional classifiers, namely the traditional K-Nearest Neighbor and naive Bayes classifiers. The experiments use a standard dataset taken from the UCI repository. Sensitivity and specificity are used as the evaluation measures for comparing the results. Experimental results show that the proposed classifier performs better than the traditional classifiers.
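
For readers unfamiliar with the two measures named in the abstract, the following is a minimal sketch of how sensitivity and specificity are commonly computed for a benign/malignant classification task. It is not the authors' evaluation code: the function name `sensitivity_specificity`, the label strings, and the choice of "malignant" as the positive class are assumptions made for illustration, and the sample labels are invented rather than taken from the UCI dataset.

```python
def sensitivity_specificity(y_true, y_pred, positive="malignant"):
    """Return (sensitivity, specificity) for a two-class problem.

    Assumption: `positive` marks the class treated as the positive
    outcome (here "malignant"); every other label counts as negative.
    """
    tp = fn = tn = fp = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == positive:
            if pred == positive:
                tp += 1  # malignant case correctly flagged
            else:
                fn += 1  # malignant case missed
        else:
            if pred == positive:
                fp += 1  # benign case wrongly flagged
            else:
                tn += 1  # benign case correctly cleared

    # Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP).
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return sensitivity, specificity


# Hypothetical labels, for illustration only.
y_true = ["malignant", "malignant", "benign", "benign", "benign"]
y_pred = ["malignant", "benign", "benign", "benign", "malignant"]
sens, spec = sensitivity_specificity(y_true, y_pred)
```

Reporting both values, rather than a single accuracy figure, separates the rate of missed malignant cases from the rate of false alarms on benign cases, which matters when the two kinds of error carry different costs.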
