Machine Learning based System for Prediction of Breast Cancer Severity

Breast cancer is one of the most common diseases and the leading cause of death to mostly females all over the world. Early detection can provide higher treatment efficiency and better healing chances. Even though mammography screening is handy in diagnosing breast cancer at an early stage, Computer-Aided Diagnosis (CAD) systems can help to reduce the cancer death-rate. Radiologists, physicians, and doctors, in general, make use of these CAD systems to diagnose, detect, analyze and make decisions whether the patient is benign or malignant. The present paper presents some data mining techniques used in the diagnosis of cancer such as Artificial Neuron Network (ANN), K-Nearest Neighbors (KNN), Binary Support Vector Machine (Binary SVM), and Decision Tree (DT). Within this framework, the database utilized is the Mammographic Mass dataset. This database contains data of probabilistic breast cancer patients and the advanced results by experts in the field. The paper adopts a confusion matrix for binary prediction as a method of data analysis. The present paper provides a comparison between the different Computer-Aided diagnosis systems techniques regarding accuracy, specificity, and sensitivity amidst many other criteria to find the most accurate alternative among ANN, KNN, Binary SVM, and DT.

[1]  A. Jemal,et al.  Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries , 2018, CA: a cancer journal for clinicians.

[2]  Natcha Thawesaengskulthai,et al.  Decision support system using artificial neural network for managing product innovation , 2011 .

[3]  C. Compton,et al.  Cancer Survival Analysis , 2012 .

[4]  S. Pal,et al.  Prediction of benign and malignant breast cancer using data mining techniques , 2018 .

[5]  Amit Ganatra,et al.  Support Vector Machine Classification using Mahalanobis Distance Function , 2015 .

[6]  E. E. Houby A survey on applying machine learning techniques for management of diseases , 2018 .

[7]  Robert T. Fazzio,et al.  Breast Cancer Screening for Women at Average Risk , 2019, Current Breast Cancer Reports.

[8]  J. Havel,et al.  Artificial neural networks in medical diagnosis , 2013 .

[9]  Mikko Kolehmainen,et al.  Structure-based classification of active and inactive estrogenic compounds by decision tree, LVQ and kNN methods. , 2006, Chemosphere.

[10]  David M. W. Powers,et al.  Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation , 2011, ArXiv.

[11]  S. Fields,et al.  Improved mammographic interpretation of masses using computer-aided diagnosis , 2000, European Radiology.

[12]  Brian K. Smith,et al.  An optimum ANN-based breast cancer diagnosis: Bridging gaps between ANN learning and decision-making goals , 2018, Appl. Soft Comput..

[13]  Mojgan Mokhtari,et al.  Breast cancer diagnosis: Imaging techniques and biochemical markers , 2018, Journal of cellular physiology.

[14]  Alaa M. El-Halees,et al.  Breast Cancer Severity Degree Predication Using Data Mining Techniques in the Gaza Strip , 2018, 2018 International Conference on Promising Electronic Technologies (ICPET).

[15]  Kunio Doi,et al.  Computer-aided diagnosis in medical imaging: Historical review, current status and future potential , 2007, Comput. Medical Imaging Graph..

[16]  Noel C. F. Codella,et al.  Skin lesion analysis toward melanoma detection: A challenge at the 2017 International symposium on biomedical imaging (ISBI), hosted by the international skin imaging collaboration (ISIC) , 2016, 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018).

[17]  Savita Goel,et al.  A study on prediction of breast cancer recurrence using data mining techniques , 2017, 2017 7th International Conference on Cloud Computing, Data Science & Engineering - Confluence.

[18]  Ayman M. Eldeib,et al.  Breast cancer classification using deep belief networks , 2016, Expert Syst. Appl..

[19]  J. Lortet-Tieulent,et al.  Breast Cancer Screening for Women at Average Risk: 2015 Guideline Update From the American Cancer Society , 2016 .

[20]  Kai Ming Ting,et al.  Confusion Matrix , 2010, Encyclopedia of Machine Learning and Data Mining.