Predictive Model Prototype for the Diagnosis of Breast Cancer Using Big Data Technology

Big data is the collection of thousands of datasets from different application sources just as social media, banking, sales, marketing, etc. In every field, big data technologies are used for analyzing, preprocessing, storing, and generating new patterns for the benefits of the organization. The era of big data technology is nowadays booming [1]. Health care is one of the most important applications of big data. In health care, data exist in different forms like heart rate, blood pressure, blood test, sugar test, cholesterol, and many more. Diagnosis of diseases at an early stage is also very important in healthcare services. Cancer disease is an abnormal cell that negatively affects our body texture and regular functioning body organs. Due to cancer, the death rate is increased as it gets diagnosed at a later stage. Early diagnosis of cancer increases the survival rate of a patient. This paper focuses on the prediction model for the breast cancer diagnosis at an early stage as it increases the chances for successful treatment because of the advanced diagnostics technologies like MRI scans, ductogram, diagnostics mammogram, ultrasound, and many more. So predicting the prognosis of breast cancer increases the survival rate of women. Data mining classification algorithm like SVM, naive Bayes, k-NN, decision tree, etc. combined with analytical tool, which is a promising independent tool for handling huge datasets, is proven better in prediction of the breast cancer diagnosis.

[1]  Hajar Mousannif,et al.  Using Machine Learning Algorithms for Breast Cancer Risk Prediction and Diagnosis , 2016, ANT/SEIT.

[2]  Athanasios V. Vasilakos,et al.  Big data analytics: a survey , 2015, Journal of Big Data.

[3]  M. Mansourian,et al.  A Hybrid Computer-aided-diagnosis System for Prediction of Breast Cancer Recurrence (HPBCR) Using Optimized Ensemble Learning , 2016, Computational and structural biotechnology journal.

[4]  Hyunjung Shin,et al.  Robust predictive model for evaluating breast cancer survivability , 2013, Eng. Appl. Artif. Intell..

[5]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[6]  Markus Hagenbuchner,et al.  Breast cancer data analysis for survivability studies and prediction , 2018, Comput. Methods Programs Biomed..

[7]  Sapiah Binti Sakri,et al.  Particle Swarm Optimization Feature Selection for Breast Cancer Recurrence Prediction , 2018, IEEE Access.

[8]  Keun Ho Ryu,et al.  Design and Partial Implementation of Health Care System for Disease Detection and Behavior Analysis by Using DM Techniques , 2016, 2016 IEEE 14th Intl Conf on Dependable, Autonomic and Secure Computing, 14th Intl Conf on Pervasive Intelligence and Computing, 2nd Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress(DASC/PiCom/DataCom/CyberSciTech).

[9]  Thora Jonsdottir,et al.  The feasibility of constructing a Predictive Outcome Model for breast cancer using the tools of data mining , 2008, Expert Syst. Appl..

[10]  Chintan Shah,et al.  Comparison of data mining classification algorithms for breast cancer prediction , 2013, 2013 Fourth International Conference on Computing, Communications and Networking Technologies (ICCCNT).

[11]  Jaber Alwidian,et al.  WCBA: Weighted classification based on association rules algorithm for breast cancer disease , 2018, Appl. Soft Comput..