Breast Cancer Detection in the IOT Health Environment Using Modified Recursive Feature Selection

The accurate and efficient diagnosis of breast cancer is extremely necessary for recovery and treatment in early stages in IoT healthcare environment. Internet of Things has witnessed the transition in life for the last few years which provides a way to analyze both the real-time data and past data by the emerging role of artificial intelligence and data mining techniques. The current state-of-the-art method does not effectively diagnose the breast cancer in the early stages, and most of the ladies suffered from this dangerous disease. Thus, the early detection of breast cancer significantly poses a great challenge for medical experts and researchers. To solve the problem of early-stage detection of breast cancer, we proposed machine learning-based diagnostic system which effectively classifies the malignant and benign people in the environment of IoT. In the development of our proposed system, a machine learning classifier support vector machine is used to classify the malignant and benign people. To improve the classification performance of the classification system, we used a recursive feature selection algorithm to select more suitable features from the breast cancer dataset. The training/testing splits method is applied for training and testing of the classifier for the best predictive model. Additionally, the classifier performance has been checked on by using performance evaluation metrics such as classification, specificity, sensitivity, Matthews’s correlation coefficient, F1-score, and execution time. To test the proposed method, the dataset “Wisconsin Diagnostic Breast Cancer” has been used in this research study. The experimental results demonstrate that the recursive feature selection algorithm selects the best subset of features, and the classifier SVM achieved optimal classification performance on this best subset of features. The SVM kernel linear achieved high classification accuracy (99%), specificity (99%), and sensitivity (98%), and the Matthews’s correlation coefficient is 99%. From these experimental results, we concluded that the proposed system performance is excellent due to the selection of more appropriate features that are selected by the recursive feature selection algorithm. Furthermore, we suggest this proposed system for effective and efficient early stages diagnosis of breast cancer. Thus, through this system, the recovery and treatment will be more effective for breast cancer. Lastly, the implementation of the proposed system is very reliable in all aspects of IoT healthcare for breast cancer.

[1]  Janaina Mourão Miranda,et al.  Classifying brain states and determining the discriminating activation patterns: Support Vector Machine on functional MRI data , 2005, NeuroImage.

[2]  M. M. A. Hashem,et al.  Mathematical model development to detect breast cancer using multigene genetic programming , 2016, 2016 5th International Conference on Informatics, Electronics and Vision (ICIEV).

[3]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[4]  Cheng Wang,et al.  Decision Tree Based Predictive Models for Breast Cancer Survivability on Imbalanced Data , 2009, 2009 3rd International Conference on Bioinformatics and Biomedical Engineering.

[5]  Joel Quintanilla-Domínguez,et al.  WBCD breast cancer database classification applying artificial metaplasticity neural network , 2011, Expert Syst. Appl..

[6]  Sahibzada Ali Mahmud,et al.  Breast cancer detection using cartesian genetic programming evolved artificial neural networks , 2012, GECCO '12.

[7]  Dimitris Kanellopoulos,et al.  Data Preprocessing for Supervised Leaning , 2007 .

[8]  Aytug Onan,et al.  A fuzzy-rough nearest neighbor classifier combined with consistency-based subset evaluation and instance selection for automated diagnosis of breast cancer , 2015, Expert Syst. Appl..

[9]  Moshe Sipper,et al.  A fuzzy-genetic approach to breast cancer diagnosis , 1999, Artif. Intell. Medicine.

[10]  Abdulhamit Subasi,et al.  Breast cancer diagnosis using GA feature selection and Rotation Forest , 2015, Neural Computing and Applications.

[11]  Asad Malik,et al.  Feature Selection Based on L1-Norm Support Vector Machine and Effective Recognition System for Parkinson’s Disease Using Voice Recordings , 2019, IEEE Access.

[12]  Hussein A. Abbass,et al.  An evolutionary artificial neural networks approach for breast cancer diagnosis , 2002, Artif. Intell. Medicine.

[13]  Nidhi Mishra,et al.  Breast cancer diagnosis using adaptive voting ensemble machine learning algorithm , 2018, 2018 IEEMA Engineer Infinite Conference (eTechNxT).

[14]  Yamuna Prasad,et al.  SVM Classifier Based Feature Selection Using GA, ACO and PSO for siRNA Design , 2010, ICSI.

[15]  Sai-Ho Ling,et al.  An Efficient Diagnosis System for Parkinson's Disease Using Deep Belief Network , 2017 .

[16]  Nor Ashidi Mat Isa,et al.  A genetic algorithm-based multi-objective optimization of an artificial neural network classifier for breast cancer diagnosis , 2013, Neural Computing and Applications.

[17]  Jian-Ping Li,et al.  A Hybrid Intelligent System Framework for the Prediction of Heart Disease Using Machine Learning Algorithms , 2018, Mob. Inf. Syst..

[18]  Md. Kamrul Hasan,et al.  Prediction of breast cancer using support vector machine and K-Nearest neighbors , 2017, 2017 IEEE Region 10 Humanitarian Technology Conference (R10-HTC).

[19]  Mehmet Fatih Akay,et al.  Support vector machines combined with feature selection for breast cancer diagnosis , 2009, Expert Syst. Appl..

[20]  A.A. Albrecht,et al.  Two applications of the LSA machine , 2002, Proceedings of the 9th International Conference on Neural Information Processing, 2002. ICONIP '02..

[21]  Sang Won Yoon,et al.  Breast cancer diagnosis based on feature extraction using a hybrid of K-means and support vector machine algorithms , 2014, Expert Syst. Appl..

[22]  Dustin Boswell,et al.  Introduction to Support Vector Machines , 2002 .

[23]  S. Krishna,et al.  Iot based patient monitoring and diagnostic prediction tool using ensemble classifier , 2017, 2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI).

[24]  F. Greene,et al.  Breast cancer statistics: use and misuse. , 1986, International surgery.

[25]  Jun Wu,et al.  Breast Cancer Diagnosis Using an Unsupervised Feature Extraction Algorithm Based on Deep Learning , 2018, 2018 37th Chinese Control Conference (CCC).

[26]  Wei-Min Shen,et al.  Data Preprocessing and Intelligent Data Analysis , 1997, Intell. Data Anal..

[27]  Carlos J. Perez,et al.  A two-stage variable selection and classification approach for Parkinson's disease detection by using voice recording replications , 2017, Comput. Methods Programs Biomed..

[28]  Wei Xiang,et al.  An IoT-cloud Based Wearable ECG Monitoring System for Smart Healthcare , 2016, Journal of Medical Systems.

[29]  Bo Gao,et al.  Improved Cost-Sensitive Support Vector Machine Classifier for Breast Cancer Diagnosis , 2018, Mathematical Problems in Engineering.

[30]  Randal S. Olson,et al.  Relief-Based Feature Selection: Introduction and Review , 2017, J. Biomed. Informatics.

[31]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[32]  Mohammad Teshnehlab,et al.  Breast cancer diagnosis in DCE-MRI using mixture ensemble of convolutional neural networks , 2017, Pattern Recognit..

[33]  Jianhua Gu,et al.  A New Hybrid Intelligent Framework for Predicting Parkinson’s Disease , 2017, IEEE Access.

[34]  Ahmad Taher Azar,et al.  Probabilistic neural network for breast cancer classification , 2012, Neural Computing and Applications.

[35]  Razieh Sheikhpour,et al.  Particle swarm optimization for bandwidth determination and feature selection of kernel density estimation based classifiers in diagnosis of breast cancer , 2016, Appl. Soft Comput..

[36]  Dayou Liu,et al.  A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis , 2011, Expert Syst. Appl..

[37]  A. V.DavidSánchez,et al.  Advanced support vector machines and kernel methods , 2003, Neurocomputing.

[38]  D. Plewes,et al.  Systematic Review: Using Magnetic Resonance Imaging to Screen Women at High Risk for Breast Cancer , 2008, Annals of Internal Medicine.

[39]  Xionghui Zhou,et al.  Integrating Feature Selection and Feature Extraction Methods With Deep Learning to Predict Clinical Outcome of Breast Cancer , 2018, IEEE Access.

[40]  Max A. Little,et al.  Nonlinear speech analysis algorithms mapped to a standard metric achieve clinically useful quantification of average Parkinson's disease symptom severity , 2011, Journal of The Royal Society Interface.