Enhancing Ensemble Prediction Accuracy of Breast Cancer Survivability and Diabetes Diagnostic Using Optimized EKF-RBFN Trained Prototypes

We are in a machine learning age where several predictive applications that are life dependent are made by machines and robotic devices that relies on ensemble decision making algorithms. These have attracted many researchers and led to the development of an algorithm that is based on the integration of EKF, RBF networks and AdaBoost as an ensemble model to improve prediction accuracy. Firstly, EKF is used to optimize the slow training speed and improve the efficiency of the RBF network training parameters. Secondly, AdaBoost is applied to generate and combine RBFN-EKF weak predictors to form a strong predictor. Breast cancer survivability and diabetes diagnostic datasets used were obtained from the UCI repository. Results are presented on the proposed model as applied to Breast cancer survivability and Diabetes diagnostic predictive problems. The model outputs an accuracy of 96% when EKF-RBFN is applied as a base classifier compare to 94% when Decision Stump is applied and AdaBoost as an ensemble technique in both examples. The output accuracy of ensemble AdaBoostM1-Random Forest and standalone Random Forest models is 97% in both cases. The study has gone some way towards enhancing our knowledge and improving the prediction accuracy through the amalgamation of EKF, RBFN and AdaBoost algorithms as an ensemble model.

[1]  M. O’Halloran,et al.  Spiking Neural Networks for Breast Cancer Classification Using Radar Target Signatures , 2010 .

[2]  Joseph W. Connolly,et al.  Model-Based Engine Control Architecture with an Extended Kalman Filter , 2016 .

[3]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Alberto Pigazo,et al.  Kalman Filter Recent Advances and Applications , 2009 .

[5]  Lihua Li,et al.  A novel Multi-Agent Ada-Boost algorithm for predicting protein structural class with the information of protein secondary structure , 2015, J. Bioinform. Comput. Biol..

[6]  Hooshang Jazayeri-Rad,et al.  Comparing the Fault Diagnosis Performances of Single Neural Networks and Two Ensemble Neural Networks Based on the Boosting Methods , 2014 .

[7]  Hamidreza Rashidy Kanan,et al.  Breast cancer detection and classification in digital mammography based on Non-Subsampled Contourlet Transform (NSCT) and Super Resolution , 2015, Comput. Methods Programs Biomed..

[8]  Seungjun Lee,et al.  Recent Advances in Microwave Imaging for Breast Cancer Detection , 2016, Int. J. Biomed. Imaging.

[9]  Manal Alghamdi,et al.  Predicting diabetes mellitus using SMOTE and ensemble machine learning approach: The Henry Ford ExercIse Testing (FIT) project , 2017, PloS one.

[10]  Ian T. Nabney,et al.  Netlab: Algorithms for Pattern Recognition , 2002 .

[11]  Ebad Banissi,et al.  Predictive Ensemble Modelling: Experimental Comparison of Boosting Implementation Methods , 2017, 2017 European Modelling Symposium (EMS).

[12]  Dan Simon,et al.  Training radial basis neural networks with the extended Kalman filter , 2002, Neurocomputing.

[13]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[14]  L. Vatten,et al.  Modern mammography screening and breast cancer mortality: population study , 2014, BMJ : British Medical Journal.

[15]  Silvia Curteanu,et al.  Predictions of ocular changes caused by diabetes in glaucoma patients , 2018, Comput. Methods Programs Biomed..

[16]  A. N. Chernodub Training Neural Networks for classification using the Extended Kalman Filter: A comparative study , 2014, Optical Memory and Neural Networks.

[17]  Yunsong Li,et al.  Breast mass classification in digital mammography based on extreme learning machine , 2016, Neurocomputing.

[18]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[19]  Jungwon Yoon,et al.  An Analytical Approach for Fast Recovery of the LSI Properties in Magnetic Particle Imaging , 2016, Int. J. Biomed. Imaging.

[20]  Ebad Banissi,et al.  Prediction of breast cancer survivability using ensemble algorithms , 2017, 2017 International Conference on Smart Systems and Technologies (SST).

[21]  Andrew P. Bradley,et al.  Intelligible Support Vector Machines for Diagnosis of Diabetes Mellitus , 2010, IEEE Transactions on Information Technology in Biomedicine.

[22]  Hanseok Ko,et al.  Reinforced AdaBoost Learning for Object Detection with Local Pattern Representations , 2013, TheScientificWorldJournal.

[23]  Ya Zhang,et al.  A machine learning-based framework to identify type 2 diabetes through electronic health records , 2017, Int. J. Medical Informatics.

[24]  B. W. Leach An introduction to Kalman filtering , 1984 .

[25]  Chastine Fatichah,et al.  A Combined AdaBoost and NEWFM Technique for Medical Data Classification , 2015 .

[26]  Denis Pereira de Lima Neural Network Training Using Unscented and Extended Kalman Filter , 2017, ICRA 2017.

[27]  Li-Yeh Chuang,et al.  Evaluation of Breast Cancer Susceptibility Using Improved Genetic Algorithms to Generate Genotype SNP Barcodes , 2013, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[28]  Rudolph van der Merwe,et al.  The unscented Kalman filter for nonlinear estimation , 2000, Proceedings of the IEEE 2000 Adaptive Systems for Signal Processing, Communications, and Control Symposium (Cat. No.00EX373).

[29]  Sanjay N. Talbar,et al.  An Overview of Pectoral Muscle Extraction Algorithms Applied to Digital Mammograms , 2016 .

[30]  M. O’Halloran,et al.  Spiking Neural Networks for Breast Cancer Classification in a Dielectrically Heterogeneous Breast , 2011 .

[31]  S. Love,et al.  Global Breast Cancer: The Lessons to Bring Home , 2011, International journal of breast cancer.