A Metaheuristic Optimization Approach for Parameter Estimation in Arrhythmia Classification from Unbalanced Data

The electrocardiogram records the heart’s electrical activity and generates a significant amount of data. The analysis of these data helps us to detect diseases and disorders via heart bio-signal abnormality classification. In unbalanced-data contexts, where the classes are not equally represented, the optimization and configuration of the classification models are highly complex, reflecting on the use of computational resources. Moreover, the performance of electrocardiogram classification depends on the approach and parameter estimation to generate the model with high accuracy, sensitivity, and precision. Previous works have proposed hybrid approaches and only a few implemented parameter optimization. Instead, they generally applied an empirical tuning of parameters at a data level or an algorithm level. Hence, a scheme, including metrics of sensitivity in a higher precision and accuracy scale, deserves special attention. In this article, a metaheuristic optimization approach for parameter estimations in arrhythmia classification from unbalanced data is presented. We selected an unbalanced subset of those databases to classify eight types of arrhythmia. It is important to highlight that we combined undersampling based on the clustering method (data level) and feature selection method (algorithmic level) to tackle the unbalanced class problem. To explore parameter estimation and improve the classification for our model, we compared two metaheuristic approaches based on differential evolution and particle swarm optimization. The final results showed an accuracy of 99.95%, a F1 score of 99.88%, a sensitivity of 99.87%, a precision of 99.89%, and a specificity of 99.99%, which are high, even in the presence of unbalanced data.

[1]  Yakup Kutlu,et al.  A multi-stage automatic arrhythmia recognition and classification system , 2011, Comput. Biol. Medicine.

[2]  S. Poornachandra,et al.  Wavelet-based denoising using subband dependent threshold for ECG signals , 2008, Digit. Signal Process..

[3]  Mateusz Lango,et al.  Tackling the Problem of Class Imbalance in Multi-class Sentiment Classification: An Experimental Study , 2019, Foundations of Computing and Decision Sciences.

[4]  Marek Amanowicz,et al.  Intrusion Detection in Software Defined Networks with Self-organized Maps , 2015 .

[5]  Wei Lu,et al.  Feature fusion for imbalanced ECG data analysis , 2018, Biomed. Signal Process. Control..

[6]  José Luis Rojo-Álvarez,et al.  Detection of Life-Threatening Arrhythmias Using Feature Selection and Support Vector Machines , 2014, IEEE Transactions on Biomedical Engineering.

[7]  Vimalraj S Spelmen,et al.  A Review on Handling Imbalanced Data , 2018, 2018 International Conference on Current Trends towards Converging Technologies (ICCTCT).

[8]  Esteban J. Pino,et al.  Atrial activity selection for atrial fibrillation ECG recordings , 2013, Comput. Biol. Medicine.

[9]  P. N. Suganthan,et al.  Differential Evolution: A Survey of the State-of-the-Art , 2011, IEEE Transactions on Evolutionary Computation.

[10]  Naomie Salim,et al.  Arrhythmia recognition and classification using combined linear and nonlinear features of ECG signals , 2016, Comput. Methods Programs Biomed..

[11]  Qinghai Bai,et al.  Analysis of Particle Swarm Optimization Algorithm , 2010, Comput. Inf. Sci..

[12]  Christos Panagiotou,et al.  On the Detection of Myocadial Scar Based on ECG/VCG Analysis , 2013, IEEE Transactions on Biomedical Engineering.

[13]  Rahul Kher,et al.  Signal Processing Techniques for Removing Noise from ECG Signals , 2019 .

[14]  Peng Lu,et al.  An Effective LSTM Recurrent Network to Detect Arrhythmia on Imbalanced ECG Dataset , 2019, Journal of healthcare engineering.

[15]  Nelson F. F. Ebecken,et al.  A KNN Undersampling Approach for Data Balancing , 2015 .

[16]  Dariusz Janecki,et al.  Edge effect elimination in the recursive implementation of Gaussian filters , 2012 .

[17]  William Robson Schwartz,et al.  ECG-based heartbeat classification for arrhythmia detection: A survey , 2016, Comput. Methods Programs Biomed..

[18]  Axel Jantsch,et al.  Computer-aided Arrhythmia Diagnosis with Bio-signal Processing , 2019, ACM Comput. Surv..

[19]  M. Ashtiyani,et al.  Heart Rate Variability Classification using Support Vector Machine and Genetic Algorithm , 2018, Journal of biomedical physics & engineering.

[20]  Chih-Fong Tsai,et al.  Under-sampling class imbalanced datasets by combining clustering analysis and instance selection , 2019, Inf. Sci..

[21]  Robert Tibshirani,et al.  Classification by Pairwise Coupling , 1997, NIPS.

[22]  Ryan M. Rifkin,et al.  In Defense of One-Vs-All Classification , 2004, J. Mach. Learn. Res..

[23]  Jing Zhao,et al.  ACOSampling: An ant colony optimization-based undersampling method for classifying imbalanced DNA microarray data , 2013, Neurocomputing.

[24]  Selcan Kaplan Berkaya,et al.  A survey on ECG analysis , 2018, Biomed. Signal Process. Control..

[25]  Nalan Özkurt,et al.  ECG Beat Arrhythmia Classification by using 1-D CNN in case of Class Imbalance , 2019, 2019 Medical Technologies Congress (TIPTEKNO).

[26]  Kandala N. V. P. S. Rajesh,et al.  Classification of imbalanced ECG beats using re-sampling techniques and AdaBoost ensemble classifier , 2018, Biomed. Signal Process. Control..

[27]  Jian Li,et al.  Heartbeat classification using projected and dynamic features of ECG signal , 2017, Biomed. Signal Process. Control..

[28]  Daniel S. Yeung,et al.  Diversified Sensitivity-Based Undersampling for Imbalance Classification Problems , 2015, IEEE Transactions on Cybernetics.

[29]  Ahnaf Rashik Hassan,et al.  An expert system for automated identification of obstructive sleep apnea from single-lead ECG using random under sampling boosting , 2017, Neurocomputing.

[30]  Neha Mehra,et al.  Survey on Multiclass Classification Methods , 2013 .

[31]  Mohsen Ahmadi,et al.  Automated Method for Discrimination of Arrhythmias Using Time, Frequency, and Nonlinear Features of Electrocardiogram Signals , 2018, Sensors.

[32]  Miguel C. Soriano,et al.  A Fast Machine Learning Model for ECG-Based Heartbeat Classification and Arrhythmia Detection , 2019, Front. Phys..

[33]  S. Mitra,et al.  Detection and identification of ECG waves by histogram approach , 2016, 2016 2nd International Conference on Control, Instrumentation, Energy & Communication (CIEC).

[34]  Kyung-shik Shin,et al.  Optimization of cluster-based evolutionary undersampling for the artificial neural networks in corporate bankruptcy prediction , 2016, Expert Syst. Appl..

[35]  Manuel G. Penedo,et al.  Heartbeat classification fusing temporal and morphological information of ECGs via ensemble of classifiers , 2019, Biomed. Signal Process. Control..

[36]  Ataollah Ebrahimzadeh,et al.  Classification of the electrocardiogram signals using supervised classifiers and efficient features , 2010, Comput. Methods Programs Biomed..

[37]  G.B. Moody,et al.  The impact of the MIT-BIH Arrhythmia Database , 2001, IEEE Engineering in Medicine and Biology Magazine.

[38]  Manu Thomas,et al.  Automatic ECG arrhythmia classification using dual tree complex wavelet based features , 2015 .

[39]  Jing Jiang,et al.  A novel multi-module neural network system for imbalanced heartbeats classification , 2019, Expert Syst. Appl. X.

[40]  Qeethara Al-Shayea Artificial Neural Networks in Medical Diagnosis , 2024, International Journal of Research Publication and Reviews.

[41]  Semih Ergin,et al.  ECG classification using ensemble of features , 2013, 2013 47th Annual Conference on Information Sciences and Systems (CISS).

[42]  Witold Pedrycz,et al.  ECG Signal Processing, Classification and Interpretation: A Comprehensive Framework of Computational Intelligence , 2011 .

[43]  Satyajit D. Sarker,et al.  An Introduction to Computational Phytochemistry , 2018 .

[44]  Aboul Ella Hassanien,et al.  Combining Support Vector Machine and Elephant Herding Optimization for Cardiac Arrhythmias , 2018, ArXiv.