The Application of Genetic Algorithm for Unsupervised Classification of ECG

In this chapter,we have proposed an integrated methodology for electrocardiogram (ECG) based differentiation of arrhythmia and normal sinus rhythm using genetic algorithm optimized k-means clustering. Open source databases consisting of the MIT BIH arrhythmia and MIT BIH normal sinus rhythm data are used. The methodology consists of QRS-complex detection using the Pan-Tompkins algorithm, principal component analysis (PCA), and subsequent pattern classification using the k-means classifier, error back propagation neural network (EBPNN) classifier, and genetic algorithm optimized k-means clustering. The m-fold cross-validation scheme is used in choosing the training and testing sets for classification. The k-means classifier provides an average accuracy of 91.21 % over all folds, whereas EBPNN provides a greater average accuracy of 95.79 %. In the proposed method, the k-means classifier is optimized using the genetic algorithm (GA), and the accuracy of this classifier is 95.79 %, which is equal to that of EBPNN. In conclusion, the classification accuracy of simple unsupervised classifiers can be increased to near that of supervised classifiers by optimization using GA. The application of GA to other unsupervised algorithms to yield higher accuracy as a future direction is also observed.

[1]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[2]  André Carlos Ponce de Leon Ferreira de Carvalho,et al.  Genetic Clustering for Data Mining , 2008, Soft Computing for Knowledge Discovery and Data Mining.

[3]  U. Niranjan,et al.  Component wave delineation of ECG by filtering in the Fourier domain , 1992, Medical and Biological Engineering and Computing.

[4]  David G. Stork,et al.  Pattern Classification (2nd ed.) , 1999 .

[5]  Chandan Chakraborty,et al.  A two-stage mechanism for registration and classification of ECG using Gaussian mixture model , 2009, Pattern Recognit..

[6]  K. Park,et al.  Park's Textbook of Preventive and Social Medicine , 2015 .

[7]  Roshan Joy Martis,et al.  ARRHYTHMIA DISEASE DIAGNOSIS USING NEURAL NETWORK, SVM, AND GENETIC ALGORITHM-OPTIMIZED k-MEANS CLUSTERING , 2011 .

[8]  Patrick Gaydecki,et al.  The use of the Hilbert transform in ECG signal analysis , 2001, Comput. Biol. Medicine.

[9]  A. Guyton,et al.  Textbook of Medical Physiology , 1961 .

[10]  Asaf Bitton,et al.  Growing epidemic of coronary heart disease in low- and middle-income countries. , 2010, Current problems in cardiology.

[11]  Rui Xu,et al.  Survey of clustering algorithms , 2005, IEEE Transactions on Neural Networks.

[12]  Christopher M. Bishop,et al.  Neural networks for pattern recognition , 1995 .

[13]  P. Vaidyanathan Multirate Systems And Filter Banks , 1992 .

[14]  Chandan Chakraborty,et al.  Automated Screening of Arrhythmia Using Wavelet Based Machine Learning Techniques , 2012, Journal of Medical Systems.

[15]  Pablo Laguna,et al.  A wavelet-based ECG delineator: evaluation on standard databases , 2004, IEEE Transactions on Biomedical Engineering.

[16]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[17]  C. Li,et al.  Detection of ECG characteristic points using wavelet transforms. , 1995, IEEE transactions on bio-medical engineering.

[18]  Kalyanmoy Deb,et al.  Multi-objective optimization using evolutionary algorithms , 2001, Wiley-Interscience series in systems and optimization.

[19]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.