Disease Classification and Biomarker Discovery Using ECG Data

In the recent decade, disease classification and biomarker discovery have become increasingly important in modern biological and medical research. ECGs are comparatively low-cost and noninvasive in screening and diagnosing heart diseases. With the development of personal ECG monitors, large amounts of ECGs are recorded and stored; therefore, fast and efficient algorithms are called for to analyze the data and make diagnosis. In this paper, an efficient and easy-to-interpret procedure of cardiac disease classification is developed through novel feature extraction methods and comparison of classifiers. Motivated by the observation that the distributions of various measures on ECGs of the diseased group are often skewed, heavy-tailed, or multimodal, we characterize the distributions by sample quantiles which outperform sample means. Three classifiers are compared in application both to all features and to dimension-reduced features by PCA: stepwise discriminant analysis (SDA), SVM, and LASSO logistic regression. It is found that SDA applied to dimension-reduced features by PCA is the most stable and effective procedure, with sensitivity, specificity, and accuracy being 89.68%, 84.62%, and 88.52%, respectively.

[1]  H. Bazett,et al.  AN ANALYSIS OF THE TIME‐RELATIONS OF ELECTROCARDIOGRAMS. , 1997 .

[2]  Jocelyne Fayn,et al.  A Classification Tree Approach for Cardiac Ischemia Detection Using Spatiotemporal Information From Three Standard ECG Leads , 2011, IEEE Transactions on Biomedical Engineering.

[3]  William J. Long,et al.  Using Classification Tree and Logistic Regression Methods to Diagnose Myocardial Infarction , 1998, MedInfo.

[4]  R. Ghongade,et al.  A robust and reliable ECG pattern classification using QRS morphological features and ANN , 2008, TENCON 2008 - 2008 IEEE Region 10 Conference.

[5]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[6]  Veera Boonjing,et al.  Comparing performances of logistic regression, decision trees, and neural networks for classifying heart disease patients , 2010, 2010 International Conference on Computer Information Systems and Industrial Management Applications (CISIM).

[7]  T. Syeda-Mahmood,et al.  Automatic detection of heart disease from twelve channel electrocardiogram waveforms , 2007, 2007 Computers in Cardiology.

[8]  Philip de Chazal,et al.  Automatic classification of heartbeats using ECG morphology and heartbeat interval features , 2004, IEEE Transactions on Biomedical Engineering.

[9]  Paul Honeine,et al.  Multi-class SVM classification combined with kernel PCA feature extraction of ECG signals , 2012, 2012 19th International Conference on Telecommunications (ICT).

[10]  Liping Wang,et al.  Multi-lead ECG classification based on Independent Component Analysis and Support Vector Machine , 2010, 2010 3rd International Conference on Biomedical Engineering and Informatics.

[11]  Georg Dorffner,et al.  Toward improving exercise ECG for detecting ischemic heart disease with recurrent and feedforward neural nets , 1994, Proceedings of IEEE Workshop on Neural Networks for Signal Processing.

[12]  Imad Barhumi,et al.  ECG signal classification using support vector machine based on wavelet multiresolution analysis , 2012, 2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA).

[13]  J. Mair,et al.  A decision tree for the early diagnosis of acute myocardial infarction in nontraumatic chest pain patients at hospital admission. , 1995, Chest.

[14]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[15]  W J Tompkins,et al.  Applications of artificial neural networks for ECG signal detection and classification. , 1993, Journal of electrocardiology.

[16]  Chandan Chakraborty,et al.  A two-stage mechanism for registration and classification of ECG using Gaussian mixture model , 2009, Pattern Recognit..

[17]  Hao Zhang,et al.  Highly Accurate ECG Beat Classification Based on Continuous Wavelet Transformation and Multiple Support Vector Machine Classifiers , 2009, 2009 2nd International Conference on Biomedical Engineering and Informatics.

[18]  Honglak Lee,et al.  Efficient L1 Regularized Logistic Regression , 2006, AAAI.