Electrocardiogram analysis using a combination of statistical, geometric, and nonlinear heart rate variability features

OBJECTIVE The paper addresses a common and recurring problem of electrocardiogram (ECG) classification based on heart rate variability (HRV) analysis. Current understanding of the limits of HRV analysis in diagnosing different cardiac conditions is not complete. Existing research suggests that a combination of carefully selected linear and nonlinear HRV features should significantly improve the accuracy for both binary and multiclass classification problems. The primary goal of this work is to evaluate a proposed combination of HRV features. Other explored objectives are the comparison of different machine learning algorithms in the HRV analysis and the inspection of the most suitable period T between two consecutively analyzed R-R intervals for nonlinear features. METHODS AND MATERIAL We extracted 11 features from 5min of R-R interval recordings: SDNN, RMSSD, pNN20, HRV triangular index (HTI), spatial filling index (SFI), correlation dimension, central tendency measure (CTM), and four approximate entropy features (ApEn1-ApEn4). Analyzed heart conditions included normal heart rhythm, arrhythmia (any), supraventricular arrhythmia, and congestive heart failure. One hundred patient records from six online databases were analyzed, 25 for each condition. Feature vectors were extracted by a platform designed for this purpose, named ECG Chaos Extractor. The vectors were then analyzed by seven clustering and classification algorithms in the Weka system: K-means, expectation maximization (EM), C4.5 decision tree, Bayesian network, artificial neural network (ANN), support vector machines (SVM) and random forest (RF). Four-class and two-class (normal vs. abnormal) classification was performed. Relevance of particular features was evaluated using 1-Rule and C4.5 decision tree in the cases of individual features classification and classification with features' pairs. RESULTS Average total classification accuracy obtained for top three classification methods in the two classes' case was: RF 99.7%, ANN 99.1%, SVM 98.9%. In the four classes' case the best results were: RF 99.6%, Bayesian network 99.4%, SVM 98.4%. The best overall method was RF. C4.5 decision tree was successful in the construction of useful classification rules for the two classes' case. EM and K-means showed comparable clustering results: around 50% for the four classes' case and around 75% for the two classes' case. HTI, pNN20, RMSSD, ApEn3, ApEn4 and SFI were shown to be the most relevant features. HTI in particular appears in most of the top-ranked pairs of features and is the best analyzed feature. The choice of the period T for nonlinear features was shown to be arbitrary. However, a combination of five different periods significantly improved classification accuracy, from 70% for a single period up to 99% for five periods. CONCLUSIONS Analysis shows that the proposed combination of 11 linear and nonlinear HRV features gives high classification accuracy when nonlinear features are extracted for five periods. The features' combination was thoroughly analyzed using several machine learning algorithms. In particular, RF algorithm proved to be highly efficient and accurate in both binary and multiclass classification of HRV records. Interpretable and useful rules were obtained with C4.5 decision tree. Further work in this area should elucidate which features should be extracted for the best classification results for specific types of cardiac disorders.

[1]  Sam Kash Kachigan Multivariate statistical analysis: A conceptual introduction , 1982 .

[2]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[3]  Dimitrios I. Fotiadis,et al.  Automatic arrhythmia detection based on time and time-frequency analysis of heart rate variability , 2004, Comput. Methods Programs Biomed..

[4]  Stavros D. Nikolopoulos,et al.  Assessment of the classification capability of prediction and approximation methods for HRV analysis , 2007, Comput. Biol. Medicine.

[5]  G. Breithardt,et al.  Heart rate variability: standards of measurement, physiological interpretation and clinical use. Task Force of the European Society of Cardiology and the North American Society of Pacing and Electrophysiology. , 1996 .

[6]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[7]  A. Malliani,et al.  Heart rate variability. Standards of measurement, physiological interpretation, and clinical use , 1996 .

[8]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[9]  C. M. Lim,et al.  Heart rate variability analysis using correlation dimension and detrended fluctuation analysis , 2002 .

[10]  Donna L. Hudson,et al.  Measurement of variability in Holter tape R-R intervals for patients with congestive heart failure , 1994, Proceedings of 16th Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[11]  J. Richman,et al.  Physiological time-series analysis using approximate entropy and sample entropy. , 2000, American journal of physiology. Heart and circulatory physiology.

[12]  Ian Witten,et al.  Data Mining , 2000 .

[13]  Javad Hashemi,et al.  Classification of ECG arrhythmias based on statistical and time-frequency features , 2006 .

[14]  Maurice K. Wong,et al.  Algorithm AS136: A k-means clustering algorithm. , 1979 .

[15]  John M. Darrington,et al.  A new methodology for assessment of the performance of heartbeat classification systems , 2008, BMC Medical Informatics Decis. Mak..

[16]  A. Jovic,et al.  Feature Extraction for ECG Time-Series Mining Based on Chaos Theory , 2007, International Conference on Information Technology Interfaces.

[17]  Madalena Costa,et al.  Multiscale entropy analysis of complex physiologic time series. , 2002, Physical review letters.

[18]  Grebogi,et al.  Plateau onset for correlation dimension: When does it occur? , 1993, Physical review letters.

[19]  P. Grassberger,et al.  Measuring the Strangeness of Strange Attractors , 1983 .

[20]  Roberto Hornero,et al.  Improving diagnostic ability of blood oxygen saturation from overnight pulse oximetry in obstructive sleep apnea detection by means of central tendency measure , 2007, Artif. Intell. Medicine.

[21]  H. Stanley,et al.  Quantification of scaling exponents and crossover phenomena in nonstationary heartbeat time series. , 1995, Chaos.

[22]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[23]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[24]  Elif Derya íbeyli Recurrent neural networks with composite features for detection of electrocardiographic changes in partial epileptic patients , 2008 .

[25]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[26]  C. Peng,et al.  Age-related alterations in the fractal scaling of cardiac interbeat interval dynamics. , 1996, The American journal of physiology.

[27]  N. Chattipakorn,et al.  Heart rate variability in myocardial infarction and heart failure. , 2007, International journal of cardiology.

[28]  A L Goldberger,et al.  Physiological time-series analysis: what does regularity quantify? , 1994, The American journal of physiology.

[30]  C. M. Lim,et al.  Classification of cardiac abnormalities using heart rate signals , 2004, Medical and Biological Engineering and Computing.

[31]  Roberto Hornero,et al.  Variability, regularity, and complexity of time series generated by schizophrenic patients and control subjects , 2006, IEEE Transactions on Biomedical Engineering.

[32]  A. Goldberger Non-linear dynamics for clinicians: chaos theory, fractals, and complexity at the bedside , 1996, The Lancet.

[33]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[34]  Anastasios Bezerianos,et al.  Radial basis function neural networks for the characterization of heart rate variability dynamics , 1999, Artif. Intell. Medicine.

[35]  H. Schuster Deterministic chaos: An introduction , 1984 .

[36]  Remzi Seker,et al.  Validity Test for a Set of Nonlinear Measures for Short Data Length with Reference to Short-Term Heart Rate Variability Signal , 2000, J. Syst. Integr..

[37]  U. Rajendra Acharya,et al.  Heart rate variability: a review , 2006, Medical and Biological Engineering and Computing.

[38]  L. Amaral,et al.  Multifractality in human heartbeat dynamics , 1998, Nature.

[39]  T P Hutchinson Statistics and graphs for heart rate variability: pNN50 or pNN20? , 2003, Physiological measurement.

[40]  Seyed Kamaledin Setarehdan,et al.  Support vector machine-based arrhythmia classification using reduced features of heart rate variability signal , 2008, Artif. Intell. Medicine.

[41]  Oliver Faust,et al.  Analysis of cardiac signals using spatial filling index and time-frequency domain , 2004, Biomedical engineering online.

[42]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[43]  A L Goldberger,et al.  The pNNx files: re-examining a widely used heart rate variability measure , 2002, Heart.

[44]  S. Sitharama Iyengar,et al.  Classification of heart rate data using artificial neural network and fuzzy equivalence relation , 2003, Pattern Recognit..

[45]  Daniele Marinazzo,et al.  Multiscale analysis of short term heart beat interval, arterial blood pressure, and instantaneous lung volume time series , 2007, Artif. Intell. Medicine.

[46]  A. Freking,et al.  Demonstration of nonlinear components in heart rate variability of healthy persons. , 1998, The American journal of physiology.

[47]  Patrick E. McSharry,et al.  Advanced Methods And Tools for ECG Data Analysis , 2006 .

[48]  D. Levy,et al.  Predicting survival in heart failure case and control subjects by use of fully automated methods for deriving nonlinear and conventional indices of heart rate dynamics. , 1997, Circulation.

[49]  A. Goldberger Fractal Variability Versus Pathologic Periodicity: Complexity Loss and Stereotypy in Disease , 1997, Perspectives in biology and medicine.