Automatic Diagnosis of Cardiac Disease from Twelve-Lead and Reduced-Lead ECGs Using Multilabel Classification

ECG is an essential tool for the clinical diagnosis of cardiac electrical abnormalities. As part of the PhysioNet/Computing in Cardiology Challenge 2021, eight and two folds from the 10-folds iterative splitting of public training data set were used as in-house training and internal validation sets. We used extracted features from RandOm Convolutional KErnel Transforms (ROCKETs) with a multilabel classification using XGBoost to predict cardiac abnormalities. Our team, LINC, developed an approach with minimal pre-processing (e.g., resampling data to 500Hz) and with no QRS detection or deep neural network design, which led to promising performance on the internal validation set. We didn't receive the official scores for the validation and test sets, because our entry failed during training in the official phase as we submitted an incomplete entry. Our classifiers received scores of 0.504, 0.466, 0.459, 0.458, and 0.438 for the 12-lead, 6-lead, 4-lead, 3-lead, and 2-lead versions on the internal validation set with the challenge evaluation metric (10 seconds ECG).

[1]  Jeffrey M. Hausdorff,et al.  Physionet: Components of a New Research Resource for Complex Physiologic Signals". Circu-lation Vol , 2000 .

[2]  Cyril Rakovski,et al.  A 12-lead electrocardiogram database for arrhythmia research covering more than 10,000 patients , 2020, Scientific Data.

[3]  Gari D Clifford,et al.  Will Two Do? Varying Dimensions in Electrocardiography: The PhysioNet/Computing in Cardiology Challenge 2021 , 2021, 2021 Computing in Cardiology (CinC).

[4]  Wojciech Samek,et al.  PTB-XL, a large publicly available electrocardiography dataset , 2020, Scientific Data.

[5]  Cyril Rakovski,et al.  Optimal Multi-Stage Arrhythmia Classification Approach , 2020, Scientific Reports.

[6]  John M. O'Toole,et al.  Random Convolution Kernels with Multi-Scale Decomposition for Preterm EEG Inter-burst Detection , 2021, 2021 29th European Signal Processing Conference (EUSIPCO).

[7]  Franz J. Király,et al.  sktime: A Unified Interface for Machine Learning with Time Series , 2019, ArXiv.

[8]  Ali Bahrami Rad,et al.  Classification of 12-lead ECGs: the PhysioNet/Computing in Cardiology Challenge 2020 , 2020, 2020 Computing in Cardiology.

[9]  Geoffrey I. Webb,et al.  Elastic Similarity Measures for Multivariate Time Series Classification , 2021, ArXiv.

[10]  Ralf Bousseljot,et al.  Nutzung der EKG-Signaldatenbank CARDIODAT der PTB über das Internet , 2009 .

[11]  Paul Kligfield,et al.  The centennial of the Einthoven electrocardiogram. , 2002, Journal of electrocardiology.

[12]  Geoffrey I. Webb,et al.  MiniRocket: A Very Fast (Almost) Deterministic Transform for Time Series Classification , 2020, KDD.

[13]  Jonathan Rubin,et al.  Electrocardiogram Monitoring and Interpretation: From Traditional Machine Learning to Deep Learning, and Their Combination , 2018, 2018 Computing in Cardiology Conference (CinC).

[14]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[15]  Jonathan Rubin,et al.  Analyzing single-lead short ECG recordings using dense convolutional neural networks and feature-based post-processing to detect atrial fibrillation , 2018, Physiological measurement.

[16]  Jonathan Rubin,et al.  Cardiac arrhythmia detection using deep learning: A review. , 2019, Journal of electrocardiology.

[17]  Jiebo Luo,et al.  Learning multi-label scene classification , 2004, Pattern Recognit..

[18]  Jonathan Rubin,et al.  Densely connected convolutional networks for detection of atrial fibrillation from short single-lead ECG recordings. , 2018, Journal of electrocardiology.