Abductive reasoning as a basis to reproduce expert criteria in ECG atrial fibrillation identification

OBJECTIVE This work aims at providing a new method for the automatic detection of atrial fibrillation, other arrhythmia and noise on short single-lead ECG signals, emphasizing the importance of the interpretability of the classification results. APPROACH A morphological and rhythm description of the cardiac behavior is obtained by a knowledge-based interpretation of the signal using the Construe abductive framework. Then, a set of meaningful features are extracted for each individual heartbeat and as a summary of the full record. The feature distributions can be used to elucidate the expert criteria underlying the labeling of the 2017 PhysioNet/CinC Challenge dataset, enabling a manual partial relabeling to improve the consistency of the training set. Finally, a tree gradient boosting model and a recurrent neural network are combined using the stacking technique to provide an answer on the basis of the feature values. MAIN RESULTS The proposal was independently validated against the hidden dataset of the Challenge, achieving a combined F 1 score of 0.83 and tying for the first place in the official stage of the Challenge. This result was even improved in the follow-up stage to 0.85 with a significant simplification of the model, attaining the highest score so far reported on the hidden dataset. SIGNIFICANCE The obtained results demonstrate the potential of Construe to provide robust and valuable descriptions of temporal data, even with the presence of significant amounts of noise. Furthermore, the importance of consistent classification criteria in manually labeled training datasets is emphasized, and the fundamental advantages of knowledge-based approaches to formalize and validate those criteria are discussed.

[1]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[2]  Paulo Félix,et al.  Heartbeat Classification Using Abstract Features From the Abductive Interpretation of the ECG , 2018, IEEE Journal of Biomedical and Health Informatics.

[3]  Muzahir Tayebjee,et al.  Electrocardiogram interpretation and arrhythmia management: a primary and secondary care survey. , 2016, The British journal of general practice : the journal of the Royal College of General Practitioners.

[4]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[5]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[6]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[7]  Johannes Gehrke,et al.  Intelligible Models for HealthCare: Predicting Pneumonia Risk and Hospital 30-day Readmission , 2015, KDD.

[8]  Jeroen J. Bax,et al.  Guidelines for the management of atrial fibrillation: the Task Force for the Management of Atrial Fibrillation of the European Society of Cardiology (ESC). , 2010, Europace : European pacing, arrhythmias, and cardiac electrophysiology : journal of the working groups on cardiac pacing, arrhythmias, and cardiac cellular electrophysiology of the European Society of Cardiology.

[9]  A. Kadish,et al.  Uncertainty principle of signal-averaged electrocardiography. , 2000, Circulation.

[10]  Jeffrey M. Hausdorff,et al.  Physionet: Components of a New Research Resource for Complex Physiologic Signals". Circu-lation Vol , 2000 .

[11]  Vaidotas Marozas,et al.  Low-complexity detection of atrial fibrillation in continuous long-term monitoring , 2015, Comput. Biol. Medicine.

[12]  Emmanuel Skordalakis,et al.  Syntactic Pattern Recognition of the ECG , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[14]  D. Levy,et al.  An improved method for adjusting the QT interval for heart rate (the Framingham Heart Study) , 1992, The American journal of cardiology.

[15]  David W. Scott,et al.  Multivariate Density Estimation: Theory, Practice, and Visualization , 1992, Wiley Series in Probability and Statistics.

[16]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[17]  Paulo Félix,et al.  On the adoption of abductive reasoning for time series interpretation , 2016, Artif. Intell..

[18]  David D. Cox,et al.  Making a Science of Model Search: Hyperparameter Optimization in Hundreds of Dimensions for Vision Architectures , 2013, ICML.