Robust greedy deep dictionary learning for ECG arrhythmia classification

This work proposes a new deep learning method which we call robust deep dictionary learning RDDL. RDDL is suitable for learning representations from signals corrupted with sparse but large outliers such as artifacts and noise that are more heavy tailed than Gaussian distributions. Such outliers are common in biomedical signals e.g. EEG and ECG. RDDL learns multiple levels of non-linear dictionaries for representing the data. Instead of the standard Euclidean cost function that is usually employed in dictionary learning, we propose a robust l1-norm cost function. In order to achieve sparse representation, an l1-norm is imposed on the learned representation. The ‘depth’ arises from the fact that multiple levels of dictionaries are learnt. The full formulation is solved in a greedy fashion, one layer at a time. To study the extent of usefulness of RDDL, we first benchmark it with two wellknown deep learning tools — the stacked denoising autoencoder and the deep belief network methods; experiments are carried out on benchmark deep learning datasets — MNIST, CIFAR-10 and SVHN. In all cases, our method yields the best results. Then the proposed method is used for learning representations of ECG data (containing arficacts) and for their classification using the MIT-BIH arrhythmia classification database. We compare it with traditional techniques as well as on deep learning tools. Our method yields the best results.

[1]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[2]  B.V.K. Vijaya Kumar,et al.  Arrhythmia detection and classification using morphological and dynamic features of ECG signals , 2010, 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology.

[3]  Michael Elad,et al.  Dictionaries for Sparse Representation Modeling , 2010, Proceedings of the IEEE.

[4]  David L. Donoho,et al.  De-noising by soft-thresholding , 1995, IEEE Trans. Inf. Theory.

[5]  Frederick R. Forst,et al.  On robust estimation of the location parameter , 1980 .

[6]  Aidong Zhang,et al.  A Novel Semi-Supervised Deep Learning Framework for Affective State Recognition on EEG Signals , 2014, 2014 IEEE International Conference on Bioinformatics and Bioengineering.

[7]  Michael D. Gordon,et al.  Regularized Least Absolute Deviations Regression and an Efficient Algorithm for Parameter Tuning , 2006, Sixth International Conference on Data Mining (ICDM'06).

[8]  Nathalie Japkowicz,et al.  Nonlinear Autoassociation Is Not Equivalent to PCA , 2000, Neural Computation.

[9]  Rabab Kreidieh Ward,et al.  Robust dictionary learning: Application to signal disaggregation , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[10]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[11]  Fahim Sufi,et al.  Diagnosis of Cardiovascular Abnormalities From Compressed ECG: A Data Mining-Based Approach , 2009, IEEE Transactions on Information Technology in Biomedicine.

[12]  Angshul Majumdar,et al.  How to Train Your Neural Network with Dictionary Learning , 2017, 2017 Data Compression Conference (DCC).

[13]  Geoffrey E. Hinton,et al.  Restricted Boltzmann machines for collaborative filtering , 2007, ICML '07.

[14]  Angshul Majumdar,et al.  Majorization Minimization Technique for Optimally Solving Deep Dictionary Learning , 2017, Neural Processing Letters.

[15]  Volker Hohmann,et al.  Acoustic features for speech recognition based on Gammatone filterbank and instantaneous frequency , 2011, Speech Commun..

[16]  I. Barrodale,et al.  An Improved Algorithm for Discrete $l_1 $ Linear Approximation , 1973 .

[17]  Yoshua. Bengio,et al.  Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..

[18]  Tapani Raiko,et al.  Gaussian-Bernoulli deep Boltzmann machine , 2013, The 2013 International Joint Conference on Neural Networks (IJCNN).

[19]  Goldin and Senneby M and A , 2014 .

[20]  Naif Alajlan,et al.  Deep learning approach for active classification of electrocardiogram signals , 2016, Inf. Sci..

[21]  A. Bruckstein,et al.  K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation , 2005 .

[22]  G. O. Wesolowsky A new descent algorithm for the least absolute value regression problem , 1981 .

[23]  Gonzalo R. Arce,et al.  A Maximum Likelihood Approach to Least Absolute Deviation Regression , 2004, EURASIP J. Adv. Signal Process..

[24]  Pascal Vincent,et al.  Contractive Auto-Encoders: Explicit Invariance During Feature Extraction , 2011, ICML.

[25]  J. Branham,et al.  Alternatives to least squares , 1982 .

[26]  Mayank Vatsa,et al.  Deep Dictionary Learning , 2016, IEEE Access.

[27]  I. Daubechies,et al.  An iterative thresholding algorithm for linear inverse problems with a sparsity constraint , 2003, math/0307152.

[28]  E. Schlossmacher An Iterative Technique for Absolute Deviations Curve Fitting , 1973 .

[29]  Mark A. Lukas,et al.  An L 1 estimation algorithm with degeneracy and linear constraints , 2002 .

[30]  Rui Rodrigues,et al.  A Neural Network Approach to ECG Denoising , 2012, ArXiv.

[31]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[32]  Juan Pablo Martínez,et al.  Heartbeat Classification Using Feature Selection Driven by Database Generalization Criteria , 2011, IEEE Transactions on Biomedical Engineering.

[33]  U. Rajendra Acharya,et al.  ECG beat classification using PCA, LDA, ICA and Discrete Wavelet Transform , 2013, Biomed. Signal Process. Control..

[34]  Marc'Aurelio Ranzato,et al.  Sparse Feature Learning for Deep Belief Networks , 2007, NIPS.

[35]  Kwangsoo Shin,et al.  Robust algorithm for arrhythmia classification in ECG using extreme learning machine , 2009, Biomedical engineering online.

[36]  Geoffrey E. Hinton,et al.  Deep Boltzmann Machines , 2009, AISTATS.

[37]  Angshul Majumdar,et al.  Noisy Deep Dictionary Learning , 2017, CODS.

[38]  Farid Melgani,et al.  Active Learning Methods for Electrocardiographic Signal Classification , 2010, IEEE Transactions on Information Technology in Biomedicine.

[39]  S LuzEduardo José Da,et al.  ECG arrhythmia classification based on optimum-path forest , 2013 .

[40]  David Menotti,et al.  ECG arrhythmia classification based on optimum-path forest , 2013, Expert Syst. Appl..

[41]  Richa Singh,et al.  Detecting Silicone Mask-Based Presentation Attack via Deep Dictionary Learning , 2017, IEEE Transactions on Information Forensics and Security.

[42]  Angshul Majumdar,et al.  How to Train Your Deep Neural Network with Dictionary Learning , 2016, ArXiv.