Using Under-Trained Deep Ensembles to Learn Under Extreme Label Noise: A Case Study for Sleep Apnea Detection

Improper or erroneous labelling can pose a hindrance to reliable generalization for supervised learning. This can have negative consequences, especially for critical fields such as healthcare. We propose an effective new approach for learning under extreme label noise for medical applications like sleep apnea, that is based on under-trained deep ensembles. Each ensemble member is trained with a subset of the training data, to acquire a general overview of the decision boundary separation, without focusing on potentially erroneous details. The accumulated knowledge of the ensemble is combined to form new labels, that determine a better class separation than the original labels. A new model is trained with these labels to generalize reliably despite the label noise. We evaluate our approach on the tasks of sleep apnea detection and sleep apnea severity classification, and observe performance improvement in kappa from 0.02 up-to 0.55.

[1]  Theerawit Wilaiprasitporn,et al.  Single Channel ECG for Obstructive Sleep Apnea Severity Detection Using a Deep Learning Approach , 2018, TENCON 2018 - 2018 IEEE Region 10 Conference.

[2]  Charles Blundell,et al.  Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles , 2016, NIPS.

[3]  A. Pack,et al.  Agreement in the Scoring of Respiratory Events Among International Sleep Centers for Home Sleep Testing. , 2016, Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine.

[4]  S. Ancoli-Israel,et al.  Associations Between Sleep Architecture and Sleep‐Disordered Breathing and Cognition in Older Community‐Dwelling Men: The Osteoporotic Fractures in Men Sleep Study , 2011, Journal of the American Geriatrics Society.

[5]  Dawn M Tilbury,et al.  Evaluating predictions of critical oxygen desaturation events , 2014, Physiological measurement.

[6]  P. Rousseeuw Least Median of Squares Regression , 1984 .

[7]  Simon K. Warfield,et al.  Deep learning with noisy labels: exploring techniques and remedies in medical image analysis , 2020, Medical Image Anal..

[8]  Yale Song,et al.  Learning from Noisy Labels with Distillation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[9]  Yanyao Shen,et al.  Learning with Bad Training Data via Iterative Trimmed Loss Minimization , 2018, ICML.

[10]  Thomas Plagemann,et al.  Augmenting Physiological Time Series Data: A Case Study for Sleep Apnea Detection , 2019, ECML/PKDD.

[11]  Carla E. Brodley,et al.  Identifying Mislabeled Training Data , 1999, J. Artif. Intell. Res..

[12]  Matthew Grissinger Misidentification of Alphanumeric Symbols Plays a Role in Errors. , 2017, P & T : a peer-reviewed journal for formulary management.

[13]  G. Moody,et al.  Development of the polysomnographic database on CD‐ROM , 1999, Psychiatry and clinical neurosciences.

[14]  Jan M. Köhler,et al.  Uncertainty Based Detection and Relabeling of Noisy Image Labels , 2019, CVPR Workshops.

[15]  Shai Shalev-Shwartz,et al.  Decoupling "when to update" from "how to update" , 2017, NIPS.

[16]  Xingrui Yu,et al.  Co-teaching: Robust training of deep neural networks with extremely noisy labels , 2018, NeurIPS.

[17]  Kibok Lee,et al.  Robust Inference via Generative Classifiers for Handling Noisy Labels , 2019, ICML.

[18]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[19]  Deliang Fan,et al.  A Semi-Supervised Two-Stage Approach to Learning from Noisy Labels , 2018, 2018 IEEE Winter Conference on Applications of Computer Vision (WACV).

[20]  Thomas Plagemann,et al.  Data Mining for Patient Friendly Apnea Detection , 2018, IEEE Access.

[21]  Theerawit Wilaiprasitporn,et al.  Deep Neural Networks with Weighted Averaged Overnight Airflow Features for Sleep Apnea-Hypopnea Severity Classification , 2018, TENCON 2018 - 2018 IEEE Region 10 Conference.

[22]  Dumitru Erhan,et al.  Training Deep Neural Networks on Noisy Labels with Bootstrapping , 2014, ICLR.

[23]  Weihong Deng,et al.  Unequal-Training for Deep Face Recognition With Long-Tailed Noisy Data , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Geoffrey E. Hinton,et al.  Distilling the Knowledge in a Neural Network , 2015, ArXiv.

[25]  Parmjit Singh,et al.  The new AASM criteria for scoring hypopneas: impact on the apnea hypopnea index. , 2009, Sleep.

[26]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[27]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[28]  G. Moody,et al.  The apnea-ECG database , 2000, Computers in Cardiology 2000. Vol.27 (Cat. 00CH37163).

[29]  Chi-Wing Fu,et al.  Uncertainty-aware Self-ensembling Model for Semi-supervised 3D Left Atrium Segmentation , 2019, MICCAI.

[30]  Dawn M. Tilbury,et al.  Multi-Step Ahead Predictions for Critical Levels in Physiological Time Series , 2016, IEEE Transactions on Cybernetics.

[31]  Tie-Yan Liu,et al.  LightGBM: A Highly Efficient Gradient Boosting Decision Tree , 2017, NIPS.

[32]  Sergey I. Nikolenko,et al.  Label Denoising with Large Ensembles of Heterogeneous Neural Networks , 2018, ECCV Workshops.

[33]  Li Fei-Fei,et al.  MentorNet: Learning Data-Driven Curriculum for Very Deep Neural Networks on Corrupted Labels , 2017, ICML.

[34]  Andrew Y. Ng,et al.  Reading Digits in Natural Images with Unsupervised Feature Learning , 2011 .

[35]  Xingrui Yu,et al.  How does Disagreement Help Generalization against Label Corruption? , 2019, ICML.

[36]  Ekapol Chuangsuwanich,et al.  Universal Joint Feature Extraction for P300 EEG Classification Using Multi-Task Autoencoder , 2018, IEEE Access.

[37]  Richard Nock,et al.  Making Deep Neural Networks Robust to Label Noise: A Loss Correction Approach , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[38]  Wei Chen,et al.  MetaSleepLearner: A Pilot Study on Fast Adaptation of Bio-Signals-Based Sleep Stage Classifier to New Individual Subject Using Meta-Learning , 2020, IEEE Journal of Biomedical and Health Informatics.

[39]  Hao Chen,et al.  Robust Learning at Noisy Labeled Medical Images: Applied to Skin Lesion Classification , 2019, 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019).

[40]  Jeffrey M. Hausdorff,et al.  Physionet: Components of a New Research Resource for Complex Physiologic Signals". Circu-lation Vol , 2000 .

[41]  Yoshua Bengio,et al.  A Closer Look at Memorization in Deep Networks , 2017, ICML.

[42]  Qi Xie,et al.  Push the Student to Learn Right: Progressive Gradient Correcting by Meta-learner on Corrupted Labels , 2019, ArXiv.

[43]  Yarin Gal,et al.  Uncertainty in Deep Learning , 2016 .

[44]  Guo-Qiang Zhang,et al.  The National Sleep Research Resource: towards a sleep data commons , 2018, BCB.