Semi-Supervised Learning for Sparsely-Labeled Sequential Data: Application to Healthcare Video Processing

Labeled data is a critical resource for training and evaluating machine learning models. However, many real-life datasets are only partially labeled. We propose a semi-supervised machine learning training strategy to improve event detection performance on sequential data, such as video recordings, when only sparse labels are available, such as event start times without their corresponding end times. Our method uses noisy guesses of the events’ end times to train event detection models. Depending on how conservative these guesses are, mislabeled samples may be introduced into the training set. We further propose a mathematical model for explaining and estimating the evolution of the classification performance for increasingly noisier end time estimates. We show that neural networks can improve their detection performance by leveraging more training data with less conservative approximations despite the higher proportion of incorrect labels. We adapt sequential versions of CIFAR-10 and MNIST, and use the Berkeley MHAD and HMBD51 video datasets to empirically evaluate our method, and find that our risk-tolerant strategy outperforms conservative estimates by 3.5 points of mean average precision for CIFAR, 30 points for MNIST, 3 points for MHAD, and 14 points for HMBD51. Then, we leverage the proposed training strategy to tackle a real-life application: processing continuous video recordings of epilepsy patients, and show that our method outperforms baseline labeling methods by 17 points of average precision, and reaches a classification performance similar to that of fully supervised models. We share part of the code for this article at the following repository: fpgdubost/CIFAR-10-Sparsely-Labeled-Sequential-Data.

[1]  Saeid Asgari Taghanaki,et al.  Robust Representation Learning via Perceptual Similarity Metrics , 2021, ICML.

[2]  Andreas Nürnberger,et al.  DS6: Deformation-aware learning for small vessel segmentation with small, imperfectly labeled dataset , 2020, ArXiv.

[3]  Jonathan Tompson,et al.  Counting Out Time: Class Agnostic Video Repetition Counting in the Wild , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Jared A. Dunnmon,et al.  Weak supervision as an efficient approach for automated seizure detection in electroencephalography , 2020, npj Digital Medicine.

[5]  Hongcheng Wang,et al.  VideoSSL: Semi-Supervised Learning for Video Classification , 2020, 2021 IEEE Winter Conference on Applications of Computer Vision (WACV).

[6]  Shandong Wu,et al.  Inaccurate Labels in Weakly-Supervised Deep Learning: Automatic Identification and Correction and Their Impact on Classification Performance , 2020, IEEE Journal of Biomedical and Health Informatics.

[7]  Geoffrey E. Hinton,et al.  A Simple Framework for Contrastive Learning of Visual Representations , 2020, ICML.

[8]  S. Warfield,et al.  Deep learning with noisy labels: exploring techniques and remedies in medical image analysis , 2019, Medical Image Anal..

[9]  Ross B. Girshick,et al.  Momentum Contrast for Unsupervised Visual Representation Learning , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Marleen de Bruijne,et al.  Semi-supervised Medical Image Segmentation via Learning Consistency Under Transformations , 2019, MICCAI.

[11]  Olaf Booij,et al.  Exploiting Temporality for Semi-Supervised Video Segmentation , 2019, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW).

[12]  David Berthelot,et al.  MixMatch: A Holistic Approach to Semi-Supervised Learning , 2019, NeurIPS.

[13]  Yann LeCun,et al.  A Closer Look at Spatiotemporal Convolutions for Action Recognition , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[14]  Fabio Viola,et al.  The Kinetics Human Action Video Dataset , 2017, ArXiv.

[15]  Marleen de Bruijne,et al.  Early Experiences with Crowdsourcing Airway Annotations in Chest CT , 2016, LABELS/DLMIA@MICCAI.

[16]  Sebastian Ramos,et al.  The Cityscapes Dataset for Semantic Urban Scene Understanding , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  David A. Forsyth,et al.  Representation Learning , 2015, Computer.

[19]  P. Elliott,et al.  UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age , 2015, PLoS medicine.

[20]  K. Abdel-Aziz,et al.  Automatic Epileptic Seizure Detection Using Scalp EEG and Advanced Artificial Intelligence Techniques , 2015, BioMed research international.

[21]  Thomas Brox,et al.  Striving for Simplicity: The All Convolutional Net , 2014, ICLR.

[22]  Lena Maier-Hein,et al.  Can Masses of Non-Experts Train Highly Accurate Image Classifiers? - A Crowdsourcing Approach to Instrument Segmentation in Laparoscopic Images , 2014, MICCAI.

[23]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[24]  Matthew D. Zeiler ADADELTA: An Adaptive Learning Rate Method , 2012, ArXiv.

[25]  Qing Zeng-Treitler,et al.  Predicting sample size required for classification performance , 2012, BMC Medical Informatics and Decision Making.

[26]  Thomas Serre,et al.  HMDB: A large video database for human motion recognition , 2011, 2011 International Conference on Computer Vision.

[27]  Ursula Gather,et al.  Collection of annotated data in a clinical validation study for alarm algorithms in intensive care--a methodologic framework. , 2010, Journal of critical care.

[28]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[29]  Yoshua Bengio,et al.  Semi-supervised Learning by Entropy Minimization , 2004, CAP.

[30]  Carla E. Brodley,et al.  Identifying Mislabeled Training Data , 1999, J. Artif. Intell. Res..

[31]  Zhi-Hua Zhou,et al.  A brief introduction to weakly supervised learning , 2018 .

[32]  Dong-Hyun Lee,et al.  Pseudo-Label : The Simple and Efficient Semi-Supervised Learning Method for Deep Neural Networks , 2013 .

[33]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .