论文信息 - Contrastive learning of global and local features for medical image segmentation with limited annotations

Contrastive learning of global and local features for medical image segmentation with limited annotations

A key requirement for the success of supervised deep learning is a large labeled dataset - a condition that is difficult to meet in medical image analysis. Self-supervised learning (SSL) can help in this regard by providing a strategy to pre-train a neural network with unlabeled data, followed by fine-tuning for a downstream task with limited annotations. Contrastive learning, a particular variant of SSL, is a powerful technique for learning image-level representations. In this work, we propose strategies for extending the contrastive learning framework for segmentation of volumetric medical images in the semi-supervised setting with limited annotations, by leveraging domain-specific and problem-specific cues. Specifically, we propose (1) novel contrasting strategies that leverage structural similarity across volumetric medical images (domain-specific cue) and (2) a local version of the contrastive loss to learn distinctive representations of local regions that are useful for per-pixel segmentation (problem-specific cue). We carry out an extensive evaluation on three Magnetic Resonance Imaging (MRI) datasets. In the limited annotation setting, the proposed method yields substantial improvements compared to other self-supervision and semi-supervised learning techniques. When combined with a simple data augmentation technique, the proposed method reaches within 8% of benchmark performance using only two labeled MRI volumes for training, corresponding to only 4% (for ACDC) of the training data used to train the benchmark.

[1] Stella X. Yu,et al. Unsupervised Feature Learning via Non-parametric Instance Discrimination , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[2] Xiaojin Zhu,et al. Semi-Supervised Learning Literature Survey , 2005 .

[3] O. Chapelle,et al. Semi-Supervised Learning (Chapelle, O. et al., Eds.; 2006) [Book reviews] , 2009, IEEE Transactions on Neural Networks.

[4] Michael S. Bernstein,et al. ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[5] Brian B. Avants,et al. N4ITK: Improved N3 Bias Correction , 2010, IEEE Transactions on Medical Imaging.

[6] Daniel Rueckert,et al. Multiatlas whole heart segmentation of CT data using conditional entropy for atlas ranking and selection. , 2015, Medical physics.

[7] Patrice Y. Simard,et al. Best practices for convolutional neural networks applied to visual document analysis , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[8] Alexei A. Efros,et al. Split-Brain Autoencoders: Unsupervised Learning by Cross-Channel Prediction , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9] Nikos Komodakis,et al. Unsupervised Representation Learning by Predicting Image Rotations , 2018, ICLR.

[10] Tolga Tasdizen,et al. Regularization With Stochastic Transformations and Perturbations for Deep Semi-Supervised Learning , 2016, NIPS.

[11] Thomas Brox,et al. Discriminative Unsupervised Feature Learning with Convolutional Neural Networks , 2014, NIPS.

[12] Luc Van Gool,et al. The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[13] Oriol Vinyals,et al. Representation Learning with Contrastive Predictive Coding , 2018, ArXiv.

[14] Ben Glocker,et al. Semi-supervised Learning for Network-Based Cardiac MR Image Segmentation , 2017, MICCAI.

[15] Laurens van der Maaten,et al. Self-Supervised Learning of Pretext-Invariant Representations , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[16] Eduardo Valle,et al. Data Augmentation for Skin Lesion Analysis , 2018, OR 2.0/CARE/CLIP/ISIC@MICCAI.

[17] Michael Tschannen,et al. On Mutual Information Maximization for Representation Learning , 2019, ICLR.

[18] Geoffrey E. Hinton,et al. A Simple Framework for Contrastive Learning of Visual Representations , 2020, ICML.

[19] Kaiming He,et al. Momentum Contrast for Unsupervised Visual Representation Learning , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[20] Liang Chen,et al. GAN Augmentation: Augmenting Training Data using Generative Adversarial Networks , 2018, ArXiv.

[21] Alexei A. Efros,et al. Colorful Image Colorization , 2016, ECCV.

[22] Nir Ailon,et al. Deep Metric Learning Using Triplet Network , 2014, SIMBAD.

[23] Yoshua Bengio,et al. Learning deep representations by mutual information estimation and maximization , 2018, ICLR.

[24] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[25] Xiaojin Zhu,et al. Semi-Supervised Learning , 2010, Encyclopedia of Machine Learning.

[26] Konstantinos Kamnitsas,et al. DeepMedic for Brain Tumor Segmentation , 2016, BrainLes@MICCAI.

[27] Xiahai Zhuang,et al. Challenges and methodologies of fully automatic whole heart segmentation: a review. , 2013, Journal of healthcare engineering.

[28] R Devon Hjelm,et al. Learning Representations by Maximizing Mutual Information Across Views , 2019, NeurIPS.

[29] Alexei A. Efros,et al. Unsupervised Visual Representation Learning by Context Prediction , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[30] Xiaojin Zhu,et al. Introduction to Semi-Supervised Learning , 2009, Synthesis Lectures on Artificial Intelligence and Machine Learning.

[31] David Yarowsky,et al. Unsupervised Word Sense Disambiguation Rivaling Supervised Methods , 1995, ACL.

[32] Andrew Zisserman,et al. Multi-task Self-Supervised Visual Learning , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[33] Max A. Viergever,et al. elastix: A Toolbox for Intensity-Based Medical Image Registration , 2010, IEEE Transactions on Medical Imaging.

[34] Sotirios A. Tsaftaris,et al. Medical Image Computing and Computer Assisted Intervention , 2017 .

[35] Hyunjin Park,et al. Convolutional neural network classifier for distinguishing Barrett's esophagus and neoplasia endomicroscopy images , 2017, 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[36] Alexei A. Efros,et al. Context Encoders: Feature Learning by Inpainting , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[37] Hongyi Zhang,et al. mixup: Beyond Empirical Risk Minimization , 2017, ICLR.

[38] Ender Konukoglu,et al. Semi-Supervised and Task-Driven Data Augmentation , 2019, IPMI.

[39] Shin Ishii,et al. Virtual Adversarial Training: A Regularization Method for Supervised and Semi-Supervised Learning , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[40] Nima Tajbakhsh,et al. Surrogate Supervision for Medical Image Analysis: Effective Deep Learning From Limited Quantities of Labeled Data , 2019, 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019).

[41] Alex Krizhevsky,et al. Learning Multiple Layers of Features from Tiny Images , 2009 .

[42] Jeffrey L. Gunter,et al. Medical Image Synthesis for Data Augmentation and Anonymization using Generative Adversarial Networks , 2018, SASHIMI@MICCAI.

[43] Phillip Isola,et al. Contrastive Multiview Coding , 2019, ECCV.

[44] S. Gelly,et al. Self-Supervised Learning of Video-Induced Visual Invariances , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[45] Tapani Raiko,et al. Semi-supervised Learning with Ladder Networks , 2015, NIPS.

[46] Liang Chen,et al. Self-supervised learning for medical image analysis using image context restoration , 2019, Medical Image Anal..

[47] Konstantinos Kamnitsas,et al. Efficient multi‐scale 3D CNN with fully connected CRF for accurate brain lesion segmentation , 2016, Medical Image Anal..

[48] Sebastian Ramos,et al. The Cityscapes Dataset for Semantic Urban Scene Understanding , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[49] Eugene Charniak,et al. Effective Self-Training for Parsing , 2006, NAACL.

[50] Daniel Rueckert,et al. Self-Supervised Learning for Cardiac MR Image Segmentation by Anatomical Position Prediction , 2019, MICCAI.

[51] Timo Aila,et al. Temporal Ensembling for Semi-Supervised Learning , 2016, ICLR.

[52] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[53] Thomas Brox,et al. U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[54] Yoshua Bengio,et al. Semi-supervised Learning by Entropy Minimization , 2004, CAP.

[55] Kaiming He,et al. Improved Baselines with Momentum Contrastive Learning , 2020, ArXiv.

[56] Ali Razavi,et al. Data-Efficient Image Recognition with Contrastive Predictive Coding , 2019, ICML.

[57] Xiahai Zhuang,et al. Multi-scale patch and multi-modality atlases for whole heart segmentation of MRI , 2016, Medical Image Anal..

[58] Sébastien Ourselin,et al. A Registration-Based Propagation Framework for Automatic Whole Heart Segmentation of Cardiac MRI , 2010, IEEE Transactions on Medical Imaging.

[59] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[60] Yujiu Yang,et al. Self-supervised Feature Learning for 3D Medical Images by Playing a Rubik's Cube , 2019, MICCAI.

[61] Yann LeCun,et al. Dimensionality Reduction by Learning an Invariant Mapping , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[62] Luca Maria Gambardella,et al. High-Performance Neural Networks for Visual Object Classification , 2011, ArXiv.

[63] M. Jorge Cardoso,et al. Improving Data Augmentation for Medical Image Segmentation , 2018 .

[64] Lin Yang,et al. Deep Adversarial Networks for Biomedical Image Segmentation Utilizing Unannotated Images , 2017, MICCAI.

[65] Xin Yang,et al. Deep Learning Techniques for Automatic MRI Cardiac Multi-Structures Segmentation and Diagnosis: Is the Problem Solved? , 2018, IEEE Transactions on Medical Imaging.

[66] Seyed-Ahmad Ahmadi,et al. V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation , 2016, 2016 Fourth International Conference on 3D Vision (3DV).

[67] Harri Valpola,et al. Weight-averaged consistency targets improve semi-supervised deep learning results , 2017, ArXiv.

[68] Ana Maria Mendonça,et al. End-to-End Adversarial Retinal Image Synthesis , 2018, IEEE Transactions on Medical Imaging.

[69] Paolo Favaro,et al. Unsupervised Learning of Visual Representations by Solving Jigsaw Puzzles , 2016, ECCV.

[70] Andrew Zisserman,et al. Self-supervised Learning for Spinal MRIs , 2017, DLMIA/ML-CDS@MICCAI.

[71] Max Welling,et al. Semi-supervised Learning with Deep Generative Models , 2014, NIPS.

[72] Frédo Durand,et al. Data augmentation using learned transforms for one-shot medical image segmentation , 2019, ArXiv.

[73] Dong-Hyun Lee,et al. Pseudo-Label : The Simple and Efficient Semi-Supervised Learning Method for Deep Neural Networks , 2013 .

[74] Yefeng Zheng,et al. Self supervised deep representation learning for fine-grained body part recognition , 2017, 2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017).