Few-shot brain segmentation from weakly labeled data with deep heteroscedastic multi-task networks

In supervised learning for medical image segmentation, the need for large amounts of labeled data typically goes unquestioned. In brain anatomy segmentation in particular, hundreds or thousands of weakly labeled volumes are often used as training data. In this paper, we first observe that for many brain structures, a small number of training examples (n=9), weakly labeled using FreeSurfer 6.0, plus simple data augmentation, suffice to achieve high performance: an overall mean Dice coefficient of $0.84 \pm 0.12$ relative to FreeSurfer over 28 brain structures in T1-weighted images of $\approx 4000$ 9-10 year-olds from the Adolescent Brain Cognitive Development study. We then examine two varieties of heteroscedastic network as a means of improving classification results. An existing proposal by Kendall and Gal, which uses Monte-Carlo inference to learn to predict the variance of each prediction, yielded an overall mean Dice of $0.85 \pm 0.14$ and showed statistically significant improvements for 25 of the brain structures. Meanwhile, a novel heteroscedastic network that directly learns the probability that an example has been mislabeled yielded an overall mean Dice of $0.87 \pm 0.11$ and showed statistically significant improvements for all but one of the brain structures considered. The loss function associated with this network can be interpreted as a form of learned label smoothing, in which labels are smoothed only where they are judged to be uncertain.
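To make the two quantities in the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. The first function computes the per-structure Dice coefficient $2|A \cap B| / (|A| + |B|)$ between a predicted label map and a reference label map (here FreeSurfer would play the role of the reference). The second illustrates one possible reading of "learned label smoothing": a binary cross-entropy in which each label is moved toward the opposite class by a per-voxel mislabeling probability `q`. The function names, the binary setting, and the exact form of the smoothed target are assumptions made for illustration; the abstract does not specify the loss, and the sketch does not show how `q` itself would be trained.

```python
import numpy as np


def dice_per_structure(pred, ref, labels):
    """Dice coefficient 2|A∩B| / (|A| + |B|) for each label in `labels`.

    pred, ref: integer label maps of identical shape.
    Returns a dict mapping label -> Dice (NaN if the label is absent from both).
    """
    scores = {}
    for label in labels:
        a = pred == label
        b = ref == label
        denom = a.sum() + b.sum()
        if denom == 0:
            # Structure absent from both maps: Dice is undefined.
            scores[label] = float("nan")
        else:
            scores[label] = 2.0 * np.logical_and(a, b).sum() / denom
    return scores


def smoothed_bce(p, q, y, eps=1e-7):
    """Binary cross-entropy against labels smoothed by a per-voxel probability.

    p: predicted foreground probability per voxel.
    q: (assumed) predicted probability that the given label y is wrong.
    y: weak binary labels (0 or 1).
    The target (1 - q) * y + q * (1 - y) equals y where q = 0, so labels are
    only smoothed where they are judged uncertain.
    """
    p = np.clip(p, eps, 1.0 - eps)
    target = (1.0 - q) * y + q * (1.0 - y)
    return float(-(target * np.log(p) + (1.0 - target) * np.log(1.0 - p)).mean())


# Toy usage on a 2x3 label map with two structures (labels 1 and 2).
pred = np.array([[1, 1, 0], [2, 2, 0]])
ref = np.array([[1, 0, 0], [2, 2, 2]])
print(dice_per_structure(pred, ref, labels=[1, 2]))
```

With `q` set to zero everywhere, `smoothed_bce` reduces to the ordinary binary cross-entropy, which is the sense in which the smoothing is selective rather than uniform.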

[1] Anders M. Dale, et al. The Adolescent Brain Cognitive Development (ABCD) study: Imaging acquisition across 21 sites, 2018, Developmental Cognitive Neuroscience.

[2] A. Weigend, et al. Estimating the mean and variance of the target probability distribution, 1994, Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94).

[3] Doina Precup, et al. Exploring Uncertainty Measures in Deep Networks for Multiple Sclerosis Lesion Detection and Segmentation, 2018, MICCAI.

[4] Satrajit S. Ghosh, et al. Knowing what you know in brain segmentation using deep neural networks, 2018, ArXiv.

[5] Nassir Navab, et al. QuickNAT: A fully convolutional network for quick and accurate segmentation of neuroanatomy, 2018, NeuroImage.

[6] M. Jorge Cardoso, et al. Quality control in radiotherapy-treatment planning using multi-task learning and uncertainty estimation, 2018.

[7] Ghassan Hamarneh, et al. Uncertainty Driven Multi-loss Fully Convolutional Networks for Histopathology, 2017, CVII-STENT/LABELS@MICCAI.

[8] Mauricio Reyes, et al. Uncertainty-driven Sanity Check: Application to Postoperative Brain Tumor Cavity Segmentation, 2018, ArXiv.

[9] Ben Glocker, et al. NeuroNet: Fast and Robust Reproduction of Multiple Brain Image Segmentation Pipelines, 2018, ArXiv.

[10] Kaiming He, et al. Focal Loss for Dense Object Detection, 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[11] Sébastien Ourselin, et al. Aleatoric uncertainty estimation with test-time augmentation for medical image segmentation with convolutional neural networks, 2018, Neurocomputing.

[12] Alex Kendall, et al. What Uncertainties Do We Need in Bayesian Deep Learning for Computer Vision?, 2017, NIPS.

[13] N. Volkow, et al. The conception of the ABCD study: From substance use to a broad NIH collaboration, 2017, Developmental Cognitive Neuroscience.

[14] Mauricio Reyes, et al. On the Effect of Inter-observer Variability for a Reliable Estimation of Uncertainty of Medical Image Segmentation, 2018, MICCAI.

[15] Nassir Navab, et al. Inherent Brain Segmentation Quality Control from Fully ConvNet Monte Carlo Sampling, 2018, MICCAI.

[16] Sergey Ioffe, et al. Rethinking the Inception Architecture for Computer Vision, 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17] Frank Hutter, et al. SGDR: Stochastic Gradient Descent with Warm Restarts, 2016, ICLR.