DoDNet: Learning to Segment Multi-Organ and Tumors from Multiple Partially Labeled Datasets

Due to the intensive cost of labor and expertise in annotating 3D medical images at a voxel level, most benchmark datasets are equipped with the annotations of only one type of organs and/or tumors, resulting in the so-called partially labeling issue. To address this issue, we propose a dynamic on-demand network (DoDNet) that learns to segment multiple organs and tumors on partially labeled datasets. DoD-Net consists of a shared encoder-decoder architecture, a task encoding module, a controller for dynamic filter generation, and a single but dynamic segmentation head. The information of current segmentation task is encoded as a task-aware prior to tell the model what the task is expected to achieve. Different from existing approaches which fix kernels after training, the kernels in dynamic head are generated adaptively by the controller, conditioned on both input image and assigned task. Thus, DoDNet is able to segment multiple organs and tumors, as done by multiple networks or a multi-head network, in a much efficient and flexible manner. We created a large-scale partially labeled dataset called MOTS and demonstrated the superior performance of our DoDNet over other competitors on seven organ and tumor segmentation tasks. We also transferred the weights pre-trained on MOTS to a downstream multi-organ segmentation task and achieved state-of-the-art performance. This study provides a general 3D medical image segmentation model that has been pre-trained on a large-scale partially labeled dataset and can be extended (after fine-tuning) to downstream volumetric medical data segmentation tasks. Code and models are available at: https://git.io/DoDNet

[1]  S. Kevin Zhou,et al.  Marginal loss and exclusion loss for partially supervised multi-organ segmentation , 2020, Medical Image Anal..

[2]  Andreas Nürnberger,et al.  CHAOS Challenge - Combined (CT-MR) Healthy Abdominal Organ Segmentation , 2020, Medical Image Anal..

[3]  Hongwei Li,et al.  Deep learning-enabled multi-organ segmentation in whole-body mouse scans , 2020, Nature Communications.

[4]  Yuanjie Zheng,et al.  Multi-organ Segmentation via Co-training Weight-averaged Models from Few-organ Datasets , 2020, MICCAI.

[5]  Yuxing Tang,et al.  E2Net: An Edge Enhanced Network for Accurate Liver and Tumor Segmentation on CT Scans , 2020, MICCAI.

[6]  Huchuan Lu,et al.  Hierarchical Dynamic Filtering Network for RGB-D Salient Object Detection , 2020, ECCV.

[7]  Quoc V. Le,et al.  Rethinking Pre-training and Self-training , 2020, NeurIPS.

[8]  Guotong Xie,et al.  A Triple-Stage Self-Guided Network for Kidney Tumor Segmentation , 2020, 2020 IEEE 17th International Symposium on Biomedical Imaging (ISBI).

[9]  Yan Wang,et al.  CTumorGAN: a unified framework for automatic computed tomography tumor segmentation , 2020, European Journal of Nuclear Medicine and Molecular Imaging.

[10]  Hao Chen,et al.  Conditional Convolutions for Instance Segmentation , 2020, ECCV.

[11]  Liang Zhang,et al.  Block Level Skip Connections Across Cascaded V-Net for Multi-Organ Segmentation , 2020, IEEE Transactions on Medical Imaging.

[12]  Yan Wang,et al.  Recurrent Saliency Transformation Network for Tiny Target Segmentation in Abdominal CT Scans , 2020, IEEE Transactions on Medical Imaging.

[13]  Xi Fang,et al.  Multi-Organ Segmentation Over Partially Labeled Datasets With Multi-Scale Feature Abstraction , 2020, IEEE Transactions on Medical Imaging.

[14]  Lu Yuan,et al.  Dynamic Convolution: Attention Over Convolution Kernels , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Lei Xing,et al.  Modified U-Net (mU-Net) With Incorporation of Object-Dependent High Level Features for Improved Liver and Liver-Tumor Segmentation in CT Images , 2019, IEEE Transactions on Medical Imaging.

[16]  Yu Qiao,et al.  Dynamic Multi-Scale Filters for Semantic Segmentation , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[17]  Ali Hatamizadeh,et al.  3D Kidneys and Kidney Tumor Semantic Segmentation using Boundary-Aware Networks , 2019, ArXiv.

[18]  Hao Chen,et al.  Light-Weight Hybrid Convolutional Network for Liver Tumor Segmentation , 2019, IJCAI.

[19]  Arie E. Kaufman,et al.  Learning Multi-Class Segmentations From Single-Class Datasets , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Simon A. A. Kohl,et al.  Automated Design of Deep Learning Methods for Biomedical Image Segmentation , 2019 .

[21]  Xinlei Chen,et al.  Prior-Aware Neural Network for Partially-Supervised Multi-Organ Segmentation , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[22]  Quoc V. Le,et al.  CondConv: Conditionally Parameterized Convolutions for Efficient Inference , 2019, NeurIPS.

[23]  Kai Ma,et al.  Med3D: Transfer Learning for 3D Medical Image Analysis , 2019, ArXiv.

[24]  Nikolaos Papanikolopoulos,et al.  The KiTS19 Challenge Data: 300 Kidney Tumor Cases with Clinical Context, CT Semantic Segmentations, and Surgical Outcomes , 2019, ArXiv.

[25]  Ronald M. Summers,et al.  A large annotated medical image dataset for the development and evaluation of segmentation algorithms , 2019, ArXiv.

[26]  Hao Chen,et al.  The Liver Tumor Segmentation Benchmark (LiTS) , 2019, Medical Image Anal..

[27]  Kaiming He,et al.  Rethinking ImageNet Pre-Training , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[28]  Alan L. Yuille,et al.  Multi-Scale Coarse-to-Fine Segmentation for Screening Pancreatic Ductal Adenocarcinoma , 2018, MICCAI.

[29]  Yinghuan Shi,et al.  Crossbar-Net: A Novel Convolutional Neural Network for Kidney Tumor Segmentation in CT Images , 2018, IEEE Transactions on Image Processing.

[30]  Yan Wang,et al.  Abdominal multi-organ segmentation with organ-attention networks and statistical fusion , 2018, Medical Image Anal..

[31]  Kaiming He,et al.  Group Normalization , 2018, International Journal of Computer Vision.

[32]  Yuichiro Hayashi,et al.  A multi-scale pyramid of 3D fully convolutional networks for abdominal multi-organ segmentation , 2018, MICCAI.

[33]  Yann LeCun,et al.  A Closer Look at Spatiotemporal Convolutions for Action Recognition , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[34]  Chi-Wing Fu,et al.  H-DenseUNet: Hybrid Densely Connected UNet for Liver and Tumor Segmentation From CT Volumes , 2018, IEEE Transactions on Medical Imaging.

[35]  Ben Glocker,et al.  DLTK: State of the Art Reference Implementations for Deep Learning on Medical Images , 2017, ArXiv.

[36]  Jia Xu,et al.  Fast Image Processing with Fully-Convolutional Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[37]  Sebastian Thrun,et al.  Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[38]  Luc Van Gool,et al.  Dynamic Filter Networks , 2016, NIPS.

[39]  Sebastian Ramos,et al.  The Cityscapes Dataset for Semantic Urban Scene Understanding , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[40]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[42]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.