Male pelvic multi-organ segmentation using V-transformer network

Automatic multi-organ segmentation is a cost-effective tool for generating organ contours using computed tomography (CT) images. This work proposes a deep-learning algorithm for multi-organ (bladder, prostate, rectum, left and right femoral heads) segmentation in pelvic CT images for prostate radiation treatment planning. We propose an encoder-decoder network with a V-net backbone for local feature extraction and contour reconstruction. Novel to our network, we utilize a token-based transformer, which encourages long-range dependency, to forward more informative high-resolution feature maps from the encoder to the decoder. In addition, a knowledge distillation strategy was applied to improve the network’s generalization. We evaluate the network using a dataset collected from 50 patients with prostate cancer. A quantitative evaluation of the proposed network’s performance was performed on each organ based on: 1) volume similarity between the segmented contours and ground truth using Dice score, segmentation sensitivity, precision, and absolute percentage volume difference (AVD), 2) surface similarity evaluated by Hausdorff distance (HD), mean surface distance (MSD) and residual mean square distance (RMSD). The performance was then evaluated against other state-of-art methods. The average volume similarities achieved by the network over all organs were: Dice score = 0.83, sensitivity = 0.84, and precision = 0.83; the average surface similarities were HD = 5.77mm, MSD = 0.93mm, RMSD = 2.77mm, and AVD =12.85%. The proposed methods performed significantly better than competing methods in most evaluation metrics. The proposed network may be a promising segmentation approach for use in routine prostate radiation treatment planning.

[1]  Matthieu Cord,et al.  Training data-efficient image transformers & distillation through attention , 2020, ICML.

[2]  S. Gelly,et al.  An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale , 2020, ICLR.

[3]  Kurt Keutzer,et al.  Visual Transformers: Token-based Image Representation and Processing for Computer Vision , 2020, ArXiv.

[4]  Yabo Fu,et al.  Deep Learning in Multi-organ Segmentation , 2020, ArXiv.

[5]  Yang Lei,et al.  Male pelvic multi-organ segmentation aided by CBCT-based synthetic MRI , 2019, Physics in medicine and biology.

[6]  Yang Lei,et al.  CT Prostate Segmentation Based on Synthetic MRI-aided Deep Attention Fully Convolution Network. , 2019, Medical physics.

[7]  Yang Lei,et al.  Synthetic MRI-aided multi-organ segmentation on male pelvic CT using cycle consistent deep attention network. , 2019, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[8]  Martin Torriani,et al.  Deep learning for automated segmentation of pelvic muscles, fat, and bone from CT studies for body composition assessment , 2019, Skeletal Radiology.

[9]  Yang Lei,et al.  Ultrasound prostate segmentation based on multidirectional deeply supervised V-Net. , 2019, Medical physics.

[10]  Tian Liu,et al.  Deeply supervised 3D fully convolutional networks with group dilated convolution for automatic MRI prostate segmentation , 2019, Medical physics.

[11]  Steve B. Jiang,et al.  Fully automated organ segmentation in male pelvic CT images , 2018, Physics in medicine and biology.

[12]  Loïc Le Folgoc,et al.  Attention U-Net: Learning Where to Look for the Pancreas , 2018, ArXiv.

[13]  Steve B. Jiang,et al.  Segmentation of the prostate and organs at risk in male pelvic CT images using deep learning , 2018, Biomedical Physics & Engineering Express.

[14]  Hongyi Zhang,et al.  mixup: Beyond Empirical Risk Minimization , 2017, ICLR.

[15]  Syed Muhammad Anwar,et al.  Medical Image Analysis using Convolutional Neural Networks: A Review , 2017, Journal of Medical Systems.

[16]  Raquel Urtasun,et al.  Understanding the Effective Receptive Field in Deep Convolutional Neural Networks , 2016, NIPS.

[17]  Andrea Vedaldi,et al.  Instance Normalization: The Missing Ingredient for Fast Stylization , 2016, ArXiv.

[18]  Seyed-Ahmad Ahmadi,et al.  V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation , 2016, 2016 Fourth International Conference on 3D Vision (3DV).

[19]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[21]  Xiaofeng Yang,et al.  Prostate CT segmentation method based on nonrigid registration in ultrasound-guided CT-based HDR prostate brachytherapy. , 2014, Medical physics.

[22]  Tian Liu,et al.  Pelvic multi-organ segmentation on cone-beam CT for prostate adaptive radiotherapy , 2020 .