Planar 3D Transfer Learning for End to End Unimodal MRI Unbalanced Data Segmentation

We present a novel approach to 2D-to-3D transfer learning based on mapping pre-trained 2D convolutional neural network weights into planar 3D kernels. The method is validated with the proposed planar 3D res-u-net network, whose encoder is transferred from the 2D VGG-16 and which is applied to single-stage segmentation of unbalanced 3D image data. In particular, we evaluate the method on the MICCAI 2016 MS lesion segmentation challenge dataset, using solely the fluid-attenuated inversion recovery (FLAIR) sequence without brain extraction for training and inference to simulate real medical practice. The planar 3D res-u-net network achieved the best sensitivity and Dice score among end-to-end methods processing raw MRI scans, and a Dice score comparable to that of a state-of-the-art unimodal approach that is not end-to-end. The complete source code was released under an open-source license, and this paper complies with the Machine Learning Reproducibility Checklist. By implementing practical transfer learning for 3D data representation, we were able to segment heavily unbalanced data without selective sampling and achieved more reliable results using less training data in a single modality. From a medical perspective, the unimodal approach is advantageous in real practice because it requires neither co-registration nor additional scanning time during an examination. Although modern medical imaging methods capture high-resolution 3D anatomy scans suitable for processing by computer-aided detection systems, the deployment of automatic systems for interpreting radiological images remains largely theoretical in many medical areas. Our work aims to bridge this gap by addressing part of the underlying research questions.
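The weight mapping described above can be illustrated with a minimal NumPy sketch. It assumes that a "planar" 3D kernel is a 2D kernel given an extent of 1 along one spatial axis; the function names, the weight layout `(out_channels, in_channels, kH, kW)`, and the random weights standing in for pre-trained VGG-16 filters are all illustrative assumptions, not the released implementation.

```python
import numpy as np


def planar_3d_from_2d(w2d, axis=0):
    """Map 2D conv weights (out_ch, in_ch, kH, kW) to planar 3D weights.

    Assumption: "planar" means the 3D kernel has size 1 along one
    spatial axis. `axis` (0, 1 or 2) selects that singleton axis.
    """
    if w2d.ndim != 4:
        raise ValueError("expected weights of shape (out_ch, in_ch, kH, kW)")
    if axis not in (0, 1, 2):
        raise ValueError("axis must be 0, 1 or 2")
    # Insert a singleton spatial axis after the two channel axes.
    return np.expand_dims(w2d, axis=2 + axis)


def correlate_valid(x, k):
    """Naive 'valid' cross-correlation for arrays of equal rank."""
    out_shape = tuple(xs - ks + 1 for xs, ks in zip(x.shape, k.shape))
    out = np.empty(out_shape, dtype=np.result_type(x, k))
    for idx in np.ndindex(*out_shape):
        window = x[tuple(slice(i, i + ks) for i, ks in zip(idx, k.shape))]
        out[idx] = np.sum(window * k)
    return out


rng = np.random.default_rng(0)
w2d = rng.standard_normal((4, 3, 3, 3))   # stand-in for pre-trained 2D filters
w3d = planar_3d_from_2d(w2d, axis=0)      # shape (4, 3, 1, 3, 3)

volume = rng.standard_normal((5, 8, 8))   # toy single-channel 3D volume
out_3d = correlate_valid(volume, w3d[0, 0])
out_slices = np.stack([correlate_valid(s, w2d[0, 0]) for s in volume])

# The planar 3D kernel reproduces the 2D filter applied slice by slice.
same = np.allclose(out_3d, out_slices)
```

Under this assumption, the transferred layer initially behaves like the 2D filter applied to each slice independently, so the pre-trained features carry over unchanged while the network itself operates on full 3D volumes.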
