Multi-class multimodal semantic segmentation with an improved 3D fully convolutional networks

Abstract Semantic segmentation is an important but challenging task in the field of medical image analysis. Automatic labeling for different anatomical structures can be useful for disease diagnosis, treatment planning and development/degeneration evaluation. However, due to the large shape and appearance variance among different subjects, accurate and reliable semantic segmentation is difficult for automatic labeling and time-consuming for manual labeling. In this paper, we propose a multi-class semantic segmentation algorithm based on the 3D fully convolutional networks, exploring the multimodal image blocks. The use of multimodal image blocks naturally allows an effective data augmentation. Besides, we also investigated the use of ensemble learning and conditional random fields (CRFs) as the post-processing step, which obtains the spatially consistent segmentation results. Our proposed method was evaluated on two public benchmarks: BRATS2017 and MNI-HiSUB25. For both datasets, the experimental results demonstrated that our method was able to establish an effective semantic segmentation framework with multi-modality data and achieve better performance compared to the conventional fully convolutional network.

[1]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[2]  Thomas Brox,et al.  3D U-Net: Learning Dense Volumetric Segmentation from Sparse Annotation , 2016, MICCAI.

[3]  Olga Veksler,et al.  Fast approximate energy minimization via graph cuts , 2001, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[4]  F. Cendes,et al.  Epilepsies associated with hippocampal sclerosis , 2014, Acta Neuropathologica.

[5]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  H. Matsuda,et al.  Automated subfield volumetric analysis of hippocampus in temporal lobe epilepsy using high-resolution T2-weighed MR imaging , 2016, NeuroImage: Clinical.

[7]  Shuo Li,et al.  Glioma Segmentation With a Unified Algorithm in Multimodal MRI Images , 2018, IEEE Access.

[8]  Iasonas Kokkinos,et al.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Mohan S. Kankanhalli,et al.  Multimodal fusion for multimedia analysis: a survey , 2010, Multimedia Systems.

[10]  Xavier Lladó,et al.  Automated sub‐cortical brain structure segmentation combining spatial and deep convolutional features , 2017, Medical Image Anal..

[11]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[12]  Binh P. Nguyen,et al.  Reworking Multilabel Brain Tumor Segmentation: An Automated Framework Using Structured Kernel Sparse Representation , 2017, IEEE Systems, Man, and Cybernetics Magazine.

[13]  Yaozong Gao,et al.  Robust brain ROI segmentation by deformation regression and deformable shape model , 2018, Medical Image Anal..

[14]  Yoram Singer,et al.  Improved Boosting Algorithms Using Confidence-rated Predictions , 1998, COLT' 98.

[15]  Deolinda M. L. D. Rasteiro,et al.  Hierarchical brain tumour segmentation using extremely randomized trees , 2018, Pattern Recognit..

[16]  Brian B. Avants,et al.  The Multimodal Brain Tumor Image Segmentation Benchmark (BRATS) , 2015, IEEE Transactions on Medical Imaging.

[17]  Victor Alves,et al.  Brain Tumor Segmentation Using Convolutional Neural Networks in MRI Images , 2016, IEEE Transactions on Medical Imaging.

[18]  Pierrick Coupé,et al.  HIPS: A new hippocampus subfield segmentation method , 2017, NeuroImage.

[19]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Ross B. Girshick,et al.  Mask R-CNN , 2017, 1703.06870.

[21]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[22]  Andrea Bernasconi,et al.  Multi-contrast submillimetric 3 Tesla hippocampal subfield segmentation protocol and dataset , 2015, Scientific Data.

[23]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.