Flexible Fusion Network for Multi-Modal Brain Tumor Segmentation

Automated brain tumor segmentation is crucial for aiding brain disease diagnosis and evaluating disease progress. Currently, magnetic resonance imaging (MRI) is a routinely adopted approach in the field of brain tumor segmentation that can provide different modality images. It is critical to leverage multi-modal images to boost brain tumor segmentation performance. Existing works commonly concentrate on generating a shared representation by fusing multi-modal data, while few methods take into account modality-specific characteristics. Besides, how to efficiently fuse arbitrary numbers of modalities is still a difficult task. In this study, we present a flexible fusion network (termed F<inline-formula><tex-math notation="LaTeX">$^{2}$</tex-math></inline-formula>Net) for multi-modal brain tumor segmentation, which can flexibly fuse arbitrary numbers of multi-modal information to explore complementary information while maintaining the specific characteristics of each modality. Our F<inline-formula><tex-math notation="LaTeX">$^{2}$</tex-math></inline-formula>Net is based on the encoder-decoder structure, which utilizes two Transformer-based feature learning streams and a cross-modal shared learning network to extract individual and shared feature representations. To effectively integrate the knowledge from the multi-modality data, we propose a cross-modal feature-enhanced module (CFM) and a multi-modal collaboration module (MCM), which aims at fusing the multi-modal features into the shared learning network and incorporating the features from encoders into the shared decoder, respectively. Extensive experimental results on multiple benchmark datasets demonstrate the effectiveness of our F<inline-formula><tex-math notation="LaTeX">$^{2}$</tex-math></inline-formula>Net over other state-of-the-art segmentation methods.

[1]  Chen Gong,et al.  Cross-level Feature Aggregation Network for Polyp Segmentation , 2023, Pattern Recognit..

[2]  E. Song,et al.  A 3D Cross-Modality Feature Interaction Network With Volumetric Feature Alignment for Brain Tumor and Tissue Segmentation , 2022, IEEE Journal of Biomedical and Health Informatics.

[3]  Lequan Yu,et al.  NestedFormer: Nested Modality-Aware Transformer for Brain Tumor Segmentation , 2022, MICCAI.

[4]  Xia Li,et al.  TranSiam: Fusing Multimodal Visual Features Using Transformer for Medical Image Segmentation , 2022, arXiv.org.

[5]  Jiliu Zhou,et al.  Semi-supervised medical image segmentation via a tripled-uncertainty guided mean teacher model with contrastive learning , 2022, Medical Image Anal..

[6]  Jiliu Zhou,et al.  Unified medical image segmentation by learning from uncertainty in an end-to-end manner , 2022, Knowl. Based Syst..

[7]  Kim-Kwang Raymond Choo,et al.  MVFusFra: A Multi-View Dynamic Fusion Framework for Multimodal Brain Tumor Segmentation , 2021, IEEE Journal of Biomedical and Health Informatics.

[8]  Q. Dou,et al.  Learning With Privileged Multimodal Knowledge for Unimodal Segmentation , 2021, IEEE Transactions on Medical Imaging.

[9]  Osmar R. Zaïane,et al.  UCTransNet: Rethinking the Skip Connections in U-Net from a Channel-wise Perspective with Transformer , 2021, AAAI.

[10]  Yazhou Yao,et al.  Self-Supervised Multi-Modal Hybrid Fusion Network for Brain Tumor Segmentation , 2021, IEEE Journal of Biomedical and Health Informatics.

[11]  Deng-Ping Fan,et al.  Specificity-preserving RGB-D saliency detection , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[12]  Zhongchao Shi,et al.  Modality-aware Mutual Learning for Multi-modal Medical Image Segmentation , 2021, MICCAI.

[13]  H. Fu,et al.  Multimodal Transformer for Accelerated MR Imaging , 2021, IEEE Transactions on Medical Imaging.

[14]  P. Luo,et al.  PVT v2: Improved baselines with Pyramid Vision Transformer , 2021, Computational Visual Media.

[15]  Qi Tian,et al.  Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation , 2021, ECCV Workshops.

[16]  Andreas Geiger,et al.  Multi-Modal Fusion Transformer for End-to-End Autonomous Driving , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Stéphane Canu,et al.  Latent Correlation Representation Learning for Brain Tumor Segmentation With Missing MRI Modalities , 2021, IEEE Transactions on Image Processing.

[18]  Daguang Xu,et al.  UNETR: Transformers for 3D Medical Image Segmentation , 2021, 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).

[19]  Wenxuan Wang,et al.  TransBTS: Multimodal Brain Tumor Segmentation Using Transformer , 2021, MICCAI.

[20]  Xiang Li,et al.  Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[21]  Yan Wang,et al.  TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation , 2021, ArXiv.

[22]  Jungong Han,et al.  Cross-modality deep feature learning for brain tumor segmentation , 2021, Pattern Recognit..

[23]  Fahad Shahbaz Khan,et al.  Transformers in Vision: A Survey , 2021, ACM Comput. Surv..

[24]  Matthieu Cord,et al.  Training data-efficient image transformers & distillation through attention , 2020, ICML.

[25]  S. Gelly,et al.  An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale , 2020, ICLR.

[26]  Ling Shao,et al.  Bifurcated Backbone Strategy for RGB-D Salient Object Detection , 2020, IEEE Transactions on Image Processing.

[27]  Jialin Peng,et al.  HDC-Net: Hierarchical Decoupled Convolution Network for Brain Tumor Segmentation , 2020, IEEE Journal of Biomedical and Health Informatics.

[28]  Ling Shao,et al.  Hi-Net: Hybrid-Fusion Network for Multi-Modal MR Image Synthesis , 2020, IEEE Transactions on Medical Imaging.

[29]  Jianming Liang,et al.  UNet++: Redesigning Skip Connections to Exploit Multiscale Features in Image Segmentation , 2019, IEEE Transactions on Medical Imaging.

[30]  Hao Chen,et al.  Robust Multimodal Brain Tumor Segmentation via Feature Disentanglement and Gated Fusion , 2019, MICCAI.

[31]  Mehrdad Noori,et al.  Attention-Guided Version of 2D UNet for Automatic Brain Tumor Segmentation , 2019, 2019 9th International Conference on Computer and Knowledge Engineering (ICCKE).

[32]  Muhammad Imran,et al.  Efficient Brain Tumor Segmentation With Multiscale Two-Pathway-Group Conventional Neural Networks , 2019, IEEE Journal of Biomedical and Health Informatics.

[33]  Anuj Bhardwaj,et al.  A review on brain tumor segmentation of MRI images. , 2019, Magnetic resonance imaging.

[34]  Zhiming Luo,et al.  Weighted Res-UNet for High-Quality Retina Vessel Segmentation , 2018, 2018 9th International Conference on Information Technology in Medicine and Education (ITME).

[35]  Jose Dolz,et al.  IVD-Net: Intervertebral disc localization and segmentation in MRI with a multi-modal UNet , 2018, CSI@MICCAI.

[36]  Loïc Le Folgoc,et al.  Attention U-Net: Learning Where to Look for the Pancreas , 2018, ArXiv.

[37]  Jing Yuan,et al.  HyperDense-Net: A Hyper-Densely Connected CNN for Multi-Modal Image Segmentation , 2018, IEEE Transactions on Medical Imaging.

[38]  Gongning Luo,et al.  Concatenated and Connected Random Forests With Multiscale Patch Driven Active Contour Model for Automated Brain Tumor Segmentation of MR Images , 2018, IEEE Transactions on Medical Imaging.

[39]  Chi-Wing Fu,et al.  H-DenseUNet: Hybrid Densely Connected UNet for Liver and Tumor Segmentation From CT Volumes , 2018, IEEE Transactions on Medical Imaging.

[40]  Mazhar Shaikh,et al.  Brain Tumor Segmentation Using Dense Fully Convolutional Neural Network , 2017, BrainLes@MICCAI.

[41]  LinLin Shen,et al.  Deep Learning Based Multimodal Brain Tumor Diagnosis , 2017, BrainLes@MICCAI.

[42]  Christoph Meinel,et al.  Conditional Adversarial Network for Semantic Segmentation of Brain Tumor , 2017, ArXiv.

[43]  George Papandreou,et al.  Rethinking Atrous Convolution for Semantic Image Segmentation , 2017, ArXiv.

[44]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[45]  Seyed-Ahmad Ahmadi,et al.  V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation , 2016, 2016 Fourth International Conference on 3D Vision (3DV).

[46]  Yaozong Gao,et al.  Fully convolutional networks for multi-modality isointense infant brain image segmentation , 2016, 2016 IEEE 13th International Symposium on Biomedical Imaging (ISBI).

[47]  Victor Alves,et al.  Brain Tumor Segmentation Using Convolutional Neural Networks in MRI Images , 2016, IEEE Transactions on Medical Imaging.

[48]  Brian B. Avants,et al.  The Multimodal Brain Tumor Image Segmentation Benchmark (BRATS) , 2015, IEEE Transactions on Medical Imaging.

[49]  Pheng-Ann Heng,et al.  Automatic Brain Tumor Segmentation from MR Images via a Multimodal Sparse Coding Based Probabilistic Model , 2015, 2015 International Workshop on Pattern Recognition in NeuroImaging.

[50]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[51]  Nicholas Ayache,et al.  A Generative Model for Brain Tumor Segmentation in Multi-Modal Images , 2010, MICCAI.

[52]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[53]  Alan L. Yuille,et al.  Efficient Multilevel Brain Tumor Segmentation With Integrated Bayesian Model Classification , 2008, IEEE Transactions on Medical Imaging.

[54]  Cynthia Hawkins,et al.  Identification of a cancer stem cell in human brain tumors. , 2003, Cancer research.

[55]  et al.,et al.  ISLES 2015 ‐ A public evaluation benchmark for ischemic stroke lesion segmentation from multispectral MRI , 2017, Medical Image Anal..