Disentangle, align and fuse for multimodal and zero-shot image segmentation

Magnetic resonance (MR) protocols rely on several sequences to properly assess pathology and organ status. Yet, despite advances in image analysis we tend to treat each sequence, here termed modality, in isolation. Taking advantage of the information shared between modalities (largely an organ's anatomy) is beneficial for multi-modality multi-input processing and learning. However, we must overcome inherent anatomical misregistrations and disparities in signal intensity across the modalities to claim this benefit. We present a method that offers improved segmentation accuracy of the modality of interest (over a single input model), by learning to leverage information present in other modalities, enabling semi-supervised and zero shot learning. Core to our method is learning a disentangled decomposition into anatomical and imaging factors. Shared anatomical factors from the different inputs are jointly processed and fused to extract more accurate segmentation masks. Image misregistrations are corrected with a Spatial Transformer Network, that non-linearly aligns the anatomical factors. The imaging factor captures signal intensity characteristics across different modality data, and is used for image reconstruction, enabling semi-supervised learning. Temporal and slice pairing between inputs are learned dynamically. We demonstrate applications in Late Gadolinium Enhanced (LGE) and Blood Oxygenation Level Dependent (BOLD) cardiac segmentation, as well as in T2 abdominal segmentation.

[1]  Andreas Nürnberger,et al.  CHAOS Challenge - Combined (CT-MR) Healthy Abdominal Organ Segmentation , 2020, Medical Image Anal..

[2]  Truyen Tran,et al.  Theory and Evaluation Metrics for Learning Disentangled Representations , 2019, ICLR.

[3]  Yong Yin,et al.  MMFNet: A Multi-modality MRI Fusion Network for Segmentation of Nasopharyngeal Carcinoma , 2018, Neurocomputing.

[4]  Sotirios A. Tsaftaris,et al.  Multimodal Cardiac Segmentation Using Disentangled Representation Learning , 2019, STACOM@MICCAI.

[5]  Maxime Sermesant,et al.  Style Data Augmentation for Robust Segmentation of Multi-modality Cardiac MRI , 2019, STACOM@MICCAI.

[6]  Ling He,et al.  SK-Unet: An Improved U-Net Model with Selective Kernel for the Segmentation of Multi-sequence Cardiac MR , 2019, STACOM@MICCAI.

[7]  Hao Chen,et al.  Robust Multimodal Brain Tumor Segmentation via Feature Disentanglement and Gated Fusion , 2019, MICCAI.

[8]  Daguang Xu,et al.  Cardiac Segmentation of LGE MRI with Noisy Labels , 2019, STACOM@MICCAI.

[9]  Gongning Luo,et al.  An Automatic Cardiac Segmentation Framework based on Multi-sequence MR Image , 2019, STACOM@MICCAI.

[10]  Víctor M. Campello,et al.  Combining Multi-Sequence and Synthetic Images for Improved Segmentation of Late Gadolinium Enhancement Cardiac MRI , 2019, STACOM@MICCAI.

[11]  Juntang Zhuang,et al.  Domain-Agnostic Learning With Anatomy-Consistent Embedding for Cross-Modality Liver Segmentation , 2019, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW).

[12]  Daniel Rueckert,et al.  Representation Disentanglement for Multi-task Learning with application to Fetal Ultrasound , 2019, SUSI/PIPPI@MICCAI.

[13]  D. Rueckert,et al.  Unsupervised Multi-modal Style Transfer for Cardiac MR Segmentation , 2019, STACOM@MICCAI.

[14]  Wufeng Xue,et al.  Segmentation of Multimodal Myocardial Images Using Shape-Transfer GAN , 2019, STACOM@MICCAI.

[15]  Fan Zhang,et al.  Unsupervised Domain Adaptation via Disentangled Representations: Application to Cross-Modality Liver Segmentation , 2019, MICCAI.

[16]  Daniel Rueckert,et al.  Detection and Correction of Cardiac MR Motion Artefacts during Reconstruction from K-space , 2019, ArXiv.

[17]  Chen Qian,et al.  TransGaGa: Geometry-Aware Unsupervised Image-To-Image Translation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Daniel Rueckert,et al.  Unsupervised Deformable Registration for Multi-Modal Images via Disentangled Representations , 2019, IPMI.

[19]  Taesung Park,et al.  Semantic Image Synthesis With Spatially-Adaptive Normalization , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Marleen de Bruijne,et al.  Learning Cross-Modality Representations From Multi-Modal Images , 2019, IEEE Trans. Medical Imaging.

[21]  Hao Chen,et al.  Synergistic Image and Feature Adaptation: Towards Cross-Modality Domain Adaptation for Medical Image Segmentation , 2019, AAAI.

[22]  Shunxing Bao,et al.  SynSeg-Net: Synthetic Segmentation Without Target Modality Ground Truth , 2018, IEEE Transactions on Medical Imaging.

[23]  Jing Yuan,et al.  HyperDense-Net: A Hyper-Densely Connected CNN for Multi-Modal Image Segmentation , 2018, IEEE Transactions on Medical Imaging.

[24]  Zhen Wang,et al.  On the Effectiveness of Least Squares Generative Adversarial Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Xiahai Zhuang,et al.  Multivariate Mixture Model for Myocardial Segmentation Combining Multi-Source Images , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Sotirios A. Tsaftaris,et al.  Disentangled representation learning in cardiac image analysis , 2019, Medical Image Anal..

[27]  R. Raskar,et al.  R EDUCING LEAKAGE IN DISTRIBUTED DEEP LEARNING FOR SENSITIVE HEALTH DATA , 2019 .

[28]  Lixu Gu,et al.  Multi-sequence myocardium segmentation with cross-constrained shape and neural network-based initialization , 2019, Comput. Medical Imaging Graph..

[29]  Gijs van Tulder,et al.  Learning Cross-Modality Representations From Multi-Modal Images , 2019, IEEE Transactions on Medical Imaging.

[30]  Jie Yang,et al.  Decompose to manipulate: Manipulable Object Synthesis in 3D Medical Images with Structured Image Decomposition , 2018, ArXiv.

[31]  Luc Van Gool,et al.  Exemplar Guided Unsupervised Image-to-Image Translation , 2018, ArXiv.

[32]  Jan Kautz,et al.  Multimodal Unsupervised Image-to-Image Translation , 2018, ECCV.

[33]  Guillaume Desjardins,et al.  Understanding disentangling in β-VAE , 2018, ArXiv.

[34]  Sotirios A. Tsaftaris,et al.  Factorised spatial representation learning: application in semi-supervised myocardial segmentation , 2018, MICCAI.

[35]  Andrew Gordon Wilson,et al.  Averaging Weights Leads to Wider Optima and Better Generalization , 2018, UAI.

[36]  Ben Glocker,et al.  Multi-modal Learning from Unpaired Images: Application to Multi-organ Segmentation in CT and MRI , 2018, 2018 IEEE Winter Conference on Applications of Computer Vision (WACV).

[37]  Lin Yang,et al.  Translating and Segmenting Multimodal Medical Volumes with Cycle- and Shape-Consistency Generative Adversarial Network , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[38]  Yuichi Yoshida,et al.  Spectral Normalization for Generative Adversarial Networks , 2018, ICLR.

[39]  Aaron C. Courville,et al.  FiLM: Visual Reasoning with a General Conditioning Layer , 2017, AAAI.

[40]  Sotirios A. Tsaftaris,et al.  Robust Multi-modal MR Image Synthesis , 2017, MICCAI.

[41]  David E Newby,et al.  Ferumoxytol-enhanced magnetic resonance imaging assessing inflammation after myocardial infarction , 2017, Heart.

[42]  Kilian Q. Weinberger,et al.  On Calibration of Modern Neural Networks , 2017, ICML.

[43]  Kuan-Lun Tseng,et al.  Joint Sequence Learning and Cross-Modality Convolution for 3D Biomedical Segmentation , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[44]  Alexander A. Alemi,et al.  Deep Variational Information Bottleneck , 2017, ICLR.

[45]  Christopher Joseph Pal,et al.  Brain tumor segmentation with Deep Neural Networks , 2015, Medical Image Anal..

[46]  Xiahai Zhuang,et al.  Multi-scale patch and multi-modality atlases for whole heart segmentation of MRI , 2016, Medical Image Anal..

[47]  Andrew Zisserman,et al.  Spatial Transformer Networks , 2015, NIPS.

[48]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[49]  Shaogang Gong,et al.  Transductive Multi-View Zero-Shot Learning , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[50]  Yang Yang,et al.  Advanced Normalization Tools for Cardiac Motion Correction , 2014, STACOM.

[51]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[52]  Nikos Paragios,et al.  Deformable Medical Image Registration: A Survey , 2013, IEEE Transactions on Medical Imaging.

[53]  Sotirios A. Tsaftaris,et al.  Detecting Myocardial Ischemia at Rest With Cardiac Phase–Resolved Blood Oxygen Level–Dependent Cardiovascular Magnetic Resonance , 2013, Circulation. Cardiovascular imaging.

[54]  Dong Wei,et al.  Myocardial Segmentation of Late Gadolinium Enhanced MR Images by Propagation of Contours from Cine MR Images , 2011, MICCAI.

[55]  R. Kim,et al.  Cardiovascular magnetic resonance in patients with myocardial infarction: current and emerging applications. , 2009, Journal of the American College of Cardiology.

[56]  Maria L. Rizzo,et al.  Measuring and testing dependence by correlation of distances , 2007, 0803.4101.