JSSR: A Joint Synthesis, Segmentation, and Registration System for 3D Multi-Modal Image Alignment of Large-scale Pathological CT Scans

Multi-modal image registration is a challenging problem that is also an important clinical task for many real applications and scenarios. As a first step in analysis, deformable registration among different image modalities is often required in order to provide complementary visual information. During registration, semantic information is key to match homologous points and pixels. Nevertheless, many conventional registration methods are incapable in capturing high-level semantic anatomical dense correspondences. In this work, we propose a novel multi-task learning system, JSSR, based on an end-to-end 3D convolutional neural network that is composed of a generator, a registration and a segmentation component. The system is optimized to satisfy the implicit constraints between different tasks in an unsupervised manner. It first synthesizes the source domain images into the target domain, then an intra-modal registration is applied on the synthesized images and target images. The segmentation module are then applied on the synthesized and target images, providing additional cues based on semantic correspondences. The supervision from another fully-annotated dataset is used to regularize the segmentation. We extensively evaluate JSSR on a large-scale medical image dataset containing 1,485 patient CT imaging studies of four different contrast phases (i.e., 5,940 3D CT scans with pathological livers) on the registration, segmentation and synthesis tasks. The performance is improved after joint training on the registration and segmentation tasks by 0.9% and 1.9% respectively compared to a highly competitive and accurate deep learning baseline. The registration also consistently outperforms conventional state-of-the-art multi-modal registration methods.

[1]  Michael Brady,et al.  Globally Optimal Deformable Registration on a Minimum Spanning Tree Using Dense Displacement Sampling , 2012, MICCAI.

[2]  Isaac N. Bankman,et al.  Handbook of medical image processing and analysis , 2009 .

[3]  Dwarikanath Mahapatra,et al.  Deformable medical image registration using generative adversarial networks , 2018, 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018).

[4]  Sébastien Ourselin,et al.  Weakly-supervised convolutional neural networks for multimodal image registration , 2018, Medical Image Anal..

[5]  Seyed-Ahmad Ahmadi,et al.  V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation , 2016, 2016 Fourth International Conference on 3D Vision (3DV).

[6]  M. Bel Registration , 1892, Science.

[7]  Baowei Fei,et al.  3D non-rigid registration using surface and local salient features for transrectal ultrasound image-guided prostate biopsy , 2011, Medical Imaging.

[8]  Qi Tian,et al.  Phase Collaborative Network for Multi-Phase Medical Imaging Segmentation , 2018, ArXiv.

[9]  Ender Konukoglu,et al.  Generative Adversarial Networks for MR-CT Deformable Image Registration , 2018, ArXiv.

[10]  Yan Wang,et al.  Hyper-Pairing Network for Multi-Phase Pancreatic Ductal Adenocarcinoma Segmentation , 2019, MICCAI.

[11]  Junghoon Lee,et al.  A deformable multimodal image registration using PET/CT and TRUS for intraoperative focal prostate brachytherapy , 2019, Medical Imaging.

[12]  Max A. Viergever,et al.  End-to-End Unsupervised Deformable Image Registration with a Convolutional Neural Network , 2017, DLMIA/ML-CDS@MICCAI.

[13]  Viergever,et al.  An Overview of Medical Image Registration Methods , 1998 .

[14]  Sébastien Ourselin,et al.  Evaluation of Six Registration Methods for the Human Abdomen on Clinically Acquired CT , 2016, IEEE Transactions on Biomedical Engineering.

[15]  Marc Niethammer,et al.  DeepAtlas: Joint Semi-Supervised Learning of Image Registration and Segmentation , 2019, MICCAI.

[16]  Yong Fan,et al.  Non-rigid image registration using self-supervised fully convolutional networks without training data , 2018, 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018).

[17]  Jan Kautz,et al.  Multimodal Unsupervised Image-to-Image Translation , 2018, ECCV.

[18]  Boudewijn P. F. Lelieveldt,et al.  Nonrigid Image Registration Using Multi-scale 3D Convolutional Neural Networks , 2017, MICCAI.

[19]  Shunxing Bao,et al.  SynSeg-Net: Synthetic Segmentation Without Target Modality Ground Truth , 2018, IEEE Transactions on Medical Imaging.

[20]  Michael Brady,et al.  MRF-Based Deformable Registration and Ventilation Estimation of Lung CT , 2013, IEEE Transactions on Medical Imaging.

[21]  Jürgen Weese,et al.  Landmark-based elastic registration using approximating thin-plate splines , 2001, IEEE Transactions on Medical Imaging.

[22]  Daniel Rueckert,et al.  Joint Learning of Motion Estimation and Segmentation for Cardiac MR Image Sequences , 2018, MICCAI.

[23]  Brian B. Avants,et al.  Symmetric diffeomorphic image registration with cross-correlation: Evaluating automated labeling of elderly and neurodegenerative brain , 2008, Medical Image Anal..

[24]  Ronald M. Summers,et al.  A large annotated medical image dataset for the development and evaluation of segmentation algorithms , 2019, ArXiv.

[25]  Dinggang Shen,et al.  Synthesis and Inpainting-Based MR-CT Registration for Image-Guided Thermal Ablation of Liver Tumors , 2019, MICCAI.

[26]  Daniel Cohen-Or,et al.  Unsupervised Multi-Modal Image Registration via Geometry Preserving Image-to-Image Translation , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Eli Shechtman,et al.  Matching Local Self-Similarities across Images and Videos , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[28]  Michael Brady,et al.  Towards Realtime Multimodal Fusion for Image-Guided Interventions Using Self-similarities , 2013, MICCAI.

[29]  Heinz Handels,et al.  Multi-modal Multi-Atlas Segmentation using Discrete Optimisation and Self-Similarities , 2015, VISCERAL Challenge@ISBI.

[30]  Stefan Klein,et al.  SimpleElastix: A User-Friendly, Multi-lingual Library for Medical Image Registration , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[31]  Alexei A. Efros,et al.  Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[32]  Yaozong Gao,et al.  Dual‐core steered non‐rigid registration for multi‐modal images via bi‐directional image synthesis , 2017, Medical Image Anal..

[33]  Runze Han,et al.  Learning-based deformable image registration: effect of statistical mismatch between train and test images , 2019, Journal of medical imaging.

[34]  Colin Studholme,et al.  An overlap invariant entropy measure of 3D medical image alignment , 1999, Pattern Recognit..

[35]  Wiro Niessen,et al.  A hybrid deep learning framework for integrated segmentation and registration: evaluation on longitudinal white matter tract changes , 2019, MICCAI.

[36]  Guy Marchal,et al.  Multimodality image registration by maximization of mutual information , 1997, IEEE Transactions on Medical Imaging.

[37]  Maxime Sermesant,et al.  SVF-Net: Learning Deformable Image Registration Using Shape Matching , 2017, MICCAI.

[38]  Lin Yang,et al.  Translating and Segmenting Multimodal Medical Volumes with Cycle- and Shape-Consistency Generative Adversarial Network , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[39]  Andrew Zisserman,et al.  Spatial Transformer Networks , 2015, NIPS.

[40]  Michael Brady,et al.  MIND: Modality independent neighbourhood descriptor for multi-modal deformable registration , 2012, Medical Image Anal..

[41]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[42]  Daniel Rueckert,et al.  Unsupervised Deformable Registration for Multi-Modal Images via Disentangled Representations , 2019, IPMI.

[43]  Yong Fan,et al.  Non-rigid image registration using fully convolutional networks with deep self-supervision , 2017, ArXiv.

[44]  Ronald M. Summers,et al.  Progressive and Multi-path Holistically Nested Neural Networks for Pathological Lung Segmentation from CT Images , 2017, MICCAI.

[45]  Hervé Delingette,et al.  Learning a Probabilistic Model for Diffeomorphic Registration , 2018, IEEE Transactions on Medical Imaging.

[46]  Max A. Viergever,et al.  A deep learning framework for unsupervised affine and deformable image registration , 2018, Medical Image Anal..

[47]  Mattias P. Heinrich,et al.  Learning interpretable multi-modal features for alignment with supervised iterative descent , 2019, MIDL.

[48]  Mert R. Sabuncu,et al.  VoxelMorph: A Learning Framework for Deformable Medical Image Registration , 2018, IEEE Transactions on Medical Imaging.