XCAT-GAN for Synthesizing 3D Consistent Labeled Cardiac MR Images on Anatomically Variable XCAT Phantoms

Generative adversarial networks (GANs) have provided promising data enrichment solutions by synthesizing high-fidelity images. However, generating large sets of labeled images with new anatomical variations remains unexplored. We propose a novel method for synthesizing cardiac magnetic resonance (CMR) images on a population of virtual subjects with large anatomical variation, introduced using the 4D eXtended Cardiac and Torso (XCAT) computerized human phantom. We investigate two conditional image synthesis approaches grounded in a semantically consistent, mask-guided image generation technique: 4-class and 8-class XCAT-GANs. The 4-class technique relies only on the annotations of the heart, while the 8-class technique employs a predicted multi-tissue label map of the heart and its surrounding organs, which provides better guidance for conditional image synthesis. For both techniques, we train the conditional XCAT-GAN on real images paired with their corresponding labels; at inference time, we substitute the labels with XCAT-derived ones, so that the trained network transfers the tissue-specific textures to the new label maps. By creating synthetic CMR images for 33 virtual subjects at the end-diastolic and end-systolic phases, we evaluate the usefulness of such data in the downstream cardiac cavity segmentation task under different augmentation strategies. Results demonstrate that even with only 20% of the real images (40 volumes) seen during training, segmentation performance is retained when synthetic CMR images are added. Moreover, augmenting the real data with synthetic images reduces the Hausdorff distance by up to 28% and increases the Dice score by up to 5%, indicating a higher similarity to the ground truth in all dimensions.
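
The two segmentation metrics named above can be illustrated with a minimal NumPy sketch. This is not the authors' evaluation code: the function names `dice_score` and `hausdorff_distance` are hypothetical, distances are measured in pixel units (voxel spacing is ignored), and the Hausdorff distance is computed over all foreground pixels of two binary masks rather than over extracted surfaces.

```python
import numpy as np

def dice_score(pred, truth):
    """Dice overlap between two binary masks: 2|A∩B| / (|A| + |B|)."""
    pred = np.asarray(pred, dtype=bool)
    truth = np.asarray(truth, dtype=bool)
    denom = pred.sum() + truth.sum()
    if denom == 0:
        return 1.0  # assumption: two empty masks count as a perfect match
    return float(2.0 * np.logical_and(pred, truth).sum() / denom)

def hausdorff_distance(pred, truth):
    """Symmetric Hausdorff distance between foreground pixel sets (pixel units)."""
    p = np.argwhere(np.asarray(pred, dtype=bool))
    t = np.argwhere(np.asarray(truth, dtype=bool))
    if len(p) == 0 or len(t) == 0:
        raise ValueError("both masks must contain foreground pixels")
    # Pairwise Euclidean distances; brute force, adequate for small masks only.
    d = np.linalg.norm(p[:, None, :] - t[None, :, :], axis=-1)
    # Largest nearest-neighbour distance, taken in both directions.
    return float(max(d.min(axis=1).max(), d.min(axis=0).max()))

# Toy example: a 4x4 square and the same square shifted down by one row.
truth = np.zeros((8, 8), dtype=bool)
truth[2:6, 2:6] = True
pred = np.zeros((8, 8), dtype=bool)
pred[3:7, 2:6] = True

print(dice_score(pred, truth))          # 0.75
print(hausdorff_distance(pred, truth))  # 1.0
```

A higher Dice score means greater overlap with the ground truth, whereas a lower Hausdorff distance means a smaller worst-case boundary error, which is why the two metrics move in opposite directions when segmentation improves.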
