Hi-Net: Hybrid-Fusion Network for Multi-Modal MR Image Synthesis

Magnetic resonance imaging (MRI) is a widely used neuroimaging technique that can provide images of different contrasts (i.e., modalities). Fusing these multi-modal data has proven particularly effective for boosting model performance in many tasks. However, due to poor data quality and frequent patient dropout, collecting all modalities for every patient remains a challenge. Medical image synthesis has been proposed as an effective solution, in which any missing modalities are synthesized from the existing ones. In this paper, we propose a novel Hybrid-fusion Network (Hi-Net) for multi-modal MR image synthesis, which learns a mapping from multi-modal source images (i.e., existing modalities) to target images (i.e., missing modalities). In Hi-Net, a modality-specific network learns representations for each individual modality, and a fusion network learns the common latent representation of the multi-modal data. A multi-modal synthesis network then densely combines the latent representation with hierarchical features from each modality, acting as a generator to synthesize the target images. Moreover, a layer-wise multi-modal fusion strategy exploits the correlations among multiple modalities, in which a Mixed Fusion Block (MFB) adaptively weights different fusion strategies. Extensive experiments demonstrate that the proposed model outperforms other state-of-the-art medical image synthesis methods.
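
To make the idea of adaptively weighting fusion strategies concrete, the following is a minimal NumPy sketch and not the authors' implementation. The abstract does not list the fusion strategies or the weighting mechanism; the sketch assumes three element-wise strategies (sum, product, maximum) and a softmax over three learnable scalars. The names `mixed_fusion` and `logits` are hypothetical.

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over a 1-D array.
    e = np.exp(x - np.max(x))
    return e / e.sum()

def mixed_fusion(feat_a, feat_b, logits):
    """Fuse two same-shape feature maps as a weighted mix of fusion strategies.

    ASSUMPTION: the three strategies (sum, product, max) and the softmax
    weighting are illustrative choices, not taken from the abstract.
    """
    candidates = np.stack([
        feat_a + feat_b,             # element-wise summation
        feat_a * feat_b,             # element-wise product
        np.maximum(feat_a, feat_b),  # element-wise maximum
    ])
    weights = softmax(np.asarray(logits, dtype=float))
    # Contract the strategy axis: result has the shape of one feature map.
    return np.tensordot(weights, candidates, axes=1)

# Example: two feature maps from different modalities, shape (channels, H, W).
t1_features = np.ones((2, 4, 4))
t2_features = 2.0 * np.ones((2, 4, 4))
fused = mixed_fusion(t1_features, t2_features, logits=[0.0, 0.0, 0.0])
```

In a trained network the `logits` would be learned parameters (and the mix would more likely be a convolution over concatenated candidates); here they are passed in by hand so the weighting can be inspected directly.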
