UMS-Rep: Unified modality-specific representation for efficient medical image analysis

Abstract Medical image analysis typically includes several tasks such as enhancement, segmentation, and classification. Traditionally, these tasks are implemented using separate deep learning models for separate tasks, which is not efficient because it involves unnecessary training repetitions, demands greater computational resources, and requires a relatively large amount of labeled data. In this paper, we propose a multi-task training approach for medical image analysis, where individual tasks are fine-tuned simultaneously through relevant knowledge transfer using a unified modality-specific feature representation (UMS-Rep). We explore different fine-tuning strategies to demonstrate the impact of the strategy on the performance of target medical image tasks. We experiment with different visual tasks (e.g., image denoising, segmentation, and classification) to highlight the advantages offered with our approach for two imaging modalities, chest X-ray and Doppler echocardiography. Our results demonstrate that the proposed approach reduces the overall demand for computational resources and improves target task generalization and performance. Specifically, the proposed approach improves accuracy (up to ∼ 9% ↑ ) and decreases computational time (up to ∼ 86% ↓ ) as compared to the baseline approach. Further, our results prove that the performance of target tasks in medical images is highly influenced by the utilized fine-tuning strategy.

[1]  Ahmed M. Elgammal,et al.  Convolutional Models for Joint Object Categorization and Pose Estimation , 2015, ArXiv.

[2]  Dwarikanath Mahapatra,et al.  Progressive Generative Adversarial Networks for Medical Image Super resolution , 2019, ArXiv.

[3]  Nima Tajbakhsh,et al.  Models Genesis: Generic Autodidactic Models for 3D Medical Image Analysis , 2019, MICCAI.

[4]  Quoc V. Le,et al.  Multi-task Sequence to Sequence Learning , 2015, ICLR.

[5]  Sameer Antani,et al.  Visualization and Interpretation of Convolutional Neural Network Predictions in Detecting Pneumonia in Pediatric Chest Radiographs , 2018, Applied sciences.

[6]  Bram van Ginneken,et al.  A survey on deep learning in medical image analysis , 2017, Medical Image Anal..

[7]  Roberto Cipolla,et al.  Multi-task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[8]  Sotirios A. Tsaftaris,et al.  Medical Image Computing and Computer Assisted Intervention , 2017 .

[9]  Zhao Chen,et al.  GradNorm: Gradient Normalization for Adaptive Loss Balancing in Deep Multitask Networks , 2017, ICML.

[10]  Ling Zhang,et al.  Automated breast cancer detection and classification using ultrasound images: A survey , 2015, Pattern Recognit..

[11]  Vasudevan Lakshminarayanan,et al.  Application of an enhanced deep super-resolution network in retinal image analysis , 2020, BiOS.

[12]  Myeongsu Kang,et al.  A Hybrid Technique for Medical Image Segmentation , 2012, Journal of biomedicine & biotechnology.

[13]  Dianhai Yu,et al.  Multi-Task Learning for Multiple Language Translation , 2015, ACL.

[14]  Jonathan Baxter,et al.  A Bayesian/Information Theoretic Model of Learning to Learn via Multiple Task Sampling , 1997, Machine Learning.

[15]  Daniel Rueckert,et al.  Multi-Task Learning for Left Atrial Segmentation on GE-MRI , 2018, STACOM@MICCAI.

[16]  Sameer Antani,et al.  Accelerating Super-Resolution and Visual Task Analysis in Medical Images , 2020 .

[17]  Farida Cheriet,et al.  A multi-scale tensor voting approach for small retinal vessel segmentation in high resolution fundus images , 2016, Comput. Medical Imaging Graph..

[18]  S. Welstead Fractal and Wavelet Image Compression Techniques , 1999 .

[19]  Kim-Han Thung,et al.  A brief review on multi-task learning , 2018, Multimedia Tools and Applications.

[20]  Graham W. Taylor,et al.  Deconvolutional networks , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[21]  Max A. Viergever,et al.  Deep Learning for Multi-Task Medical Image Segmentation in Multiple Modalities , 2016, MICCAI.

[22]  Xiaoou Tang,et al.  Facial Landmark Detection by Deep Multi-task Learning , 2014, ECCV.

[23]  Mert R. Sabuncu,et al.  An Unsupervised Learning Model for Deformable Medical Image Registration , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[24]  Stefan Jaeger,et al.  Two public chest X-ray datasets for computer-aided screening of pulmonary diseases. , 2014, Quantitative imaging in medicine and surgery.

[25]  Jaime S. Cardoso,et al.  Deep Learning and Data Labeling for Medical Applications , 2016, Lecture Notes in Computer Science.

[26]  Jia-Bin Huang,et al.  DF-Net: Unsupervised Joint Learning of Depth and Flow using Cross-Task Consistency , 2018, ECCV.

[27]  Carol C Wu,et al.  Augmenting the National Institutes of Health Chest Radiograph Dataset with Expert Annotations of Possible Pneumonia. , 2019, Radiology. Artificial intelligence.

[28]  Roberto Cipolla,et al.  MultiNet: Real-time Joint Semantic Reasoning for Autonomous Driving , 2016, 2018 IEEE Intelligent Vehicles Symposium (IV).

[29]  N. Mahmood,et al.  Comparison between Median, Unsharp and Wiener filter and its effect on ultrasound stomach tissue image segmentation for Pyloric Stenosis , 2011 .

[30]  Daniel Rueckert,et al.  Self-Supervised Learning for Cardiac MR Image Segmentation by Anatomical Position Prediction , 2019, MICCAI.

[31]  Pavel Kisilev,et al.  Medical Image Description Using Multi-task-loss CNN , 2016, LABELS/DLMIA@MICCAI.

[32]  Chuan Zhou,et al.  Deep Learning in Medical Image Analysis. , 2017, Advances in experimental medicine and biology.

[33]  Daniel S. Kermany,et al.  Identifying Medical Diagnoses and Treatable Diseases by Image-Based Deep Learning , 2018, Cell.

[34]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Jitendra Malik,et al.  Which Tasks Should Be Learned Together in Multi-task Learning? , 2019, ICML.