Deep Learning-Based Computed Tomography Image Standardization to Improve Generalizability of Deep Learning-Based Hepatic Segmentation

Objective We aimed to investigate whether image standardization using deep learning-based computed tomography (CT) image conversion would improve the performance of deep learning-based automated hepatic segmentation across various reconstruction methods. Materials and Methods We collected contrast-enhanced dual-energy CT of the abdomen that was obtained using various reconstruction methods, including filtered back projection, iterative reconstruction, optimum contrast, and monoenergetic images with 40, 60, and 80 keV. A deep learning based image conversion algorithm was developed to standardize the CT images using 142 CT examinations (128 for training and 14 for tuning). A separate set of 43 CT examinations from 42 patients (mean age, 10.1 years) was used as the test data. A commercial software program (MEDIP PRO v2.0.0.0, MEDICALIP Co. Ltd.) based on 2D U-NET was used to create liver segmentation masks with liver volume. The original 80 keV images were used as the ground truth. We used the paired t-test to compare the segmentation performance in the Dice similarity coefficient (DSC) and difference ratio of the liver volume relative to the ground truth volume before and after image standardization. The concordance correlation coefficient (CCC) was used to assess the agreement between the segmented liver volume and ground-truth volume. Results The original CT images showed variable and poor segmentation performances. The standardized images achieved significantly higher DSCs for liver segmentation than the original images (DSC [original, 5.40%–91.27%] vs. [standardized, 93.16%–96.74%], all P < 0.001). The difference ratio of liver volume also decreased significantly after image conversion (original, 9.84%–91.37% vs. standardized, 1.99%–4.41%). In all protocols, CCCs improved after image conversion (original, -0.006–0.964 vs. standardized, 0.990–0.998). Conclusion Deep learning-based CT image standardization can improve the performance of automated hepatic segmentation using CT images reconstructed using various methods. Deep learning-based CT image conversion may have the potential to improve the generalizability of the segmentation network.

[1]  Alice C. Yu,et al.  External Validation of Deep Learning Algorithms for Radiologic Diagnosis: A Systematic Review. , 2022, Radiology. Artificial intelligence.

[2]  David A Weiss,et al.  Interinstitutional Portability of a Deep Learning Brain MRI Lesion Segmentation Algorithm. , 2021, Radiology. Artificial intelligence.

[3]  L. Schwartz,et al.  Toward Generalizability in the Deployment of Artificial Intelligence in Radiology: Role of Computation Stress Testing to Overcome Underspecification. , 2021, Radiology. Artificial intelligence.

[4]  L. Folio,et al.  Training Strategies for Radiology Deep Learning Models in Data-limited Scenarios. , 2021, Radiology. Artificial intelligence.

[5]  P. Cheng,et al.  Deep Learning: An Update for Radiologists. , 2021, Radiographics : a review publication of the Radiological Society of North America, Inc.

[6]  Ying Shan,et al.  Real-ESRGAN: Training Real-World Blind Super-Resolution with Pure Synthetic Data , 2021, 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW).

[7]  N. Karssemeijer,et al.  Impact of artificial intelligence support on accuracy and reading time in breast tomosynthesis image interpretation: a multi-reader multi-case study , 2021, European Radiology.

[8]  Edward H. Lee,et al.  Artificial intelligence for automatic cerebral ventricle segmentation and volume calculation: a clinical tool for the evaluation of pediatric hydrocephalus. , 2020, Journal of neurosurgery. Pediatrics.

[9]  D. Comaniciu,et al.  Autonomous Detection and Classification of PI-RADS Lesions in an MRI Screening Population Incorporating Multicenter-Labeled Deep Learning and Biparametric Imaging: Proof of Concept , 2020, Diagnostics.

[10]  P. Choyke,et al.  Data Augmentation and Transfer Learning to Improve Generalizability of an Automated Prostate Segmentation Model. , 2020, AJR. American journal of roentgenology.

[11]  Finale Doshi-Velez,et al.  The myth of generalisability in clinical research and machine learning in health care , 2020, The Lancet Digital Health.

[12]  Hyun Seok Choi,et al.  A deep learning algorithm may automate intracranial aneurysm detection on MR angiography with high diagnostic performance , 2020, European Radiology.

[13]  M. Lungren,et al.  Preparing Medical Imaging Data for Machine Learning. , 2020, Radiology.

[14]  Daniel Rueckert,et al.  Improving the Generalizability of Convolutional Neural Network-Based Segmentation on CMR Images , 2019, Frontiers in Cardiovascular Medicine.

[15]  Kathryn J Fowler,et al.  Assessing Radiology Research on Artificial Intelligence: A Brief Guide for Authors, Reviewers, and Readers-From the Radiology Editorial Board. , 2019, Radiology.

[16]  Jitendra Malik,et al.  Expert-level detection of acute intracranial hemorrhage on head computed tomography using deep learning , 2019, Proceedings of the National Academy of Sciences.

[17]  D. Rueckert,et al.  A Multicenter, Scan-Rescan, Human and Machine Learning CMR Study to Test Generalizability and Precision in Imaging Biomarker Analysis. , 2019, Circulation. Cardiovascular imaging.

[18]  Taghi M. Khoshgoftaar,et al.  A survey on Image Data Augmentation for Deep Learning , 2019, Journal of Big Data.

[19]  Dana I. Casetti-Dinescu,et al.  Generalizable Multi-Site Training and Testing Of Deep Neural Networks Using Image Normalization , 2019, 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019).

[20]  J. Dean,et al.  A guide to deep learning in healthcare , 2019, Nature Medicine.

[21]  A. Ng,et al.  Deep learning for chest radiograph diagnosis: A retrospective comparison of the CheXNeXt algorithm to practicing radiologists , 2018, PLoS medicine.

[22]  Srini Tridandapani,et al.  Deep Learning in Radiology. , 2018, Academic radiology.

[23]  S. Park,et al.  Methodologic Guide for Evaluating Clinical Performance and Effect of Artificial Intelligence Technology for Medical Diagnosis and Prediction. , 2018, Radiology.

[24]  Christian Ledig,et al.  Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Li Fei-Fei,et al.  Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.

[26]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[27]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[28]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[29]  L. Lin,et al.  A concordance correlation coefficient to evaluate reproducibility. , 1989, Biometrics.