Quantification of pulmonary involvement in COVID-19 pneumonia by means of a cascade of two U-nets: training and assessment on multiple datasets using different annotation criteria

This study aims at exploiting artificial intelligence (AI) for the identification, segmentation and quantification of COVID-19 pulmonary lesions. The limited data availability and the annotation quality are relevant factors in training AI-methods. We investigated the effects of using multiple datasets, heterogeneously populated and annotated according to different criteria. We developed an automated analysis pipeline, the LungQuant system, based on a cascade of two U-nets. The first one (U-net1\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$_1$$\end{document}) is devoted to the identification of the lung parenchyma; the second one (U-net2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$_2$$\end{document}) acts on a bounding box enclosing the segmented lungs to identify the areas affected by COVID-19 lesions. Different public datasets were used to train the U-nets and to evaluate their segmentation performances, which have been quantified in terms of the Dice Similarity Coefficients. The accuracy in predicting the CT-Severity Score (CT-SS) of the LungQuant system has been also evaluated. Both the volumetric DSC (vDSC) and the accuracy showed a dependency on the annotation quality of the released data samples. On an independent dataset (COVID-19-CT-Seg), both the vDSC and the surface DSC (sDSC) were measured between the masks predicted by LungQuant system and the reference ones. The vDSC (sDSC) values of 0.95±0.01 and 0.66±0.13 (0.95±0.02 and 0.76±0.18, with 5 mm tolerance) were obtained for the segmentation of lungs and COVID-19 lesions, respectively. The system achieved an accuracy of 90% in CT-SS identification on this benchmark dataset. We analysed the impact of using data samples with different annotation criteria in training an AI-based quantification system for pulmonary involvement in COVID-19 pneumonia. In terms of vDSC measures, the U-net segmentation strongly depends on the quality of the lesion annotations. Nevertheless, the CT-SS can be accurately predicted on independent test sets, demonstrating the satisfactory generalization ability of the LungQuant.

[1]  Arko Barman,et al.  Novel Autosegmentation Spatial Similarity Metrics Capture the Time Required to Correct Segmentations Better Than Traditional Metrics in a Thoracic Cavity Segmentation Workflow , 2021, Journal of Digital Imaging.

[2]  M. Kalra,et al.  Association of AI quantified COVID-19 chest CT and patient outcome , 2021, International Journal of Computer Assisted Radiology and Surgery.

[3]  Kendall J. Kiser,et al.  PleThora: Pleural effusion and thoracic cavity segmentations in diseased lungs for benchmarking chest CT processing pipelines , 2020, Medical physics.

[4]  Stephen M. Moore,et al.  The Cancer Imaging Archive (TCIA): Maintaining and Operating a Public Information Repository , 2013, Journal of Digital Imaging.

[5]  Russell T. Shinohara,et al.  Harmonization of cortical thickness measurements across scanners and sites , 2017, NeuroImage.

[6]  Georg Langs,et al.  Automatic lung segmentation in routine imaging is a data diversity problem, not a methodology problem , 2020, ArXiv.

[7]  Bram van Ginneken,et al.  Relational Modeling for Robust and Efficient Pulmonary Lobe Segmentation in CT Scans , 2020, IEEE Transactions on Medical Imaging.

[8]  L. Giancardo,et al.  Novel Autosegmentation Spatial Similarity Metrics Capture the Time Required to Correct Segmentations Better Than Traditional Metrics in a Thoracic Cavity Segmentation Workflow , 2020, Journal of Digital Imaging.

[9]  Georg Langs,et al.  Automatic lung segmentation in routine imaging is primarily a data diversity problem, not a methodology problem , 2020, European Radiology Experimental.

[10]  B. van Ginneken,et al.  Relational Modeling for Robust and Efficient Pulmonary Lobe Segmentation in CT Scans. , 2020, IEEE transactions on medical imaging.

[11]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[12]  S. P. Morozov,et al.  MosMedData: Chest CT Scans with COVID-19 Related Findings , 2020, medRxiv.

[13]  Kazuma Yamamoto,et al.  Resolving Class Imbalance in Object Detection with Weighted Cross Entropy Losses , 2020, ArXiv.

[14]  Xi Fang,et al.  Multi-Organ Segmentation Over Partially Labeled Datasets With Multi-Scale Feature Abstraction , 2020, IEEE Transactions on Medical Imaging.

[15]  Nikolas Lessmann,et al.  Automated Assessment of CO-RADS and Chest CT Severity Scores in Patients with Suspected COVID-19 Using Artificial Intelligence , 2020, Radiology.

[16]  A. Giovagnoni,et al.  Chest CT features of coronavirus disease 2019 (COVID-19) pneumonia: key points for radiologists , 2020, La radiologia medica.

[17]  Patrice Y. Simard,et al.  Best practices for convolutional neural networks applied to visual document analysis , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[18]  Zhiqiang He,et al.  Towards Data-Efficient Learning: A Benchmark for COVID-19 CT Lung and Infection Segmentation. , 2020, Medical physics.