Sensitivity and Specificity Evaluation of Deep Learning Models for Detection of Pneumoperitoneum on Chest Radiographs

Background: Deep learning has great potential to assist with detecting and triaging critical findings such as pneumoperitoneum on medical images. To be clinically useful, the performance of this technology still needs to be validated for generalizability across different types of imaging systems. Materials and Methods: This retrospective study included 1,287 chest X-ray images of patients who underwent initial chest radiography at 13 different hospitals between 2011 and 2019. The chest X-ray images were labelled independently by four radiologist experts as positive or negative for pneumoperitoneum. State-of-the-art deep learning models (ResNet101, InceptionV3, DenseNet161, and ResNeXt101) were trained on a subset of this dataset, and the automated classification performance was evaluated on the rest of the dataset by measuring the AUC, sensitivity, and specificity for each model. Furthermore, the generalizability of these deep learning models was assessed by stratifying the test dataset according to the type of the utilized imaging systems. Results: All deep learning models performed well for identifying radiographs with pneumoperitoneum, while DenseNet161 achieved the highest AUC of 95.7%, Specificity of 89.9%, and Sensitivity of 91.6%. DenseNet161 model was able to accurately classify radiographs from different imaging systems (Accuracy: 90.8%), while it was trained on images captured from a specific imaging system from a single institution. This result suggests the generalizability of our model for learning salient features in chest X-ray images to detect pneumoperitoneum, independent of the imaging system.

[1]  Ruoyu Li,et al.  Weakly Supervised Deep Learning for Thoracic Disease Classification and Localization on Chest X-rays , 2018, BCB.

[2]  G. Velmahos,et al.  Free air on plain film: Do we need a computed tomography too? , 2014, Journal of emergencies, trauma, and shock.

[3]  Z. Yen,et al.  Ultrasonography is superior to plain radiography in the diagnosis of pneumoperitoneum , 2002, The British journal of surgery.

[4]  A. McMillan,et al.  Deep learning Mr imaging–based attenuation correction for PeT/Mr imaging 1 , 2017 .

[5]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[6]  M. Rengo,et al.  Multidetector CT in emergency radiology: acute and generalized non-traumatic abdominal pain. , 2016, The British journal of radiology.

[7]  Shuicheng Yan,et al.  Dual Path Networks , 2017, NIPS.

[8]  Manu Goyal,et al.  Breast ultrasound region of interest detection and lesion localisation , 2020, Artif. Intell. Medicine.

[9]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[11]  Saeed Hassanpour,et al.  Deep neural networks for automatic detection of osteoporotic vertebral fractures on CT scans , 2018, Comput. Biol. Medicine.

[12]  Andrew Y. Ng,et al.  CheXNet: Radiologist-Level Pneumonia Detection on Chest X-Rays with Deep Learning , 2017, ArXiv.

[13]  Manu Goyal,et al.  Semantic Segmentation of Human Thigh Quadriceps Muscle in Magnetic Resonance Images , 2018, ArXiv.

[14]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  D. Thickman,et al.  Diagnosis of Pneumoperitoneum: Abdominal CT vs. Upright Chest Film , 1992, Journal of computer assisted tomography.

[17]  Ronald M. Summers,et al.  ChestX-ray: Hospital-Scale Chest X-ray Database and Benchmarks on Weakly Supervised Classification and Localization of Common Thorax Diseases , 2019, Deep Learning and Convolutional Neural Networks for Medical Imaging and Clinical Informatics.

[18]  Forrest N. Iandola,et al.  DenseNet: Implementing Efficient ConvNet Descriptor Pyramids , 2014, ArXiv.

[19]  J. Woodring,et al.  Detection of pneumoperitoneum on chest radiographs: comparison of upright lateral and posteroanterior projections. , 1995, AJR. American journal of roentgenology.