Deep Learning Improves the Temporal Reproducibility of Aortic Measurement

Imaging-based measurements form the basis of surgical decision making in patients with aortic aneurysm. Unfortunately, manual measurement suffer from suboptimal temporal reproducibility, which can lead to delayed or unnecessary intervention. We tested the hypothesis that deep learning could improve upon the temporal reproducibility of CT angiography-derived thoracic aortic measurements in the setting of imperfect ground-truth training data. To this end, we trained a standard deep learning segmentation model from which measurements of aortic volume and diameter could be extracted. First, three blinded cardiothoracic radiologists visually confirmed non-inferiority of deep learning segmentation maps with respect to manual segmentation on a 50-patient hold-out test cohort, demonstrating a slight preference for the deep learning method (p < 1e-5). Next, reproducibility was assessed by evaluating measured change (coefficient of reproducibility and standard deviation) in volume and diameter values extracted from segmentation maps in patients for whom multiple scans were available and whose aortas had been deemed stable over time by visual assessment (n = 57 patients, 206 scans). Deep learning temporal reproducibility was superior for measures of both volume (p < 0.008) and diameter (p < 1e-5) and reproducibility metrics compared favorably with previously reported values of manual inter-rater variability. Our work motivates future efforts to apply deep learning to aortic evaluation.

[1]  C. Bruce,et al.  CT and MRI assessment of the aortic root and ascending aorta. , 2013, AJR. American journal of roentgenology.

[2]  R. Bonser,et al.  Reproducibility of thoracic aortic diameter measurement using computed tomographic scans. , 1999, European journal of cardio-thoracic surgery : official journal of the European Association for Cardio-thoracic Surgery.

[3]  D. Altman,et al.  STATISTICAL METHODS FOR ASSESSING AGREEMENT BETWEEN TWO METHODS OF CLINICAL MEASUREMENT , 1986, The Lancet.

[4]  Joel Nothman,et al.  SciPy 1.0-Fundamental Algorithms for Scientific Computing in Python , 2019, ArXiv.

[5]  Yuanyuan Wang,et al.  MRI Manufacturer Shift and Adaptation: Increasing the Generalizability of Deep Learning Segmentation for MR Images Acquired with Different Scanners. , 2020, Radiology. Artificial intelligence.

[6]  Joel Nothman,et al.  Author Correction: SciPy 1.0: fundamental algorithms for scientific computing in Python , 2020, Nature Methods.

[7]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[8]  Adam Parr,et al.  Comparison of volume and diameter measurement in assessing small abdominal aortic aneurysm expansion examined using computed tomographic angiography. , 2011, European journal of radiology.

[9]  J. Elefteriades,et al.  Height alone, rather than body surface area, suffices for risk estimation in ascending aortic aneurysm , 2017, The Journal of thoracic and cardiovascular surgery.

[10]  K. Jarrod Millman,et al.  Array programming with NumPy , 2020, Nat..

[11]  M. Barreiro-Pérez,et al.  Measuring the aorta in the era of multimodality imaging: still to be agreed. , 2017, Journal of thoracic disease.

[12]  G. Corrado,et al.  End-to-end lung cancer screening with three-dimensional deep learning on low-dose chest computed tomography , 2019, Nature Medicine.

[13]  L. Quint,et al.  Proximal thoracic aortic diameter measurements at CT: repeatability and reproducibility according to measurement method , 2013, The International Journal of Cardiovascular Imaging.

[14]  Ella A. Kazerooni,et al.  Guidelines for the Diagnosis and Management of Patients With Thoracic Aortic Disease , 2010 .

[15]  D. Bluemke,et al.  How to Measure the Aorta Using MRI: A Practical Guide , 2020, Journal of magnetic resonance imaging : JMRI.

[16]  Marcus A. Badgeley,et al.  Confounding variables can degrade generalization performance of radiological deep learning models , 2018, ArXiv.

[17]  Weilong Yang,et al.  Beyond Synthetic Noise: Deep Learning on Controlled Noisy Labels , 2019, ICML.

[18]  Sadeer G. Al-Kindi,et al.  Epidemiology of fatal ruptured aortic aneurysms in the United States (1999‐2016) , 2019, Journal of vascular surgery.

[19]  Hung-mo Lin,et al.  A prospective study of growth and rupture risk of small-to-moderate size ascending aortic aneurysms. , 2014, The Journal of thoracic and cardiovascular surgery.

[20]  Jitendra Malik,et al.  Expert-level detection of acute intracranial hemorrhage on head computed tomography using deep learning , 2019, Proceedings of the National Academy of Sciences.

[21]  J. Powell,et al.  The role of diameter versus volume as the best prognostic measurement of abdominal aortic aneurysms. , 2013, Journal of vascular surgery.

[22]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[23]  Alka Sharma,et al.  ABDOMINAL AORTIC ANEURYSM: A COMPREHENSIVE REVIEW , 2011 .

[24]  S. Lemaire,et al.  Epidemiology of thoracic aortic dissection , 2011, Nature Reviews Cardiology.

[25]  F. Ecarnot,et al.  Measurement of maximum diameter of native abdominal aortic aneurysm by angio-CT: reproducibility is better with the semi-automated method. , 2014, European journal of vascular and endovascular surgery : the official journal of the European Society for Vascular Surgery.

[26]  Stephane Elkouri,et al.  Reproducibility of Abdominal Aortic Aneurysm Diameter Measurement and Growth Evaluation on Axial and Multiplanar Computed Tomography Reformations , 2012, CardioVascular and Interventional Radiology.