Automated quality control in image segmentation: application to the UK Biobank cardiovascular magnetic resonance imaging study

BackgroundThe trend towards large-scale studies including population imaging poses new challenges in terms of quality control (QC). This is a particular issue when automatic processing tools such as image segmentation methods are employed to derive quantitative measures or biomarkers for further analyses. Manual inspection and visual QC of each segmentation result is not feasible at large scale. However, it is important to be able to automatically detect when a segmentation method fails in order to avoid inclusion of wrong measurements into subsequent analyses which could otherwise lead to incorrect conclusions.MethodsTo overcome this challenge, we explore an approach for predicting segmentation quality based on Reverse Classification Accuracy, which enables us to discriminate between successful and failed segmentations on a per-cases basis. We validate this approach on a new, large-scale manually-annotated set of 4800 cardiovascular magnetic resonance (CMR) scans. We then apply our method to a large cohort of 7250 CMR on which we have performed manual QC.ResultsWe report results used for predicting segmentation quality metrics including Dice Similarity Coefficient (DSC) and surface-distance measures. As initial validation, we present data for 400 scans demonstrating 99% accuracy for classifying low and high quality segmentations using the predicted DSC scores. As further validation we show high correlation between real and predicted scores and 95% classification accuracy on 4800 scans for which manual segmentations were available. We mimic real-world application of the method on 7250 CMR where we show good agreement between predicted quality metrics and manual visual QC scores.ConclusionsWe show that Reverse classification accuracy has the potential for accurate and fully automatic segmentation QC on a per-case basis in the context of large-scale population imaging as in the UK Biobank Imaging Study.

[1]  Ben Glocker,et al.  Stratified Decision Forests for Accurate Anatomical Landmark Localization in Cardiac Images , 2017, IEEE Transactions on Medical Imaging.

[2]  Alejandro F. Frangi,et al.  Automated Quality Assessment of Cardiac MR Images Using Convolutional Neural Networks , 2016, SASHIMI@MICCAI.

[3]  P. Matthews,et al.  UK Biobank’s cardiovascular magnetic resonance protocol , 2015, Journal of Cardiovascular Magnetic Resonance.

[4]  Ian Horrocks,et al.  Towards the Semantic Enrichment of Free-Text Annotation of Image Quality Assessment for UK Biobank Cardiac Cine MRI Scans , 2016, LABELS/DLMIA@MICCAI.

[5]  P. Elliott,et al.  UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age , 2015, PLoS medicine.

[6]  Konstantinos Kamnitsas,et al.  Reverse Classification Accuracy: Predicting Segmentation Performance in the Absence of Ground Truth , 2017, IEEE Transactions on Medical Imaging.

[7]  Allan Hanbury,et al.  Metrics for evaluating 3D medical image segmentation: analysis, selection, and tool , 2015, BMC Medical Imaging.

[8]  Marleen de Bruijne,et al.  Machine learning approaches in medical image analysis: From detection to diagnosis , 2016, Medical Image Anal..

[9]  Daniel Rueckert,et al.  A Probabilistic Patch-Based Label Fusion Model for Multi-Atlas Segmentation With Registration Refinement: Application to Cardiac MR Images , 2013, IEEE Transactions on Medical Imaging.

[10]  Aabid Shariff,et al.  Automated Image Analysis for High-Content Screening and Analysis , 2010, Journal of biomolecular screening.

[11]  Ben Glocker,et al.  Encoding atlases by randomized classification forests for efficient multi-atlas label propagation , 2014, Medical Image Anal..

[12]  Qiang Yang,et al.  Cross Validation Framework to Choose amongst Models and Datasets for Transfer Learning , 2010, ECML/PKDD.

[13]  Ben Glocker,et al.  Automatic Quality Control of Cardiac MRI Segmentation in Large-Scale Population Imaging , 2017, MICCAI.

[14]  Jackie A Cooper,et al.  The impact of cardiovascular risk factors on cardiac structure and function: Insights from the UK Biobank imaging enhancement study , 2017, PloS one.

[15]  Ben Glocker,et al.  Real-time Prediction of Segmentation Quality , 2018, MICCAI.

[16]  Stefan K. Piechnik,et al.  Reference ranges for cardiac structure and function using cardiovascular magnetic resonance (CMR) in Caucasians from the UK Biobank population cohort , 2017, Journal of Cardiovascular Magnetic Resonance.

[17]  Oscar Camara,et al.  Generalized Overlap Measures for Evaluation and Validation in Medical Image Analysis , 2006, IEEE Transactions on Medical Imaging.

[18]  Ben Glocker,et al.  Human-level CMR image analysis with deep fully convolutional networks , 2017, ArXiv.

[19]  Ian Davidson,et al.  Reverse testing: an efficient framework to select amongst classifiers under sample selection bias , 2006, KDD '06.