An Interpretable Approach to Automated Severity Scoring in Pelvic Trauma

Pelvic ring disruptions result from blunt injury mechanisms and are often found in patients with multi-system trauma. To grade pelvic fracture severity in trauma victims based on whole-body CT, the Tile AO/OTA classification is frequently used. Due to the high volume of whole-body trauma CTs generated in busy trauma centers, an automated approach to Tile classification would provide substantial value, e. g., to prioritize the reading queue of the attending trauma radiologist. In such scenario, an automated method should perform grading based on a transparent process and based on interpretable features to enable interaction with human readers and lower their workload by offering insights from a first automated read of the scan. This paper introduces an automated yet interpretable pelvic trauma decision support system to assist radiologists in fracture detection and Tile grade classification. The method operates similarly to human interpretation of CT scans and first detects distinct pelvic fractures on CT with high specificity using a Faster-RCNN model that are then interpreted using a structural causal model based on clinical best practices to infer an initial Tile grade. The Bayesian causal model and finally, the object detector are then queried for likely co-occurring fractures that may have been rejected initially due to the highly specific operating point of the detector, resulting in an updated list of detected fractures and corresponding final Tile grade. Our method is transparent in that it provides finding location and type using the object detector, as well as information on important counterfactuals that would invalidate the system’s recommendation and achieves an AUC of 83.3%/85.1% for translational/rotational instability. Despite being designed for human-machine teaming, our approach does not compromise on performance compared to previous black-box approaches.

[1]  M Tile,et al.  Pelvic ring fractures: should they be fixed? , 1988, The Journal of bone and joint surgery. British volume.

[2]  F. Becce,et al.  Interobserver reliability of the Tile classification system for pelvic fractures among radiologists and surgeons , 2020, European Radiology.

[3]  Yoshua Bengio,et al.  Towards Causal Representation Learning , 2021, ArXiv.

[4]  Le Lu,et al.  A scalable physician-level deep learning algorithm detects universal trauma on pelvic radiographs , 2021, Nature Communications.

[5]  David Duvenaud,et al.  Explaining Image Classifiers by Counterfactual Generation , 2018, ICLR.

[6]  M. McHugh Interrater reliability: the kappa statistic , 2012, Biochemia medica.

[7]  Michael Werman,et al.  Detection of distal radius fractures trained by a small set of X-ray images and Faster R-CNN , 2018, Advances in Intelligent Systems and Computing.

[9]  P. Lambin,et al.  Deep learning in fracture detection: a narrative review , 2020, Acta orthopaedica.

[10]  Daniel C. Castro,et al.  Causality matters in medical imaging , 2019, Nature Communications.

[11]  Mathias Unberath,et al.  An Automated Deep Learning Method for Tile AO/OTA Pelvic Fracture Severity Grading from Trauma whole-Body CT , 2021, Journal of Digital Imaging.

[12]  D. Dreizin,et al.  Can MDCT Unmask Instability in Binder-Stabilized Pelvic Ring Disruptions? , 2016, AJR. American journal of roentgenology.

[13]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Joseph E. Burns,et al.  Artificial Intelligence in Musculoskeletal Imaging: A Paradigm Shift , 2019, Journal of bone and mineral research : the official journal of the American Society for Bone and Mineral Research.

[15]  D. Dreizin Commentary on "Multidetector CT in Vascular Injuries Resulting from Pelvic Fractures". , 2019, Radiographics : a review publication of the Radiological Society of North America, Inc.

[16]  A. V. van Vugt,et al.  [Acute pelvic fractures]. , 2009, Nederlands tijdschrift voor geneeskunde.

[17]  Yoichi Sato,et al.  A Computer-Aided Diagnosis System Using Artificial Intelligence for Hip Fractures -Multi-Institutional Joint Development Research- , 2020 .

[18]  D. Dreizin,et al.  Blunt polytrauma: evaluation with 64-section whole-body CT angiography. , 2012, Radiographics : a review publication of the Radiological Society of North America, Inc.

[19]  Andrea Vedaldi,et al.  Understanding Deep Networks via Extremal Perturbations and Smooth Masks , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[20]  Marvin Tile,et al.  Acute Pelvic Fractures: I. Causation and Classification , 1996, The Journal of the American Academy of Orthopaedic Surgeons.

[21]  R. Vaidya,et al.  Patients with pelvic fractures from blunt trauma. What is the cause of mortality and when? , 2016, American journal of surgery.

[22]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[23]  Syed Saqlain Hassan,et al.  Lower Leg Bone Fracture Detection and Classification Using Faster RCNN for X-Rays Images , 2020, 2020 IEEE 23rd International Multitopic Conference (INMIC).

[24]  A. Peitzman,et al.  Pelvic trauma: WSES classification and guidelines , 2017, World Journal of Emergency Surgery.

[25]  Anna Goldenberg,et al.  What Clinicians Want: Contextualizing Explainable Machine Learning for Clinical End Use , 2019, MLHC.

[26]  Eliot Siegel,et al.  CT Prediction Model for Major Arterial Injury after Blunt Pelvic Ring Disruption. , 2018, Radiology.

[27]  Ankur Teredesai,et al.  Interpretable Machine Learning in Healthcare , 2018, BCB.

[28]  Katja Bühler,et al.  Domain aware medical image classifier interpretation by counterfactual impact analysis , 2020, MICCAI.