Assessing the speed-accuracy trade-offs of popular convolutional neural networks for single-crop rib fracture classification

Rib fractures are injuries commonly assessed in trauma wards. Deep learning has demonstrated state-of-the-art accuracy for a variety of tasks, including image classification. This paper assesses the speed-accuracy trade-offs and general suitability of four popular convolutional neural networks to classify rib fractures from axial computed tomography imagery. We transfer learned InceptionV3, ResNet50, MobileNetV2, and VGG16 models, additionally training "decomposed" models comprised of taking only the first n blocks for each block for each architecture. Given that acute (new) fractures are generally most important to detect, we trained two types of models: a classful model with classes acute, old (healed), and normal (non-fractured); and a binary model with acute vs. the other classes. We found that the first 7 blocks of InceptionV3 achieved the best results and general speed-accuracy trade-off. The classful model achieved a 5-fold cross-validation average accuracy and macro recall of 96.00% and 94.0%, respectively. The binary model achieved a 5-fold cross-validation average accuracy, macro recall, and area under receiver operator characteristic curve of 97.76%, 94.6%, and 94.7%, respectively. On a Windows 10 PC with 32GB RAM and an Nvidia 1080ti GPU, the model's average CPU and GPU per-crop inference times were 13.6 and 12.2 ms, respectively. Compared to the InceptionV3 Block 7 classful model, a radiologist with 9 years of experience was less accurate but more sensitive to acute fractures; meanwhile, the deep learning model had fewer false positive diagnoses and better sensitivity to old fractures and normal ribs. The Cohen's Kappa between the two was 0.813.

[1]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  A. Razavian,et al.  Artificial intelligence for analyzing orthopedic trauma radiographs , 2017, Acta orthopaedica.

[3]  Seok-Bum Ko,et al.  Early detection of ankylosing spondylitis using texture features and statistical machine learning, and deep learning, with some patient age analysis , 2020, Comput. Medical Imaging Graph..

[4]  Xindao Yin,et al.  Automatic Detection and Classification of Rib Fractures on Thoracic CT Using Convolutional Neural Network: Accuracy and Feasibility , 2020, Korean journal of radiology.

[5]  Yi Wang,et al.  Retinal blood vessel segmentation using fully convolutional network with transfer learning , 2018, Comput. Medical Imaging Graph..

[6]  K. Jarrod Millman,et al.  Array programming with NumPy , 2020, Nat..

[7]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[8]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Xavier Robin,et al.  pROC: an open-source package for R and S+ to analyze and compare ROC curves , 2011, BMC Bioinformatics.

[10]  Seok-Bum Ko,et al.  Stride 2 1-D, 2-D, and 3-D Winograd for Convolutional Neural Networks , 2020, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[11]  M. Beer,et al.  Patterns of serial rib fractures after blunt chest trauma: An analysis of 380 cases , 2019, PloS one.

[12]  Katsumi Abe,et al.  A Computer-Assisted System for Diagnostic Workstations: Automated Bone Labeling for CT Images , 2008, Journal of Digital Imaging.

[13]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[14]  Max A. Viergever,et al.  Automatic rib segmentation and labeling in computed tomography scans using a general framework for detection, recognition and segmentation of objects in volumetric data , 2007, Medical Image Anal..

[15]  Seok-Bum Ko,et al.  License plate segmentation and recognition system using deep learning and OpenVINO , 2020 .

[16]  D. Ziegler,et al.  The morbidity and mortality of rib fractures. , 1994, The Journal of trauma.

[17]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[18]  Frank Hutter,et al.  SGDR: Stochastic Gradient Descent with Warm Restarts , 2016, ICLR.

[19]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[20]  James H Thrall,et al.  Artificial Intelligence and Machine Learning in Radiology: Opportunities, Challenges, Pitfalls, and Criteria for Success. , 2018, Journal of the American College of Radiology : JACR.

[21]  Gregor Sommer,et al.  Assessment of a Deep Learning Algorithm for the Detection of Rib Fractures on Whole-Body Trauma Computed Tomography , 2019, Korean journal of radiology.

[22]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[23]  D. H. Kim,et al.  Artificial intelligence in fracture detection: transfer learning from deep convolutional neural networks. , 2017, Clinical radiology.

[24]  Grzegorz Soza,et al.  The ribs unfolded - a CT visualization algorithm for fast detection of rib fractures: effect on sensitivity and specificity in trauma patients , 2015, European Radiology.

[25]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).