Artificial intelligence-assisted interpretation of bone age radiographs improves accuracy and decreases variability

ObjectiveRadiographic bone age assessment (BAA) is used in the evaluation of pediatric endocrine and metabolic disorders. We previously developed an automated artificial intelligence (AI) deep learning algorithm to perform BAA using convolutional neural networks. We compared the BAA performance of a cohort of pediatric radiologists with and without AI assistance.Materials and methodsSix board-certified, subspecialty trained pediatric radiologists interpreted 280 age- and gender-matched bone age radiographs ranging from 5 to 18 years. Three of those radiologists then performed BAA with AI assistance. Bone age accuracy and root mean squared error (RMSE) were used as measures of accuracy. Intraclass correlation coefficient evaluated inter-rater variation.ResultsAI BAA accuracy was 68.2% overall and 98.6% within 1 year, and the mean six-reader cohort accuracy was 63.6 and 97.4% within 1 year. AI RMSE was 0.601 years, while mean single-reader RMSE was 0.661 years. Pooled RMSE decreased from 0.661 to 0.508 years, all individually decreasing with AI assistance. ICC without AI was 0.9914 and with AI was 0.9951.ConclusionsAI improves radiologist’s bone age assessment by increasing accuracy and decreasing variability and RMSE. The utilization of AI by radiologists improves performance compared to AI alone, a radiologist alone, or a pooled cohort of experts. This suggests that AI may optimally be utilized as an adjunct to radiologist interpretation of imaging studies to improve performance.

[1]  Soo Young Kim,et al.  Comparison of the Greulich-Pyle and Tanner Whitehouse (TW3) Methods in Bone age Assessment , 2008 .

[2]  R. Barzilay,et al.  High-Risk Breast Lesions: A Machine Learning Model to Predict Pathologic Upgrade and Reduce Unnecessary Surgical Excision. , 2017, Radiology.

[3]  Subhashini Venugopalan,et al.  Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs. , 2016, JAMA.

[4]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[5]  Osman Ratib,et al.  Hand Bone Age: A Digital Atlas of Skeletal Maturity , 2005 .

[6]  A. Ehrenberg Assessment of Skeletal Maturity and Prediction of Adult Height (Tw2 Method) , 1977 .

[7]  W. Shim,et al.  Computerized Bone Age Estimation Using Deep Learning Based Program: Evaluation of the Accuracy and Efficiency. , 2017, AJR. American journal of roentgenology.

[8]  Raúl San José Estépar,et al.  Disease Staging and Prognosis in Smokers Using Deep Learning in Chest Computed Tomography , 2018, American journal of respiratory and critical care medicine.

[9]  C. Langlotz,et al.  Performance of a Deep-Learning Neural Network Model in Assessing Skeletal Maturity on Pediatric Hand Radiographs. , 2017, Radiology.

[10]  Roland Maas,et al.  Domain-Specific Utterance End-Point Detection for Speech Recognition , 2017, INTERSPEECH.

[11]  Bram van Ginneken,et al.  Fast Convolutional Neural Network Training Using Selective Data Sampling: Application to Hemorrhage Detection in Color Fundus Images , 2016, IEEE Transactions on Medical Imaging.

[12]  Sven Kreiborg,et al.  The BoneXpert Method for Automated Determination of Skeletal Maturity , 2009, IEEE Transactions on Medical Imaging.

[13]  Omar Abuzaghleh,et al.  Noninvasive Real-Time Automated Skin Lesion Analysis System for Melanoma Early Detection and Prevention , 2015, IEEE Journal of Translational Engineering in Health and Medicine.

[14]  W. Greulich,et al.  Radiographic Atlas of Skeletal Development of the Hand and Wrist , 1999 .

[15]  Jenny Lee,et al.  Fully Automated Deep Learning System for Bone Age Assessment , 2017, Journal of Digital Imaging.

[16]  D. Michael,et al.  HANDX: a model-based system for automatic segmentation of bones from digital hand radiographs. , 1989, IEEE transactions on medical imaging.

[17]  D. King,et al.  Reproducibility of bone ages when performed by radiology registrars: an audit of Tanner and Whitehouse II versus Greulich and Pyle methods. , 1994, The British journal of radiology.

[18]  P. Lakhani,et al.  Deep Learning at Chest Radiography: Automated Classification of Pulmonary Tuberculosis by Using Convolutional Neural Networks. , 2017, Radiology.

[19]  Vicente Gilsanz,et al.  Hand bone age , 2012 .

[20]  Georg Fuchs,et al.  Pixel-Level Deep Segmentation: Artificial Intelligence Quantifies Muscle on Computed Tomography for Body Morphometric Analysis , 2017, Journal of Digital Imaging.

[21]  E Pietka,et al.  Digital hand atlas and web-based bone age assessment: system design and implementation. , 2000, Computerized medical imaging and graphics : the official journal of the Computerized Medical Imaging Society.

[22]  Martin Wattenberg,et al.  Google’s Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation , 2016, TACL.

[23]  Demis Hassabis,et al.  Mastering the game of Go without human knowledge , 2017, Nature.

[24]  Amanda Porterfield The Great Awakening , 2020 .

[25]  Sebastian Thrun,et al.  Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[26]  J. Talairach,et al.  L??EXPLORATION CHIRURGICALE ST??R??OTAXIQUE DU LOBE TEMPORAL DANS L??EPILEPSIE TEMPORALE , 1959 .