Ensemble neural network model for detecting thyroid eye disease using external photographs

Purpose To describe an artificial intelligence platform that detects thyroid eye disease (TED). Design Development of a deep learning model. Methods 1944 photographs from a clinical database were used to train a deep learning model. 344 additional images (‘test set’) were used to calculate performance metrics. Receiver operating characteristic, precision–recall curves and heatmaps were generated. From the test set, 50 images were randomly selected (‘survey set’) and used to compare model performance with ophthalmologist performance. 222 images obtained from a separate clinical database were used to assess model recall and to quantitate model performance with respect to disease stage and grade. Results The model achieved test set accuracy of 89.2%, specificity 86.9%, recall 93.4%, precision 79.7% and an F1 score of 86.0%. Heatmaps demonstrated that the model identified pixels corresponding to clinical features of TED. On the survey set, the ensemble model achieved accuracy, specificity, recall, precision and F1 score of 86%, 84%, 89%, 77% and 82%, respectively. 27 ophthalmologists achieved mean performance of 75%, 82%, 63%, 72% and 66%, respectively. On the second test set, the model achieved recall of 91.9%, with higher recall for moderate to severe (98.2%, n=55) and active disease (98.3%, n=60), as compared with mild (86.8%, n=68) or stable disease (85.7%, n=63). Conclusions The deep learning classifier is a novel approach to identify TED and is a first step in the development of tools to improve diagnostic accuracy and lower barriers to specialist evaluation.

[1]  Aaron Y. Lee,et al.  PeriorbitAI: Artificial intelligence automation of eyelid and periorbital measurements. , 2021, American journal of ophthalmology.

[2]  R. Kersten,et al.  Teprotumumab: interpreting the clinical trials in the context of thyroid eye disease pathogenesis and current therapies. , 2021, Ophthalmology.

[3]  Hyungyu Lee,et al.  Early detection of diabetic retinopathy based on deep learning and ultra-wide-field fundus images , 2021, Scientific Reports.

[4]  Begonya Garcia-Zapirain,et al.  Breast Cancer Histopathology Image Classification Using an Ensemble of Deep Learning Models , 2020, Sensors.

[5]  L. Heindl,et al.  A Simple Standardized Three-dimensional Anthropometry for the Periocular Region in a European Population. , 2019, Plastic and reconstructive surgery.

[6]  David H. Way,et al.  A deep learning system for differential diagnosis of skin diseases , 2019, Nature Medicine.

[7]  Achim Hekler,et al.  Deep learning outperformed 136 of 157 dermatologists in a head-to-head dermoscopic melanoma image classification task. , 2019, European journal of cancer.

[8]  April Y. Maa,et al.  Deep Learning and Glaucoma Specialists: The Relative Importance of Optic Disc Features to Predict Glaucoma Referral in Fundus Photographs. , 2018, Ophthalmology.

[9]  E. Agrón,et al.  DeepSeeNet: A deep learning model for automated classification of patient-based age-related macular degeneration severity from color fundus photographs , 2018, Ophthalmology.

[10]  Rajiv Raman,et al.  Fundus photograph-based deep learning algorithms in detecting diabetic retinopathy , 2018, Eye.

[11]  Cong Wu,et al.  Application of Transfer Learning in the Recognition of TAO , 2018, 2018 13th International Conference on Computer Science & Education (ICCSE).

[12]  H. Haenssle,et al.  Man against machine: diagnostic performance of a deep learning convolutional neural network for dermoscopic melanoma recognition in comparison to 58 dermatologists , 2018, Annals of oncology : official journal of the European Society for Medical Oncology.

[13]  Marcus A. Badgeley,et al.  Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: A cross-sectional study , 2018, PLoS medicine.

[14]  A. Peters,et al.  A Deep Learning Algorithm for Prediction of Age-Related Eye Disease Study Severity Scale for Age-Related Macular Degeneration from Color Fundus Photography. , 2018, Ophthalmology.

[15]  M. He,et al.  Efficacy of a Deep Learning System for Detecting Glaucomatous Optic Neuropathy Based on Color Fundus Photographs. , 2018, Ophthalmology.

[16]  E. Finkelstein,et al.  Development and Validation of a Deep Learning System for Diabetic Retinopathy and Related Eye Diseases Using Retinal Images From Multiethnic Populations With Diabetes , 2017, JAMA.

[17]  C. Meinel,et al.  Deep Learning for Medical Image Analysis , 2017, ArXiv.

[18]  M. Nardi,et al.  Does Graves’ Orbitopathy Ever Disappear Answers to an Old Question , 2017, European Thyroid Journal.

[19]  Heung-Il Suk,et al.  Deep Learning in Medical Image Analysis. , 2017, Annual review of biomedical engineering.

[20]  Terry J. Smith,et al.  Teprotumumab for Thyroid‐Associated Ophthalmopathy , 2017, The New England journal of medicine.

[21]  P. Lakhani,et al.  Deep Learning at Chest Radiography: Automated Classification of Pulmonary Tuberculosis by Using Convolutional Neural Networks. , 2017, Radiology.

[22]  P. Perros,et al.  Management of thyroid eye disease in the United Kingdom: A multi-centre thyroid eye disease audit , 2017, Orbit.

[23]  M. Nardi,et al.  Does early response to intravenous glucocorticoids predict the final outcome in patients with moderate-to-severe and active Graves’ orbitopathy? , 2017, Journal of Endocrinological Investigation.

[24]  Sebastian Thrun,et al.  Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[25]  Subhashini Venugopalan,et al.  Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs. , 2016, JAMA.

[26]  Ramprasaath R. Selvaraju,et al.  Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization , 2016, International Journal of Computer Vision.

[27]  Luiz Eduardo Soares de Oliveira,et al.  Breast cancer histopathological image classification using Convolutional Neural Networks , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[28]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[30]  M. Nardi,et al.  Spontaneous improvement of untreated mild Graves' ophthalmopathy: Rundle's curve revisited. , 2014, Thyroid : official journal of the American Thyroid Association.

[31]  Davis E. King,et al.  Dlib-ml: A Machine Learning Toolkit , 2009, J. Mach. Learn. Res..

[32]  C. Dayan,et al.  The patient experience of services for thyroid eye disease in the United Kingdom: results of a nationwide survey. , 2009, European journal of endocrinology.

[33]  M. Nardi,et al.  Consensus statement of the European Group on Graves' orbitopathy (EUGOGO) on management of GO. , 2008, European journal of endocrinology.

[34]  W. Wiersinga,et al.  Epidemiology and prevention of Graves' ophthalmopathy. , 2002, Thyroid : official journal of the American Thyroid Association.

[35]  Rich Caruana,et al.  Overfitting in Neural Nets: Backpropagation, Conjugate Gradient, and Early Stopping , 2000, NIPS.

[36]  J. Orcutt,et al.  Graves' ophthalmopathy. , 1992, The Western journal of medicine.