Validation of a Deep Learning Model to Screen for Glaucoma Using Images from Different Fundus Cameras and Data Augmentation.

PURPOSE To validate a deep residual learning algorithm to diagnose glaucoma from fundus photography using different fundus cameras at different institutes. DESIGN Cross-sectional study. PARTICIPANTS A training dataset consisted of 1364 color fundus photographs with glaucomatous indications and 1768 color fundus photographs without glaucomatous features. Two testing datasets consisted of (1) 95 images of 95 glaucomatous eyes and 110 images of 110 normative eyes, and (2) 93 images of 93 glaucomatous eyes and 78 images of 78 normative eyes. METHODS A deep learning algorithm known as Residual Network (ResNet) was used to diagnose glaucoma using a training dataset. The 2 testing datasets were obtained using different fundus cameras (different manufacturers) across multiple institutes. The size of the training data was artificially increased by adding minor alterations to the original data, known as "image augmentation." Diagnostic accuracy was assessed using the area under the receiver operating characteristic curve (AROC). MAIN OUTCOME MEASURES Area under the receiver operating characteristic curve. RESULTS When image augmentation was not used, the AROC was 94.8% (90.3-96.8) in the first testing dataset and 99.7% (99.4-100.0) in the second dataset. These AROC values were significantly (P < 0.05) smaller without augmentation (87.7% [82.8-92.6] in the first testing dataset and 94.5% [91.3-97.6] in the second testing dataset). CONCLUSIONS The previously developed deep residual learning algorithm achieved high diagnostic performance with different fundus cameras across multiple institutes, in particular when image augmentation was used.

[1]  J. J. Wang,et al.  The relationship between glaucoma and myopia: the Blue Mountains Eye Study. , 1999, Ophthalmology.

[2]  Yusuke Arai,et al.  Applying artificial intelligence to disease staging: Deep learning for improved staging of diabetic retinopathy , 2017, PloS one.

[3]  Subhashini Venugopalan,et al.  Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs. , 2016, JAMA.

[4]  R. Hitchings,et al.  The optic disc in glaucoma. I: Classification. , 1976, The British journal of ophthalmology.

[5]  M. He,et al.  Efficacy of a Deep Learning System for Detecting Glaucomatous Optic Neuropathy Based on Color Fundus Photographs. , 2018, Ophthalmology.

[6]  E. Finkelstein,et al.  Development and Validation of a Deep Learning System for Diabetic Retinopathy and Related Eye Diseases Using Retinal Images From Multiethnic Populations With Diabetes , 2017, JAMA.

[7]  Yoshua Bengio,et al.  How transferable are features in deep neural networks? , 2014, NIPS.

[8]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[9]  S. Holm A Simple Sequentially Rejective Multiple Test Procedure , 1979 .

[10]  A. Iwase,et al.  Risk factors for open-angle glaucoma in a Japanese population: the Tajimi Study. , 2006, Ophthalmology.

[11]  S. K. Seah,et al.  Population prevalence of tilted and torted optic discs among an adult Chinese population in Singapore: the Tanjong Pagar Study. , 2009, Archives of ophthalmology.

[12]  J. Fujimoto,et al.  Optical Coherence Tomography , 1991 .

[13]  M. Aickin,et al.  Adjusting for multiple testing when reporting research results: the Bonferroni vs Holm methods. , 1996, American journal of public health.

[14]  J. Jonas,et al.  High myopia and glaucoma susceptibility the Beijing Eye Study. , 2007, Ophthalmology.

[15]  B. S. Carlson,et al.  Comparison of modern CCD and CMOS image sensor technologies and systems for low resolution imaging , 2002, Proceedings of IEEE Sensors.

[16]  A. Iwase,et al.  Refractive errors in an elderly Japanese population: the Tajimi study. , 2008, Ophthalmology.

[17]  Hiroshi Murata,et al.  Development of a deep residual learning algorithm to screen for glaucoma from fundus photography , 2018, Scientific Reports.

[18]  Peter H Whincup,et al.  Ethnic differences in the prevalence of myopia and ocular biometry in 10- and 11-year-old children: the Child Heart and Health Study in England (CHASE). , 2010, Investigative ophthalmology & visual science.

[19]  Louis Tong,et al.  Myopia-related optic disc and retinal changes in adolescent children from singapore. , 2011, Ophthalmology.

[20]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[21]  Weidong (Tom) Cai,et al.  A Deep Learning-Based Algorithm Identifies Glaucomatous Discs Using Monoscopic Fundus Photographs. , 2018, Ophthalmology. Glaucoma.

[22]  Hiroshi Murata,et al.  Using Deep Learning and Transfer Learning to Accurately Diagnose Early-Onset Glaucoma From Macular Optical Coherence Tomography Images. , 2019, American journal of ophthalmology.

[23]  T. Wong,et al.  Refractive error, axial dimensions, and primary open-angle glaucoma: the Singapore Malay Eye Study. , 2010, Archives of ophthalmology.

[24]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[25]  E. DeLong,et al.  Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. , 1988, Biometrics.

[26]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.