Not Color Blind: AI Predicts Racial Identity from Black and White Retinal Vessel Segmentations

Background Artificial intelligence (AI) may demonstrate racial biases when skin or choroidal pigmentation is present in medical images. Recent studies have shown that convolutional neural networks (CNNs) can predict race from images that were not previously thought to contain race-specific features. We evaluate whether grayscale retinal vessel maps (RVMs) of patients screened for retinopathy of prematurity (ROP) contain race-specific features. Methods 4095 retinal fundus images (RFIs) were collected from 245 Black and White infants. A U-Net generated RVMs from RFIs, which were subsequently thresholded, binarized, or skeletonized. To determine whether RVM differences between Black and White eyes were physiological, CNNs were trained to predict race from color RFIs, raw RVMs, and thresholded, binarized, or skeletonized RVMs. Area under the precision-recall curve (AUC-PR) was evaluated. Findings CNNs predicted race from RFIs near perfectly (image-level AUC-PR: 0.999, subject-level AUC-PR: 1.000). Raw RVMs were almost as informative as color RFIs (image-level AUC-PR: 0.938, subject-level AUC-PR: 0.995). Ultimately, CNNs were able to detect whether RFIs or RVMs were from Black or White babies, regardless of whether images contained color, vessel segmentation brightness differences were nullified, or vessel segmentation widths were normalized. Interpretation AI can detect race from grayscale RVMs that were not thought to contain racial information. Two potential explanations for these findings are that: retinal vessels physiologically differ between Black and White babies or the U-Net segments the retinal vasculature differently for various fundus pigmentations. Either way, the implications remain the same: AI algorithms have potential to demonstrate racial bias in practice, even when preliminary attempts to remove such information from the underlying images appear to be successful.

[1]  Tien Yin Wong,et al.  Genetic determinants of retinal vascular caliber: additional insights into hypertension pathogenesis. , 2006, Hypertension.

[2]  R. Bourne,et al.  Ethnicity and ocular imaging , 2011, Eye.

[3]  Anna L. Ells,et al.  The International Classification of Retinopathy of Prematurity revisited. , 2005, Archives of ophthalmology.

[4]  Neil J. Joshi,et al.  Addressing Artificial Intelligence Bias in Retinal Diagnostics , 2021, Translational vision science & technology.

[5]  Rangasami L. Kashyap,et al.  Building Skeleton Models via 3-D Medial Surface/Axis Thinning Algorithms , 1994, CVGIP Graph. Model. Image Process..

[6]  B. DeGeorge,et al.  Racial and Gender Discrimination in Hand Surgery Letters of Recommendation. , 2021, The Journal of hand surgery.

[7]  Other Contributors Are Indicated Where They Contribute Python Software Foundation , 2017 .

[8]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Klaus H. Maier-Hein,et al.  Retina U-Net: Embarrassingly Simple Exploitation of Segmentation Supervision for Medical Object Detection , 2018, ML4H@NeurIPS.

[10]  William V Good,et al.  The Incidence and Course of Retinopathy of Prematurity: Findings From the Early Treatment for Retinopathy of Prematurity Study , 2005, Pediatrics.

[11]  Saptarshi Purkayastha,et al.  AI recognition of patient race in medical imaging: a modelling study , 2021, The Lancet. Digital health.

[12]  Catherine M. Appleton,et al.  Determinants of Mammographic Breast Density by Race Among a Large Screening Population , 2020, JNCI cancer spectrum.

[13]  Tamera Coyne-Beasley,et al.  Implicit Racial/Ethnic Bias Among Health Care Professionals and Its Influence on Health Care Outcomes: A Systematic Review. , 2015, American journal of public health.

[14]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[15]  R. Klein,et al.  Retinal vascular caliber, cardiovascular risk factors, and inflammation: the multi-ethnic study of atherosclerosis (MESA). , 2006, Investigative ophthalmology & visual science.

[16]  R. Elk The intersection of racism, discrimination, bias, and homophobia toward African American sexual minority patients with cancer within the health care system , 2021, Cancer.

[17]  James M. Brown,et al.  Applications of Arti fi cial Intelligence for Retinopathy of Prematurity Screening , 2022 .

[18]  Ludvig Renbo Olsen Creating Groups from Data [R package groupdata2 version 1.3.0] , 2020 .

[19]  Christiane,et al.  World Medical Association Declaration of Helsinki: ethical principles for medical research involving human subjects. , 2004, Journal international de bioethique = International journal of bioethics.

[20]  Ronald Klein,et al.  Genome-Wide Linkage Study of Retinal Vessel Diameters in the Beaver Dam Eye Study , 2006, Hypertension.

[21]  Wei Lu,et al.  Application of machine learning in ophthalmic imaging modalities , 2020, Eye and Vision.

[22]  P. Mitchell,et al.  Ethnic variability in retinal vessel caliber: a potential source of measurement error from ocular pigmentation?--the Sydney Childhood Eye Study. , 2008, Investigative ophthalmology & visual science.

[23]  W. Shalash,et al.  Diabetic retinopathy detection through deep learning techniques: A review , 2020, Informatics in Medicine Unlocked.

[24]  J. Ko,et al.  Foundational Considerations for Artificial Intelligence Utilizing Ophthalmic Images. , 2021, Ophthalmology.

[25]  Wei Liang,et al.  Microaneurysms segmentation with a U-Net based on recurrent residual convolutional neural network , 2019, Journal of medical imaging.

[26]  Weight Stigma and Disease and Disability Concepts of Obesity: A Survey of the German Population , 2021, Obesity Facts.

[27]  R. Pollack,et al.  Racial differences in pigmentation of the Fundus oculi , 1967 .

[28]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[29]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[30]  E. Finkelstein,et al.  Development and Validation of a Deep Learning System for Diabetic Retinopathy and Related Eye Diseases Using Retinal Images From Multiethnic Populations With Diabetes , 2017, JAMA.

[31]  A. Adamson,et al.  Machine Learning and Health Care Disparities in Dermatology. , 2018, JAMA dermatology.

[32]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[33]  R. Khazanchi,et al.  Racial/Ethnic Inequities in Healthcare-associated Infections Under the Shadow of Structural Racism: Narrative Review and Call to Action , 2021, Current Infectious Disease Reports.

[34]  T. Wong,et al.  Racial differences in retinal vessel geometric characteristics: a multiethnic study in healthy Asians. , 2013, Investigative ophthalmology & visual science.

[35]  James M. Brown,et al.  Evaluation of artificial intelligence-based telemedicine screening for retinopathy of prematurity. , 2020, Journal of AAPOS : the official publication of the American Association for Pediatric Ophthalmology and Strabismus.

[36]  P. Mitchell,et al.  Distribution and associations of retinal vascular caliber with ethnicity, gender, and birth parameters in young children. , 2007, Investigative ophthalmology & visual science.

[37]  James M. Brown,et al.  Automated Diagnosis of Plus Disease in Retinopathy of Prematurity Using Deep Convolutional Neural Networks , 2018, JAMA ophthalmology.