Assessing Social Determinants-Related Performance Bias of Machine Learning Models: A case of Hyperchloremia Prediction in ICU Population

Machine learning in medicine leverages the wealth of healthcare data to extract knowledge, facilitate clinical decision-making, and ultimately improve care delivery. However, ML models trained on datasets that lack demographic diversity could yield suboptimal performance when applied to the underrepresented populations (e.g. ethnic minorities, lower social-economic status), thus perpetuating health disparity. In this study, we evaluated four classifiers built to predict Hyperchloremia—a condition that often results from aggressive fluids administration in the ICU population—and compared their performance in racial, gender, and insurance subgroups. We observed that adding social determinants features in addition to the lab-based ones improved model performance on all patients. The subgroup testing yielded significantly different AUC scores in 40 out of the 44 model-subgroup, suggesting disparities when applying ML models to social determinants subgroups. We urge future researchers to design models that proactively adjust for potential biases and include subgroup reporting in their studies.

[1]  Scott Lundberg,et al.  A Unified Approach to Interpreting Model Predictions , 2017, NIPS.

[2]  G. Clermont,et al.  Relationship between Race and the Effect of Fluids on Long‐term Mortality after Acute Respiratory Distress Syndrome. Secondary Analysis of the National Heart, Lung, and Blood Institute Fluid and Catheter Treatment Trial , 2017, Annals of the American Thoracic Society.

[3]  M. Howell,et al.  Ensuring Fairness in Machine Learning to Advance Health Equity , 2018, Annals of Internal Medicine.

[4]  Brian W. Powers,et al.  Dissecting racial bias in an algorithm used to manage the health of populations , 2019, Science.

[5]  Timnit Gebru,et al.  Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification , 2018, FAT.

[6]  Yuan Luo,et al.  Hyperchloremia in critically ill patients: association with outcomes and prediction using electronic health record data , 2020, BMC Medical Informatics and Decision Making.

[7]  Xiaoqian Jiang,et al.  Early Prediction of Acute Kidney Injury in Critical Care Setting Using Clinical Notes , 2018, 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[8]  Xiaoxi Yao,et al.  Assessing and Mitigating Bias in Medical Artificial Intelligence , 2020, Circulation. Arrhythmia and electrophysiology.

[9]  Keith R. Walley,et al.  Hyperchloremia and moderate increase in serum chloride are associated with acute kidney injury in severe sepsis and septic shock patients , 2016, Critical Care.

[10]  Hanyin Wang,et al.  Using Machine Learning to Integrate Socio-Behavioral Factors in Predicting Cardiovascular-Related Mortality Risk , 2019, MedInfo.

[11]  M. Ghassemi,et al.  Can AI Help Reduce Disparities in General Medical and Mental Health Care? , 2019, AMA journal of ethics.

[12]  Jerry Yee,et al.  Association of Hyperchloremia With Hospital Mortality in Critically Ill Septic Patients , 2015, Critical care medicine.

[13]  Gilles Clermont,et al.  Chloride Content of Fluids Used for Large-Volume Resuscitation Is Associated With Reduced Survival , 2017, Critical care medicine.

[14]  C. Coopersmith,et al.  Simulation of Ventilator Allocation in Critically Ill Patients with COVID-19 , 2021, American journal of respiratory and critical care medicine.

[15]  Ognjen Gajic,et al.  Dyschloremia Is a Risk Factor for the Development of Acute Kidney Injury in Critically Ill Patients , 2016, PloS one.

[16]  S. Tamang,et al.  Potential Biases in Machine Learning Algorithms Using Electronic Health Record Data , 2018, JAMA internal medicine.

[17]  Daniel A. Reuter,et al.  The dark sides of fluid administration in the critically ill patient , 2018, Intensive Care Medicine.