Preventing Disparities: Bayesian and Frequentist Methods for Assessing Fairness in Machine-Learning Decision-Support Models

Machine-learning (ML) methods are increasingly applied to guide human decision-making in many fields. Such guidance can have important consequences, including for treatments and outcomes in health care. Recently, growing attention has focused on the potential for machine learning to automatically learn unjust or discriminatory, but unrecognized or undisclosed, patterns that are manifested in available observational data and in the human processes that gave rise to them, and thereby to inadvertently perpetuate and propagate injustices embodied in the historical data. We applied two frequentist methods that have long been used in the courts and elsewhere to ascertain fairness (the Cochran-Mantel-Haenszel test and beta regression) and one Bayesian method (Bayesian Model Averaging). These methods revealed that our ML model for guiding physicians’ prescribing of discharge beta-blocker medication for post-coronary artery bypass patients does not manifest significant untoward race-associated disparity. The methods also showed that our ML model for directing repeat MRI in children with medulloblastoma did manifest racial disparities that are likely associated with ethnic differences in informed consent and desire for information in the context of serious malignancies. The relevance of these methods to ascertaining and assuring fairness in other ML-based decision-support model-development and -curation contexts is discussed.
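As an illustration of the first frequentist method named above, the following is a minimal sketch of a Cochran-Mantel-Haenszel test applied to stratified 2x2 tables of model recommendations by race. It is not the authors' analysis: the function name `cmh_test`, the choice of strata, and all counts are hypothetical, and the p-value uses the standard chi-square approximation with one degree of freedom.

```python
import math

def cmh_test(strata, continuity=True):
    """Cochran-Mantel-Haenszel chi-square test over K stratified 2x2 tables.

    Each stratum is a tuple (a, b, c, d) of counts:
        a = group 1, recommended      b = group 1, not recommended
        c = group 2, recommended      d = group 2, not recommended
    Returns (chi2, p_value) with 1 degree of freedom.
    """
    diff = 0.0  # sum over strata of observed minus expected count in cell a
    var = 0.0   # sum over strata of the hypergeometric variance of cell a
    for a, b, c, d in strata:
        n = a + b + c + d
        if n < 2:
            continue  # a stratum this small carries no information
        row1, row2 = a + b, c + d
        col1, col2 = a + c, b + d
        diff += a - row1 * col1 / n
        var += row1 * row2 * col1 * col2 / (n * n * (n - 1))
    if var == 0:
        raise ValueError("no informative strata")
    num = abs(diff)
    if continuity:
        num = max(num - 0.5, 0.0)  # continuity correction, floored at zero
    chi2 = num * num / var
    # Survival function of a chi-square variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value

# Hypothetical counts, stratified by an assumed clinical risk category.
balanced = [(10, 10, 10, 10), (5, 15, 5, 15)]
skewed = [(20, 5, 5, 20), (18, 7, 6, 19)]

print(cmh_test(balanced))
print(cmh_test(skewed))
```

In this sketch, a small p-value would indicate that the recommendation rate differs between the two groups after accounting for the stratifying variable; which variables to stratify on is a substantive choice that the sketch does not address.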
