Analyzing the impact of missing values and selection bias on fairness

Algorithmic decision making is becoming more prevalent, increasingly impacting people’s daily lives. Recently, discussions have been emerging about the fairness of decisions made by machines. Researchers have proposed different approaches for improving the fairness of these algorithms. While these approaches can help machines make fairer decisions, they have been developed and validated on fairly clean data sets. Unfortunately, most real-world data have complexities that make them more dirty. This work considers two of these complexities by analyzing the impact of two real-world data issues on fairness—missing values and selection bias—for categorical data. After formulating this problem and showing its existence, we propose fixing algorithms for data sets containing missing values and/or selection bias that use different forms of reweighting and resampling based upon the missing value generation process. We conduct an extensive empirical evaluation on both real-world and synthetic data using various fairness metrics, and demonstrate how different missing values generated from different mechanisms and selection bias impact prediction fairness, even when prediction accuracy remains fairly constant.

[1]  A. Acock Working With Missing Values , 2005 .

[2]  D. Rubin Multiple imputation for nonresponse in surveys , 1989 .

[3]  Krishna P. Gummadi,et al.  Fairness Constraints: Mechanisms for Fair Classification , 2015, AISTATS.

[4]  Emre Kıcıman,et al.  Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries , 2018, Front. Big Data.

[5]  D. Bennett How can I deal with missing data in my study? , 2001, Australian and New Zealand journal of public health.

[6]  Kush R. Varshney,et al.  Optimized Pre-Processing for Discrimination Prevention , 2017, NIPS.

[7]  Ricardo Baeza-Yates,et al.  Bias on the web , 2018, Commun. ACM.

[8]  J. Schafer,et al.  Missing data: our view of the state of the art. , 2002, Psychological methods.

[9]  H. Binder,et al.  A DAG-based comparison of interventional effect underestimation between composite endpoint and multi-state analysis in cardiovascular trials , 2017, BMC Medical Research Methodology.

[10]  Dan A. Biddle Adverse Impact and Test Validation: A Practitioner's Guide to Valid and Defensible Employment Testing , 2005 .

[11]  Alexandra Chouldechova,et al.  The Frontiers of Fairness in Machine Learning , 2018, ArXiv.

[12]  Toniann Pitassi,et al.  Fairness through awareness , 2011, ITCS '12.

[13]  Jun Sakuma,et al.  Fairness-Aware Classifier with Prejudice Remover Regularizer , 2012, ECML/PKDD.

[14]  Boi Faltings,et al.  Non-Discriminatory Machine Learning through Convex Fairness Criteria , 2018, AAAI.

[15]  Carlos Eduardo Scheidegger,et al.  Certifying and Removing Disparate Impact , 2014, KDD.

[16]  Toon Calders,et al.  Data preprocessing techniques for classification without discrimination , 2011, Knowledge and Information Systems.

[17]  Max Kuhn,et al.  Feature Engineering and Selection , 2019 .

[18]  Nathan Srebro,et al.  Equality of Opportunity in Supervised Learning , 2016, NIPS.

[19]  Andrew Pickles,et al.  Missing Data, Problems and Solutions , 2003 .

[20]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[21]  D. Kehl,et al.  Algorithms in the Criminal Justice System: Assessing the Use of Risk Assessments in Sentencing , 2017 .

[22]  J. Graham,et al.  Missing data analysis: making it work in the real world. , 2009, Annual review of psychology.

[23]  Adam Tauman Kalai,et al.  Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings , 2016, NIPS.

[24]  Ting Wang,et al.  Why Amazon's Ratings Might Mislead You: The Story of Herding Effects , 2014, Big Data.

[25]  Carlos Castillo,et al.  Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries , 2019, Front. Big Data.

[26]  C. Y. Peng,et al.  Principled missing data methods for researchers , 2013, SpringerPlus.

[27]  Toon Calders,et al.  Three naive Bayes approaches for discrimination-free classification , 2010, Data Mining and Knowledge Discovery.

[28]  Per Winkel,et al.  When and how should multiple imputation be used for handling missing data in randomised clinical trials – a practical guide with flowcharts , 2017, BMC Medical Research Methodology.

[29]  Hyun Kang The prevention and handling of the missing data , 2013, Korean journal of anesthesiology.

[30]  Toniann Pitassi,et al.  Learning Fair Representations , 2013, ICML.

[31]  Helen Nissenbaum,et al.  Bias in computer systems , 1996, TOIS.

[32]  Ken P Kleinman,et al.  Much Ado About Nothing , 2007, The American statistician.

[33]  Julia Rubin,et al.  Fairness Definitions Explained , 2018, 2018 IEEE/ACM International Workshop on Software Fairness (FairWare).

[34]  Ben Green,et al.  Disparate Interactions: An Algorithm-in-the-Loop Analysis of Fairness in Risk Assessments , 2019, FAT.

[35]  Stef van Buuren,et al.  Flexible Imputation of Missing Data , 2012 .

[36]  Jun Sakuma,et al.  Fairness-aware Learning through Regularization Approach , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[37]  Toon Calders,et al.  Building Classifiers with Independency Constraints , 2009, 2009 IEEE International Conference on Data Mining Workshops.

[38]  Lukasz A. Kurgan,et al.  Impact of imputation of missing values on classification error for discrete data , 2008, Pattern Recognit..

[39]  Timnit Gebru,et al.  Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification , 2018, FAT.

[40]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[41]  Reuben Binns,et al.  Fairness in Machine Learning: Lessons from Political Philosophy , 2017, FAT.

[42]  Toon Calders,et al.  Discrimination Aware Decision Tree Learning , 2010, 2010 IEEE International Conference on Data Mining.

[43]  José Hernández-Orallo,et al.  Fairness and Missing Values , 2019, ArXiv.

[44]  Alexandra Chouldechova,et al.  Fair prediction with disparate impact: A study of bias in recidivism prediction instruments , 2016, Big Data.

[45]  G. King,et al.  Analyzing Incomplete Political Science Data: An Alternative Algorithm for Multiple Imputation , 2001, American Political Science Review.