Penalizing Unfairness in Binary Classification

We present a new approach for mitigating unfairness in learned classifiers. In particular, we focus on binary classification tasks over individuals from two populations, where, as our criterion for fairness, we wish to achieve similar false positive rates in both populations, and similar false negative rates in both populations. As a proof of concept, we implement our approach and empirically evaluate its ability to achieve both fairness and accuracy, using datasets from the fields of criminal risk assessment, credit, lending, and college admissions.

[1]  Jun Sakuma,et al.  Fairness-aware Learning through Regularization Approach , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[2]  Toniann Pitassi,et al.  Fairness through awareness , 2011, ITCS '12.

[3]  Toniann Pitassi,et al.  Learning Fair Representations , 2013, ICML.

[4]  Latanya Sweeney,et al.  Discrimination in online ad delivery , 2013, CACM.

[5]  Carlos Eduardo Scheidegger,et al.  Certifying and Removing Disparate Impact , 2014, KDD.

[6]  Justin M. Rao,et al.  Precinct or Prejudice? Understanding Racial Disparities in New York City's Stop-and-Frisk Policy , 2015 .

[7]  Andrew D. Selbst,et al.  Big Data's Disparate Impact , 2016 .

[8]  Stephen P. Boyd,et al.  Disciplined convex-concave programming , 2016, 2016 IEEE 55th Conference on Decision and Control (CDC).

[9]  K. Crawford Artificial Intelligence's White Guy Problem , 2016 .

[10]  Christopher T. Lowenkamp,et al.  False Positives, False Negatives, and False Analyses: A Rejoinder to "Machine Bias: There's Software Used across the Country to Predict Future Criminals. and It's Biased against Blacks" , 2016 .

[11]  Aaron Roth,et al.  Fairness in Learning: Classic and Contextual Bandits , 2016, NIPS.

[12]  Adam Tauman Kalai,et al.  Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings , 2016, NIPS.

[13]  Nathan Srebro,et al.  Equality of Opportunity in Supervised Learning , 2016, NIPS.

[14]  Justin M. Rao,et al.  Precinct or Prejudice? Understanding Racial Disparities in New York City's Stop-and-Frisk Policy , 2016 .

[15]  Alexandra Chouldechova,et al.  Fair prediction with disparate impact: A study of bias in recidivism prediction instruments , 2016, Big Data.

[16]  Krishna P. Gummadi,et al.  Fairness Beyond Disparate Treatment & Disparate Impact: Learning Classification without Disparate Mistreatment , 2016, WWW.

[17]  Nathan Srebro,et al.  Learning Non-Discriminatory Predictors , 2017, COLT.

[18]  Jon M. Kleinberg,et al.  Inherent Trade-Offs in the Fair Determination of Risk Scores , 2016, ITCS.

[19]  Jon M. Kleinberg,et al.  On Fairness and Calibration , 2017, NIPS.

[20]  M. Kearns,et al.  Fairness in Criminal Justice Risk Assessments: The State of the Art , 2017, Sociological Methods & Research.