A Neural Network Framework for Fair Classifiers

Machine learning models are widely used in decision making, especially for prediction tasks. These models can be biased or unfair towards a sensitive group defined by attributes such as race, gender, or age. Researchers have put considerable effort into formalizing particular definitions of fairness and enforcing them in models. In this work we are chiefly concerned with three definitions: Disparate Impact, Demographic Parity, and Equalized Odds. It has been shown that a calibrated classifier cannot satisfy Equalized Odds unless it is perfect; hence the primary challenge is to ensure a degree of fairness while preserving as much accuracy as possible. Fairness constraints are complex and need not be convex, so incorporating them into a machine learning algorithm is a significant challenge. Many researchers have therefore devised convex surrogate losses in order to build fair classifiers, while other work builds fair representations by preprocessing the data, independently of the classifier used. Such methods not only rest on unrealistic assumptions but also require hand-engineered analytical solutions. We instead propose an automated approach that generalizes over any fairness constraint: a neural network trained on mini-batches that enforces the fairness constraint directly as the loss function, without any further modification. We have also experimented with complex performance measures such as the H-mean loss, Q-mean loss, and F-measure, again without the need for surrogate loss functions. Our experiments show that the network achieves performance comparable to the state of the art. One can thus simply plug in the loss function matching the desired fairness constraint and performance measure of the classifier and train a neural network to achieve it.
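For concreteness, the three criteria named above are usually stated as follows, writing Ŷ for the model's prediction, Y for the true label, and A ∈ {0,1} for the sensitive attribute (this notation is ours, added for exposition, not the paper's):

```latex
\text{Demographic Parity: } \quad P(\hat{Y}=1 \mid A=0) \;=\; P(\hat{Y}=1 \mid A=1)

\text{Disparate Impact ($p\%$-rule): } \quad \frac{P(\hat{Y}=1 \mid A=0)}{P(\hat{Y}=1 \mid A=1)} \;\ge\; \tau, \qquad \text{commonly } \tau = 0.8

\text{Equalized Odds: } \quad P(\hat{Y}=1 \mid A=0,\, Y=y) \;=\; P(\hat{Y}=1 \mid A=1,\, Y=y) \quad \text{for } y \in \{0, 1\}
```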
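The abstract does not name a framework or give the exact loss, so the following is only a minimal PyTorch-style sketch of the batch-level idea it describes: a relaxed demographic-parity gap is computed on each mini-batch and added to the classification loss, with no convex surrogate. The class name `FairnessPenalizedLoss` and the weight `lam` are illustrative assumptions, not the paper's API.

```python
import torch
import torch.nn as nn

class FairnessPenalizedLoss(nn.Module):
    """Binary cross-entropy plus a batch-level demographic-parity penalty.

    The penalty is the absolute difference between the mean predicted
    positive rates of the two sensitive groups, estimated on each
    mini-batch. `lam` trades accuracy against fairness; both the
    penalty form and the names here are hypothetical sketches.
    """

    def __init__(self, lam=1.0):
        super().__init__()
        self.lam = lam
        self.bce = nn.BCEWithLogitsLoss()

    def forward(self, logits, labels, group):
        # Standard classification loss on the raw logits.
        loss = self.bce(logits, labels)

        # Soft (differentiable) positive-prediction rate per group.
        probs = torch.sigmoid(logits)
        rate_a = probs[group == 0].mean()
        rate_b = probs[group == 1].mean()

        # Relaxed demographic-parity gap, differentiable w.r.t. logits.
        penalty = (rate_a - rate_b).abs()
        return loss + self.lam * penalty
```

Note that each mini-batch must contain examples from both groups (e.g., via stratified sampling), since the mean rate of an empty group is undefined. The batch statistic is a stochastic estimate of the population gap, so larger batches yield a lower-variance penalty; an Equalized Odds variant would compute the same per-group rates conditioned on the true label Y.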
