FlipTest: fairness testing via optimal transport

We present FlipTest, a black-box technique for uncovering discrimination in classifiers. FlipTest is motivated by the intuitive question: had an individual been of a different protected status, would the model have treated them differently? Rather than relying on causal information to answer this question, FlipTest leverages optimal transport to match individuals in different protected groups, creating similar pairs of in-distribution samples. We show how to use these matched pairs to detect discrimination by constructing a flipset: the set of individuals whose classifier output changes post-translation, which corresponds to the set of people who may be harmed because of their group membership. To shed light on why the model treats a given subgroup differently, FlipTest produces a transparency report: a ranking of the features most associated with the model's behavior on the flipset. Evaluating the approach on three case studies, we show that FlipTest provides a computationally inexpensive way to identify subgroups that may be harmed by model discrimination, including in cases where the model satisfies group fairness criteria.
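To make the flipset idea concrete, the sketch below illustrates one simplified instantiation, not the paper's actual construction: it approximates the optimal transport matching between two equal-sized protected groups with a Hungarian assignment over a Euclidean cost matrix, applies a black-box classifier to both sides of the matching, and collects the individuals whose predictions flip. The function name flipsets, the predict interface, the Euclidean cost, and the equal-group-size assumption are all illustrative choices, not details taken from the paper.

    # Illustrative sketch only (not the paper's implementation): approximate the
    # optimal transport matching between two protected groups with a Hungarian
    # assignment, then collect the flipsets of individuals whose predictions change.
    import numpy as np
    from scipy.spatial.distance import cdist
    from scipy.optimize import linear_sum_assignment

    def flipsets(predict, X_a, X_b):
        """predict: callable mapping an (n, d) feature array to 0/1 labels.
        X_a, X_b: feature arrays for the two groups (equal sizes assumed here,
        which reduces optimal transport to a one-to-one assignment problem)."""
        # Cost of pairing each member of group A with each member of group B.
        cost = cdist(X_a, X_b, metric="euclidean")
        # Exact minimum-cost one-to-one matching (Hungarian method).
        rows, cols = linear_sum_assignment(cost)

        y_a = np.asarray(predict(X_a[rows]))   # predictions on the original individuals
        y_b = np.asarray(predict(X_b[cols]))   # predictions on their matched counterparts

        # Positive flipset: outcome improves under the mapping;
        # negative flipset: outcome worsens (those potentially harmed).
        f_pos = rows[(y_a == 0) & (y_b == 1)]
        f_neg = rows[(y_a == 1) & (y_b == 0)]
        return f_pos, f_neg

Under the same assumptions, a transparency-report-style ranking could be approximated by ordering features by the mean absolute difference between X_a[rows] and X_b[cols] restricted to the flipset indices, surfacing the attributes that change most for the affected individuals under the mapping.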
