Pareto Efficient Fairness in Supervised Learning: From Extraction to Tracing

As algorithmic decision-making systems become more pervasive, it is crucial to ensure that they do not become mechanisms of unfair discrimination on the basis of attributes such as gender, race, ethnicity, or religion. Moreover, because of the inherent trade-off between fairness measures and accuracy, it is desirable to learn fairness-enhanced models without significantly compromising accuracy. In this paper, we propose Pareto Efficient Fairness (PEF) as a suitable fairness notion for supervised learning, one that ensures an optimal trade-off between overall loss and other fairness criteria. The proposed PEF notion is definition-agnostic, meaning that any well-defined notion of fairness can be reduced to it. To efficiently find a PEF classifier, we cast fairness-enhanced classification as a bilevel optimization problem and propose a gradient-based method whose solution provably belongs to the Pareto frontier, for both convex and non-convex objectives. We also generalize the proposed algorithmic solution to extract and trace arbitrary solutions from the Pareto frontier for a given preference over accuracy and fairness measures. This approach is generic and can be applied to any multicriteria optimization problem to trace points on the Pareto frontier, which is of interest in its own right. We empirically demonstrate the effectiveness of the PEF solution and the extracted Pareto frontier on real-world datasets in comparison with state-of-the-art methods.
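
To make the notion of a Pareto frontier concrete, the sketch below filters a handful of candidate classifiers down to the non-dominated ones. It is only an illustration of Pareto dominance and is not the bilevel gradient-based method proposed in the paper. The function name `pareto_front` and the `(loss, unfairness)` pairs are hypothetical, made up for this example; both objectives are assumed to be minimized.

```python
from typing import List, Sequence


def pareto_front(points: Sequence[Sequence[float]]) -> List[int]:
    """Return indices of non-dominated points, assuming every objective is minimized.

    A point q dominates p if q is no worse than p in every objective
    and strictly better in at least one.
    """
    front = []
    for i, p in enumerate(points):
        dominated = any(
            all(qk <= pk for qk, pk in zip(q, p))
            and any(qk < pk for qk, pk in zip(q, p))
            for j, q in enumerate(points)
            if j != i
        )
        if not dominated:
            front.append(i)
    return front


# Hypothetical (overall loss, unfairness) values for five candidate classifiers.
candidates = [
    (0.20, 0.30),
    (0.25, 0.10),
    (0.30, 0.35),  # dominated by candidate 0
    (0.22, 0.30),  # dominated by candidate 0
    (0.40, 0.05),
]

print(pareto_front(candidates))
```

Each surviving candidate represents a different trade-off: none of them can improve its loss without worsening its unfairness value, or the reverse. This brute-force filter only compares a finite set of already trained models, whereas the abstract describes tracing the frontier directly by optimization.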
