FedFair: Training Fair Models In Cross-Silo Federated Learning

Building fair machine learning models becomes more and more important. As many powerful models are built by collaboration among multiple parties, each holding some sensitive data, it is natural to explore the feasibility of training fair models in cross-silo federated learning so that fairness, privacy and collaboration can be fully respected simultaneously. However, it is a very challenging task, since it is far from trivial to accurately estimate the fairness of a model without knowing the private data of the participating parties. In this paper, we first propose a federated estimation method to accurately estimate the fairness of a model without infringing the data privacy of any party. Then, we use the fairness estimation to formulate a novel problem of training fair models in cross-silo federated learning. We develop FedFair, a well-designed federated learning framework, which can successfully train a fair model with high performance without any data privacy infringement. Our extensive experiments on three real-world data sets demonstrate the excellent fair model training performance of our method.

[1]  Richard Nock,et al.  Advances and Open Problems in Federated Learning , 2021, Found. Trends Mach. Learn..

[2]  Sreenivas Gollapudi,et al.  Profit Sharing and Efficiency in Utility Games , 2017, ESA.

[3]  Nathan Srebro,et al.  Learning Non-Discriminatory Predictors , 2017, COLT.

[4]  A Unified Single-loop Alternating Gradient Projection Algorithm for Nonconvex-Concave and Convex-Nonconcave Minimax Problems , 2020, ArXiv.

[5]  Jian Pei,et al.  Personalized Cross-Silo Federated Learning on Non-IID Data , 2020, AAAI.

[6]  Shai Ben-David,et al.  Empirical Risk Minimization under Fairness Constraints , 2018, NeurIPS.

[7]  Kristina Lerman,et al.  A Survey on Bias and Fairness in Machine Learning , 2019, ACM Comput. Surv..

[8]  Anit Kumar Sahu,et al.  Federated Optimization in Heterogeneous Networks , 2018, MLSys.

[9]  Tian Li,et al.  Fair Resource Allocation in Federated Learning , 2019, ICLR.

[10]  Jiong Jin,et al.  Towards Fair and Privacy-Preserving Federated Deep Models , 2019, IEEE Transactions on Parallel and Distributed Systems.

[11]  Erez Shmueli,et al.  Algorithmic Fairness , 2020, ArXiv.

[12]  Avi Feller,et al.  Algorithmic Decision Making and the Cost of Fairness , 2017, KDD.

[13]  Jun Sakuma,et al.  Fairness-Aware Classifier with Prejudice Remover Regularizer , 2012, ECML/PKDD.

[14]  Aaron Roth,et al.  Fairness in Learning: Classic and Contextual Bandits , 2016, NIPS.

[15]  Maya R. Gupta,et al.  Optimization with Non-Differentiable Constraints with Applications to Fairness, Recall, Churn, and Other Goals , 2018, J. Mach. Learn. Res..

[16]  Toon Calders,et al.  Data preprocessing techniques for classification without discrimination , 2011, Knowledge and Information Systems.

[17]  Jon M. Kleinberg,et al.  On Fairness and Calibration , 2017, NIPS.

[18]  Toon Calders,et al.  Building Classifiers with Independency Constraints , 2009, 2009 IEEE International Conference on Data Mining Workshops.

[19]  Kush R. Varshney,et al.  Optimized Pre-Processing for Discrimination Prevention , 2017, NIPS.

[20]  Yasaman Khazaeni,et al.  Federated Learning with Matched Averaging , 2020, ICLR.

[21]  Ron Kohavi,et al.  Scaling Up the Accuracy of Naive-Bayes Classifiers: A Decision-Tree Hybrid , 1996, KDD.

[22]  Percy Liang,et al.  Fairness Without Demographics in Repeated Loss Minimization , 2018, ICML.

[23]  Hanghang Tong,et al.  Fairness-aware Agnostic Federated Learning , 2020, SDM.

[24]  Mehryar Mohri,et al.  Agnostic Federated Learning , 2019, ICML.

[25]  Tianjian Chen,et al.  Federated Machine Learning: Concept and Applications , 2019 .

[26]  Lili Su,et al.  Distributed Statistical Machine Learning in Adversarial Settings: Byzantine Gradient Descent , 2019, PERV.

[27]  Meisam Razaviyayn,et al.  Rényi Fair Inference , 2019, ICLR.

[28]  Toniann Pitassi,et al.  Fairness through awareness , 2011, ITCS '12.

[29]  John Langford,et al.  A Reductions Approach to Fair Classification , 2018, ICML.

[30]  Carlos Eduardo Scheidegger,et al.  Certifying and Removing Disparate Impact , 2014, KDD.

[31]  Mun Choon Chan,et al.  Collaborative Machine Learning with Incentive-Aware Model Rewards , 2020, ICML.

[32]  Franco Turini,et al.  k-NN as an implementation of situation testing for discrimination discovery and prevention , 2011, KDD.

[33]  Toniann Pitassi,et al.  Learning Fair Representations , 2013, ICML.

[34]  Toon Calders,et al.  Three naive Bayes approaches for discrimination-free classification , 2010, Data Mining and Knowledge Discovery.

[35]  Max Welling,et al.  The Variational Fair Autoencoder , 2015, ICLR.

[36]  Nathan Srebro,et al.  Equality of Opportunity in Supervised Learning , 2016, NIPS.

[37]  Blaise Agüera y Arcas,et al.  Communication-Efficient Learning of Deep Networks from Decentralized Data , 2016, AISTATS.

[38]  Tianjian Chen,et al.  A Fairness-aware Incentive Scheme for Federated Learning , 2020, AIES.

[39]  Shaojie Tang,et al.  On Designing Data Quality-Aware Truth Estimation and Surplus Sharing Method for Mobile Crowdsensing , 2017, IEEE Journal on Selected Areas in Communications.

[40]  Aditya Krishna Menon,et al.  The cost of fairness in binary classification , 2018, FAT.

[41]  Lu Zhang,et al.  On Convexity and Bounds of Fairness-aware Classification , 2019, WWW.

[42]  Novi Quadrianto,et al.  Recycling Privileged Learning and Distribution Matching for Fairness , 2017, NIPS.

[43]  Krishna P. Gummadi,et al.  Fairness Beyond Disparate Treatment & Disparate Impact: Learning Classification without Disparate Mistreatment , 2016, WWW.

[44]  Oliver Thomas,et al.  Discovering Fair Representations in the Data Domain , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[45]  Adam Tauman Kalai,et al.  Decoupled Classifiers for Group-Fair and Efficient Machine Learning , 2017, FAT.

[46]  A. Gorban,et al.  The Five Factor Model of personality and evaluation of drug consumption risk , 2015, 1506.06297.

[47]  Miroslav Dudík,et al.  Fair Regression: Quantitative Definitions and Reduction-based Algorithms , 2019, ICML.

[48]  Toon Calders,et al.  Discrimination Aware Decision Tree Learning , 2010, 2010 IEEE International Conference on Data Mining.