Representation Learning with Statistical Independence to Mitigate Bias.

Presence of bias (in datasets or tasks) is inarguably one of the most critical challenges in machine learning applications that has alluded to pivotal debates in recent years. Such challenges range from spurious associations between variables in medical studies to the bias of race in gender or face recognition systems. Controlling for all types of biases in the dataset curation stage is cumbersome and sometimes impossible. The alternative is to use the available data and build models incorporating fair representation learning. In this paper, we propose such a model based on adversarial training with two competing objectives to learn features that have (1) maximum discriminative power with respect to the task and (2) minimal statistical mean dependence with the protected (bias) variable(s). Our approach does so by incorporating a new adversarial loss function that encourages a vanished correlation between the bias and the learned features. We apply our method to synthetic data, medical images (containing task bias), and a dataset for gender classification (containing dataset bias). Our results show that the learned features by our method not only result in superior prediction performance but also are unbiased. The code is available at https://github.com/QingyuZhao/BR-Net/.

[1]  Yoav Goldberg,et al.  Adversarial Removal of Demographic Attributes from Text Data , 2018, EMNLP.

[2]  Jieyu Zhao,et al.  Balanced Datasets Are Not Enough: Estimating and Mitigating Gender Bias in Deep Image Representations , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[3]  Dan Suciu,et al.  Interventional Fairness: Causal Database Repair for Algorithmic Fairness , 2019, SIGMOD Conference.

[4]  Vasant Honavar,et al.  Algorithmic Bias in Recidivism Prediction: A Causal Perspective , 2019, AAAI.

[5]  Nisheeth K. Vishnoi,et al.  How to be Fair and Diverse? , 2016, ArXiv.

[6]  Fei-Fei Li,et al.  Towards fairer datasets: filtering and balancing the distribution of the people subtree in the ImageNet hierarchy , 2019, FAT*.

[7]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[8]  Amos J. Storkey,et al.  Censoring Representations with an Adversary , 2015, ICLR.

[9]  Trevor Darrell,et al.  Adversarial Discriminative Domain Adaptation , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[11]  Dumitru Erhan,et al.  Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Chen Gao,et al.  Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition , 2019, NeurIPS.

[13]  Michael G. Strintzis,et al.  Face Recognition , 2008, Encyclopedia of Multimedia.

[14]  Janaina Mourão Miranda,et al.  Predictive modelling using neuroimaging data in the presence of confounds , 2017, NeuroImage.

[15]  Marc'Aurelio Ranzato,et al.  Unsupervised Learning of Invariant Feature Hierarchies with Applications to Object Recognition , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Alexei A. Efros,et al.  Undoing the Damage of Dataset Bias , 2012, ECCV.

[17]  Alvisa Palese,et al.  Age discrimination in healthcare institutions perceived by seniors and students , 2019, Nursing ethics.

[18]  Philippe Weinzaepfel,et al.  Mimetics: Towards Understanding Human Actions Out of Context , 2019, International Journal of Computer Vision.

[19]  Stefano Ermon,et al.  Learning Controllable Fair Representations , 2018, AISTATS.

[20]  Jeffrey M. Wooldridge,et al.  Solutions Manual and Supplementary Materials for Econometric Analysis of Cross Section and Panel Data , 2003 .

[21]  Kilian M. Pohl,et al.  End-To-End Alzheimer's Disease Diagnosis and Biomarker Identification , 2018, MLMI@MICCAI.

[22]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[23]  Graham Neubig,et al.  Controllable Invariance through Adversarial Feature Learning , 2017, NIPS.

[24]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Kilian M. Pohl,et al.  Chained regularization for identifying brain patterns specific to HIV infection , 2018, NeuroImage.

[26]  Timnit Gebru,et al.  Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification , 2018, FAT.

[27]  Rob Brekelmans,et al.  Invariant Representations without Adversarial Training , 2018, NeurIPS.

[28]  Yu Cheng,et al.  Fully-Adaptive Feature Sharing in Multi-Task Networks with Applications in Person Attribute Classification , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Krishna P. Gummadi,et al.  Fairness Constraints: Mechanisms for Fair Classification , 2015, AISTATS.

[30]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[31]  Julie M. Smith Algorithms and Bias , 2021 .

[32]  Percy Liang,et al.  Fairness Without Demographics in Repeated Loss Minimization , 2018, ICML.

[33]  Esther Rolf,et al.  Delayed Impact of Fair Machine Learning , 2018, ICML.

[34]  Katrina Ligett,et al.  Learning Fair Classifiers: A Regularization-Inspired Approach , 2017, ArXiv.

[35]  Alexander Yates,et al.  Biased Representation Learning for Domain Adaptation , 2012, EMNLP.

[36]  Vishnu Naresh Boddeti,et al.  On the Global Optima of Kernelized Adversarial Representation Learning , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[37]  Galen Reeves,et al.  Adversarially Learned Representations for Information Obfuscation and Inference , 2019, ICML.

[38]  D. A. Kenny,et al.  The moderator-mediator variable distinction in social psychological research: conceptual, strategic, and statistical considerations. , 1986, Journal of personality and social psychology.

[39]  Dinggang Shen,et al.  3D Deep Learning for Multi-modal Imaging-Guided Survival Time Prediction of Brain Tumor Patients , 2016, MICCAI.

[40]  Toniann Pitassi,et al.  Learning Adversarially Fair and Transferable Representations , 2018, ICML.

[41]  Jost Tobias Springenberg,et al.  Unsupervised and Semi-supervised Learning with Categorical Generative Adversarial Networks , 2015, ICLR.

[42]  Yutaka Matsuo,et al.  Adversarial Invariant Feature Learning with Accuracy Constraint for Domain Generalization , 2019, ECML/PKDD.

[43]  Ehsan Adeli,et al.  Training confounder-free deep learning models for medical applications , 2020, Nature Communications.

[44]  Luca Oneto,et al.  Fairness in Machine Learning , 2020, INNSBDDL.

[45]  François Laviolette,et al.  Domain-Adversarial Training of Neural Networks , 2015, J. Mach. Learn. Res..

[46]  Margrit Eichler,et al.  Gender bias in medical research , 1992 .

[47]  Zhe Zhao,et al.  Data Decisions and Theoretical Implications when Adversarially Learning Fair Representations , 2017, ArXiv.

[48]  Toniann Pitassi,et al.  Flexibly Fair Representation Learning by Disentanglement , 2019, ICML.

[49]  Maria L. Rizzo,et al.  Measuring and testing dependence by correlation of distances , 2007, 0803.4101.

[50]  Junmo Kim,et al.  Learning Not to Learn: Training Deep Neural Networks With Biased Data , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[51]  Michael Wooldridge,et al.  Econometric Analysis of Cross Section and Panel Data, 2nd Edition , 2001 .

[52]  A. Baghestani,et al.  How to control confounding effects by statistical analysis , 2012, Gastroenterology and hepatology from bed to bench.

[53]  Andrew Zisserman,et al.  Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps , 2013, ICLR.

[54]  Barbara Caputo,et al.  A Deeper Look at Dataset Bias , 2015, Domain Adaptation in Computer Vision Applications.

[55]  Taesung Park,et al.  CyCADA: Cycle-Consistent Adversarial Domain Adaptation , 2017, ICML.

[56]  Andrew D. Selbst,et al.  Big Data's Disparate Impact , 2016 .

[57]  Vishnu Naresh Boddeti,et al.  Mitigating Information Leakage in Image Representations: A Maximum Entropy Approach , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[58]  Toniann Pitassi,et al.  Learning Fair Representations , 2013, ICML.

[59]  Blake Lemoine,et al.  Mitigating Unwanted Biases with Adversarial Learning , 2018, AIES.

[60]  David J. Sharp,et al.  Increased brain-predicted aging in treated HIV disease , 2017, Neurology.

[61]  Yi Li,et al.  REPAIR: Removing Representation Bias by Dataset Resampling , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).