Projection-wise Disentangling for Fair and Interpretable Representation Learning: Application to 3D Facial Shape Analysis

Confounding bias is a crucial problem when applying machine learning in practice, especially in clinical practice. We consider the problem of learning representations that are independent of multiple biases. In the literature, this is mostly solved by purging the bias information from the learned representations. We expect, however, that this strategy harms the diversity of information in the representation and thus limits its prospective usage (e.g., interpretation). Therefore, we propose to mitigate the bias while keeping almost all information in the latent representations, which also enables us to observe and interpret them. To achieve this, we project the latent features onto a learned vector direction and enforce independence between the biases and the projected features, rather than between the biases and all learned features. To interpret the mapping between projected features and input data, we propose projection-wise disentangling: sampling and reconstruction along the learned vector direction. The proposed method was evaluated on the analysis of 3D facial shape and patient characteristics (N=5011). Experiments showed that this conceptually simple method achieved state-of-the-art fair prediction performance and interpretability, indicating its potential for clinical applications.
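The two operations described in the abstract, projecting latent features onto a direction and sampling along that direction, can be illustrated with a minimal NumPy sketch. Everything named here (`project`, `abs_correlation`, `traverse`, the toy data) is hypothetical and not taken from the paper. In particular, absolute Pearson correlation is only a simple stand-in for whatever independence criterion the method actually enforces, the direction is fixed by hand rather than learned, and the decoder that would reconstruct a 3D shape from each sampled latent code is omitted.

```python
import numpy as np

def project(z, w):
    """Scalar projection of each latent row of z onto the direction w."""
    w_unit = w / np.linalg.norm(w)
    return z @ w_unit

def abs_correlation(p, bias):
    """Absolute Pearson correlation, used here as a simple dependence proxy
    (an assumption; the paper may use a different independence measure)."""
    return abs(np.corrcoef(p, bias)[0, 1])

def traverse(z0, w, steps):
    """Latent codes obtained by moving z0 along the unit direction of w.
    Feeding these codes to a decoder would visualise what the direction encodes."""
    w_unit = w / np.linalg.norm(w)
    return z0[None, :] + np.asarray(steps, dtype=float)[:, None] * w_unit[None, :]

# Toy data: 200 latent codes of dimension 8, with a bias tied to latent dim 0.
rng = np.random.default_rng(0)
z = rng.normal(size=(200, 8))
bias = z[:, 0] + 0.1 * rng.normal(size=200)

w_biased = np.eye(8)[0]  # direction aligned with the bias-carrying dimension
w_fair = np.eye(8)[1]    # direction orthogonal to it

dep_biased = abs_correlation(project(z, w_biased), bias)
dep_fair = abs_correlation(project(z, w_fair), bias)

# Sample five latent codes along the fair direction, starting from one code.
steps = np.linspace(-2.0, 2.0, 5)
codes = traverse(z[0], w_fair, steps)
```

In this toy setting the projection onto `w_fair` shows far less dependence on the bias than the projection onto `w_biased`, while the latent codes `z` themselves are left untouched, which mirrors the idea of constraining only the projected feature rather than the whole representation.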
