论文信息 - Evaluating Bias In Dutch Word Embeddings

Evaluating Bias In Dutch Word Embeddings

Recent research in Natural Language Processing has revealed that word embeddings can encode social biases present in the training data which can affect minorities in real world applications. This paper explores the gender bias implicit in Dutch embeddings while investigating whether English language based approaches can also be used in Dutch. We implement the Word Embeddings Association Test (WEAT), Clustering and Sentence Embeddings Association Test (SEAT) methods to quantify the gender bias in Dutch word embeddings, then we proceed to reduce the bias with Hard-Debias and Sent-Debias mitigation methods and finally we evaluate the performance of the debiased embeddings in downstream tasks. The results suggest that, among others, gender bias is present in traditional and contextualized Dutch word embeddings. We highlight how techniques used to measure and reduce bias created for English can be used in Dutch embeddings by adequately translating the data and taking into account the unique characteristics of the language. Furthermore, we analyze the effect of the debiasing techniques on downstream tasks which show a negligible impact on traditional embeddings and a 2% decrease in performance in contextualized embeddings. Finally, we release the translated Dutch datasets to the public along with the traditional embeddings with mitigated bias.

Gerasimos Spanakis | Rodrigo Alejandro Ch'avez Mulsa | Gerasimos Spanakis | Rodrigo Alejandro Chávez Mulsa

[1] R'emi Louf,et al. HuggingFace's Transformers: State-of-the-art Natural Language Processing , 2019, ArXiv.

[2] Arvind Narayanan,et al. Semantics derived automatically from language corpora contain human-like biases , 2016, Science.

[3] Omer Levy,et al. RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.

[4] Adam Tauman Kalai,et al. Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings , 2016, NIPS.

[5] Chandler May,et al. On Measuring Social Biases in Sentence Encoders , 2019, NAACL.

[6] Petr Sojka,et al. Software Framework for Topic Modelling with Large Corpora , 2010 .

[7] Vicente Ordonez,et al. Double-Hard Debias: Tailoring Word Embeddings for Gender Bias Mitigation , 2020, ACL.

[8] Pieter Delobelle,et al. RobBERT: a Dutch RoBERTa-based Language Model , 2020, EMNLP.

[9] Daniel Jurafsky,et al. Word embeddings quantify 100 years of gender and ethnic stereotypes , 2017, Proceedings of the National Academy of Sciences.

[10] Walter Daelemans,et al. Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource , 2016, LREC.

[11] Roland Schäfer,et al. Building Large Corpora from the Web Using a New Efficient Tool Chain , 2012, LREC.

[12] Bambang Parmanto,et al. Integrating Transformer and Paraphrase Rules for Sentence Simplification , 2018, EMNLP.

[13] Yoav Goldberg,et al. Lipstick on a Pig: Debiasing Methods Cover up Systematic Gender Biases in Word Embeddings But do not Remove Them , 2019, NAACL-HLT.

[14] Timnit Gebru,et al. Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification , 2018, FAT.

[15] Mai ElSherief,et al. Mitigating Gender Bias in Natural Language Processing: Literature Review , 2019, ACL.

[16] Noe Casas,et al. Evaluating the Underlying Gender Bias in Contextualized Word Embeddings , 2019, Proceedings of the First Workshop on Gender Bias in Natural Language Processing.

[17] Ryan Cotterell,et al. Examining Gender Bias in Languages with Grammatical Gender , 2019, EMNLP.

[18] Murhaf Fares,et al. Word vectors, reuse, and replicability: Towards a community repository of large-text resources , 2017, NODALIDA.

[19] Michael Carl Tschantz,et al. Automated Experiments on Ad Privacy Settings , 2014, Proc. Priv. Enhancing Technol..

[20] Suzan Verberne,et al. The merits of Universal Language Model Fine-tuning for Small Datasets - a case with Dutch book reviews , 2019, ArXiv.

[21] Heng Tao Shen,et al. Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[22] Ilya Sutskever,et al. Language Models are Unsupervised Multitask Learners , 2019 .

[23] Tommaso Caselli,et al. BERTje: A Dutch BERT Model , 2019, ArXiv.

[24] A. Greenwald,et al. Measuring individual differences in implicit cognition: the implicit association test. , 1998, Journal of personality and social psychology.

[25] Ryan Cotterell,et al. Gender Bias in Contextualized Word Embeddings , 2019, NAACL.

[26] Emily Denton,et al. Social Biases in NLP Models as Barriers for Persons with Disabilities , 2020, ACL.

[27] Prakhar Gupta,et al. Learning Word Vectors for 157 Languages , 2018, LREC.

[28] Ruslan Salakhutdinov,et al. Towards Debiasing Sentence Representations , 2020, ACL.

[29] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.