论文信息 - Investigating Sampling Bias in Abusive Language Detection

Investigating Sampling Bias in Abusive Language Detection

Abusive language detection is becoming increasingly important, but we still understand little about the biases in our datasets for abusive language detection, and how these biases affect the quality of abusive language detection. In the work reported here, we reproduce the investigation of Wiegand et al. (2019) to determine differences between different sampling strategies. They compared boosted random sampling, where abusive posts are upsampled, and biased topic sampling, which focuses on topics that are known to cause abusive language. Instead of comparing individual datasets created using these sampling strategies, we use the sampling strategies on a single, large dataset, thus eliminating the textual source of the dataset as a potential confounding factor. We show that differences in the textual source can have more effect than the chosen sampling strategy.

Sandra Kübler | Dante Razo | Sandra Kübler | Dante Razo

[1] Lucas Dixon,et al. Ex Machina: Personal Attacks Seen at Scale , 2016, WWW.

[2] Michael Wiegand,et al. Inducing a Lexicon of Abusive Words – a Feature-Based Approach , 2018, NAACL.

[3] Pascale Fung,et al. Reducing Gender Bias in Abusive Language Detection , 2018, EMNLP.

[4] Gaël Varoquaux,et al. Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[5] Vasudeva Varma,et al. Stereotypical Bias Removal for Hate Speech Detection Task using Knowledge-based Generalizations , 2019, WWW.

[6] Dirk Hovy,et al. Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter , 2016, NAACL.

[7] Yejin Choi,et al. The Risk of Racial Bias in Hate Speech Detection , 2019, ACL.

[8] Haohan Wang,et al. Unlearn Dataset Bias in Natural Language Inference by Fitting the Residual , 2019, EMNLP.

[9] Michael Wiegand,et al. Detection of Abusive Language: the Problem of Biased Datasets , 2019, NAACL.

[10] Julia Hirschberg,et al. Detecting Hate Speech on the World Wide Web , 2012 .

[11] Tommaso Caselli,et al. Lower Bias, Higher Density Abusive Language Datasets: A Recipe , 2020, RESTUP.

[12] Ingmar Weber,et al. Racial Bias in Hate Speech and Abusive Language Detection Datasets , 2019, Proceedings of the Third Workshop on Abusive Language Online.

[13] Ritesh Kumar,et al. Benchmarking Aggression Identification in Social Media , 2018, TRAC@COLING 2018.