Paper information - Evaluating Machine Learning Techniques for Detecting Offensive and Hate Speech in South African Tweets

In recent times, South Africa has witnessed a surge of offensive and hate speech along racial and ethnic lines on Twitter. English is prominent among the South African languages used. Although machine learning has been used successfully to detect offensive and hate speech in several English contexts, the distinctiveness of South African tweets and the similarities among offensive, hate and free speech require a domain-specific English corpus and techniques to detect offensive and hate speech. Thus, we developed an English corpus from South African tweets and evaluated different machine learning techniques for detecting offensive and hate speech. Character n-gram, word n-gram, negative sentiment and syntactic-based features, and their hybrid, were extracted and analyzed using hyper-parameter optimization, ensemble and multi-tier meta-learning models of support vector machine, logistic regression, random forest and gradient boosting algorithms. The results showed that the optimized support vector machine with character n-grams performed best in detection of hate speech, with a true positive rate of 0.894, while optimized gradient boosting with word n-grams performed best in detection of hate speech, with a true positive rate of 0.867. However, their performance in detecting the other threatening classes was poor. Multi-tier meta-learning models achieved the most consistent and balanced classification performance, with true positive rates of 0.858 and 0.887 for hate speech and offensive speech, respectively, a true positive rate of 0.646 for free speech, and an overall accuracy of 0.671. The error analysis showed that the multi-tier meta-learning model could reduce the misclassification error rate of the optimized models by 34.26%.
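The abstract names character and word n-gram features and reports per-class true positive rates alongside overall accuracy. The following is a minimal standard-library sketch of those two ideas only, not the authors' pipeline: the function names (`char_ngrams`, `word_ngrams`, `per_class_tpr`, `accuracy`), the n-gram ranges, and the toy labels are all assumptions made for illustration, and no classifier is trained here.

```python
from collections import Counter


def char_ngrams(text, n_min=2, n_max=3):
    """Count character n-grams of length n_min..n_max in a lowercased string."""
    text = text.lower()
    counts = Counter()
    for n in range(n_min, n_max + 1):
        for i in range(len(text) - n + 1):
            counts[text[i:i + n]] += 1
    return counts


def word_ngrams(text, n_min=1, n_max=2):
    """Count word n-grams over whitespace-separated, lowercased tokens."""
    tokens = text.lower().split()
    counts = Counter()
    for n in range(n_min, n_max + 1):
        for i in range(len(tokens) - n + 1):
            counts[" ".join(tokens[i:i + n])] += 1
    return counts


def per_class_tpr(y_true, y_pred):
    """True positive rate (recall) for each class that occurs in y_true."""
    totals = Counter(y_true)
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return {label: hits[label] / totals[label] for label in totals}


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Toy labels invented purely for illustration (not data from the paper).
y_true = ["hate", "hate", "offensive", "offensive", "free", "free", "free", "free"]
y_pred = ["hate", "offensive", "offensive", "offensive", "free", "free", "hate", "offensive"]

tpr = per_class_tpr(y_true, y_pred)
acc = accuracy(y_true, y_pred)
```

Reporting the true positive rate per class, rather than accuracy alone, shows when a model does well on one class while doing poorly on the others, which is the kind of imbalance the abstract describes for the individually optimized models.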

Oluwafemi Oriola | Eduan Kotzé
