HateMonitors: Language Agnostic Abuse Detection in Social Media

Reducing hateful and offensive content in online social media poses a dual problem for moderators. On the one hand, rigid censorship cannot be imposed on social media. On the other, the free flow of such content cannot be allowed. Hence, we require an efficient abusive language detection system to detect such harmful content in social media. In this paper, we present our machine learning model, HateMonitor, developed for Hate Speech and Offensive Content Identification in Indo-European Languages (HASOC), a shared task at FIRE 2019. We have used a Gradient Boosting model, along with BERT and LASER embeddings, to make the system language agnostic. Our model ranked first in the German sub-task A. We have also made our model public at this https URL.
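
The combination of sentence embeddings with a gradient boosting classifier can be illustrated with a small, self-contained sketch. It is not the authors' implementation: it assumes the two embeddings are simply concatenated per post (the abstract does not say how they are combined), replaces real BERT and LASER vectors with random toy vectors, and uses hand-written boosted decision stumps in place of a production gradient boosting library. All function names and parameters here are hypothetical.

```python
import math
import random


def concat_features(bert_vec, laser_vec):
    # Assumption: the two sentence embeddings are joined into one feature vector.
    return list(bert_vec) + list(laser_vec)


def make_toy_data(n=40, bert_dim=4, laser_dim=3, seed=0):
    # Placeholder data: random vectors stand in for real BERT / LASER embeddings.
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n):
        label = i % 2  # 1 = offensive, 0 = not offensive (toy labels)
        center = 1.0 if label == 1 else -1.0
        bert_vec = [rng.gauss(center, 0.5) for _ in range(bert_dim)]
        laser_vec = [rng.gauss(center, 0.5) for _ in range(laser_dim)]
        X.append(concat_features(bert_vec, laser_vec))
        y.append(label)
    return X, y


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_stump(X, residuals):
    # Pick the single (feature, threshold) split with the lowest squared error.
    mean = sum(residuals) / len(residuals)
    best = (float("inf"), 0, float("inf"), mean, mean)  # fallback: constant stump
    for j in range(len(X[0])):
        values = sorted(set(row[j] for row in X))
        for lo, hi in zip(values, values[1:]):
            thr = (lo + hi) / 2.0
            left = [r for row, r in zip(X, residuals) if row[j] <= thr]
            right = [r for row, r in zip(X, residuals) if row[j] > thr]
            lmean = sum(left) / len(left)
            rmean = sum(right) / len(right)
            err = sum((r - lmean) ** 2 for r in left)
            err += sum((r - rmean) ** 2 for r in right)
            if err < best[0]:
                best = (err, j, thr, lmean, rmean)
    return best[1:]


def fit_boosting(X, y, n_rounds=20, lr=0.5):
    # Gradient boosting with logistic loss: each stump fits the current
    # residuals y - sigmoid(score), i.e. the negative gradient of the loss.
    scores = [0.0] * len(X)
    stumps = []
    for _ in range(n_rounds):
        residuals = [yi - sigmoid(s) for yi, s in zip(y, scores)]
        j, thr, lval, rval = fit_stump(X, residuals)
        stumps.append((j, thr, lval, rval))
        scores = [s + lr * (lval if row[j] <= thr else rval)
                  for s, row in zip(scores, X)]
    return stumps


def predict(stumps, x, lr=0.5):
    # Sum the shrunken stump outputs and threshold the probability at 0.5.
    score = sum(lr * (lval if x[j] <= thr else rval)
                for j, thr, lval, rval in stumps)
    return 1 if sigmoid(score) >= 0.5 else 0


if __name__ == "__main__":
    X, y = make_toy_data()
    model = fit_boosting(X, y)
    correct = sum(predict(model, x) == t for x, t in zip(X, y))
    print(f"training accuracy: {correct / len(y):.2f}")
```

Because the classifier only sees fixed-length vectors, the same training code would apply to any language the embedding models cover, which is one way to read the "language agnostic" claim. In practice, a library implementation of gradient boosting and real multilingual embeddings would replace the toy parts.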
