Detection of Hate or Offensive Phrase using Magnified Tf-Idf

The non-negotiable challenge that social media platforms are facing nowadays is the abundant presence of hate speeches in text messages. Thus, automatic hate speech detection becomes an important ethical concern and research should be carried out to overcome this challenge. In the present paper, we propose a tf-idf based binary classification framework that manipulates the scores obtained as the differences between hate and offensive (HOF) words and non-HOF (NOT) words. Employing this framework, we have achieved a Macro F1 score of 0.6813 and 0.6762 for the English and Hindi test datasets, respectively provided in subtask-1A of the HASOC 2021[13] shared task.

[1]  Gautam Kishore Shahi,et al.  Overview of the HASOC Subtrack at FIRE 2022: Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages , 2022, FIRE.

[2]  Prasenjit Majumder,et al.  Overview of the HASOC Subtrack at FIRE 2021: HateSpeech and Offensive Content Identification in English and Indo-Aryan Languages , 2021, FIRE.

[3]  Prasenjit Majumder,et al.  Overview of the HASOC track at FIRE 2020: Hate Speech and Offensive Content Identification in Indo-European Languages , 2021, FIRE.

[4]  Munmun De Choudhury,et al.  Prevalence and Psychological Effects of Hateful Speech in Online College Communities , 2019, WebSci.

[5]  Animesh Mukherjee,et al.  Spread of Hate Speech in Online Social Media , 2018, WebSci.

[6]  David Robinson,et al.  Detecting Hate Speech on Twitter Using a Convolution-GRU Based Deep Neural Network , 2018, ESWC.

[7]  Derek Ruths,et al.  A Web of Hate: Tackling Hateful Speech in Online Social Spaces , 2017, ArXiv.

[8]  Fabrício Benevenuto,et al.  A Measurement Study of Hate Speech in Social Media , 2017, HT.

[9]  Pascale Fung,et al.  One-step and Two-step Classification for Abusive Language Detection on Twitter , 2017, ALW@ACL.

[10]  Vasudeva Varma,et al.  Deep Learning for Hate Speech Detection in Tweets , 2017, WWW.

[11]  Joel R. Tetreault,et al.  Abusive Language Detection in Online User Content , 2016, WWW.

[12]  Jing Zhou,et al.  Hate Speech Detection with Comment Embeddings , 2015, WWW.

[13]  Elizabeth F. Churchill,et al.  Automatic identification of personal insults on social news sites , 2012, J. Assoc. Inf. Sci. Technol..