The Thin Line Between Hate and Profanity

Hate speech can be defined as a language used to demean people within a specific group. Hate speech often contains explicitly profane words, however, the presence of these words does not always mean that the text instance is hateful. In some cases, text instances with profane words are just offensive language and they do not target any specific group, and so cannot be classified as hate speech. In this work, we build on existing studies to find a better demarcation between hate speech and offensive language. Our main contribution is to introduce the use of typed dependency as new features in our feature set. This new feature enables us to consider the relationship between long distance words in a text instance, thereby provides more identifying information than single word-based features. We evaluate our approach using a dataset with the classes: hate, offensive and neither. Comparing our work with existing studies, our feature set is much smaller but we achieve better accuracy and show comparable results in further analysis. Our detailed analysis also showed instances missed by the lexical features that were correctly predicted by the proposed feature set.

[1]  Yongjun Sung,et al.  Brand followers' retweeting behavior on Twitter: How brand relationships influence brand electronic word-of-mouth , 2014, Comput. Hum. Behav..

[2]  Zeerak Waseem,et al.  Are You a Racist or Am I Seeing Things? Annotator Influence on Hate Speech Detection on Twitter , 2016, NLP+CSS@EMNLP.

[3]  Joel R. Tetreault,et al.  Do Characters Abuse More Than Words? , 2016, SIGDIAL Conference.

[4]  Aditya Gaydhani,et al.  Detecting Hate Speech and Offensive Language on Twitter using Machine Learning: An N-gram and TFIDF based Approach , 2018, ArXiv.

[5]  Shervin Malmasi,et al.  Challenges in discriminating profanity from hate speech , 2017, J. Exp. Theor. Artif. Intell..

[6]  Amir Zeldes,et al.  A Deeper Look into Dependency-Based Word Embeddings , 2018, NAACL-HLT.

[7]  S. Stephens-Davidowitz,et al.  The Cost of Racial Animus on a Black Presidential Candidate: Using Google Search Data to Find What Surveys Miss , 2013 .

[8]  Ying Chen,et al.  Detecting Offensive Language in Social Media to Protect Adolescent Online Safety , 2012, 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing.

[9]  Shervin Malmasi,et al.  Measuring Feature Diversity in Native Language Identification , 2015, BEA@NAACL-HLT.

[10]  Shervin Malmasi,et al.  Detecting Hate Speech in Social Media , 2017, RANLP.

[11]  Julia Hirschberg,et al.  Detecting Hate Speech on the World Wide Web , 2012 .

[12]  Matthew Leighton Williams,et al.  The Enemy Among Us: Detecting Hate Speech with Threats Based 'Othering' Language Embeddings , 2018 .

[13]  Suresh Manandhar,et al.  Dependency Based Embeddings for Sentence Classification Tasks , 2016, NAACL.

[14]  Dirk Hovy,et al.  Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter , 2016, NAACL.

[15]  Ingmar Weber,et al.  Automated Hate Speech Detection and the Problem of Offensive Language , 2017, ICWSM.

[16]  David Robinson,et al.  Hate Speech Detection on Twitter: Feature Engineering v.s. Feature Selection , 2018, ESWC.

[17]  Sérgio Nunes,et al.  A Survey on Automatic Detection of Hate Speech in Text , 2018, ACM Comput. Surv..

[18]  Jin-Cheon Na,et al.  Phrase-Level Sentiment Polarity Classification Using Rule-Based Typed Dependencies and Additional Complex Phrases Consideration , 2012, Journal of Computer Science and Technology.

[19]  Omer Levy,et al.  Dependency-Based Word Embeddings , 2014, ACL.

[20]  Gabriela Ferraro,et al.  Transfer learning for hate speech detection in social media , 2019, Journal of Computational Social Science.

[21]  Ziqi Zhang,et al.  Hate Speech Detection: A Solved Problem? The Challenging Case of Long Tail on Twitter , 2018, Semantic Web.

[22]  M. Williams,et al.  Hate speech, machine classification and statistical modelling of information flows on Twitter: interpretation and communication for policy decision making , 2014 .

[23]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.

[24]  Pete Burnap,et al.  Us and them: identifying cyber hate on Twitter across multiple protected characteristics , 2016, EPJ Data Science.

[25]  Joel R. Tetreault,et al.  Abusive Language Detection in Online User Content , 2016, WWW.

[26]  Tomoaki Ohtsuki,et al.  Hate Speech on Twitter: A Pragmatic Approach to Collect Hateful and Offensive Expressions and Perform Hate Speech Detection , 2018, IEEE Access.

[27]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.