An Exploratory Analysis of the Relation between Offensive Language and Mental Health

In this paper, we analyze the interplay between the use of offensive language and mental health. We acquired publicly available datasets created for offensive language identification and depression detection and we train computational models to compare the use of offensive language in social media posts written by groups of individuals with and without self-reported depression diagnosis. We also look at samples written by groups of individuals whose posts show signs of depression according to recent related studies. Our analysis indicates that offensive language is more frequently used in the samples written by individuals with self-reported depression as well as individuals showing signs of depression. The results discussed here open new avenues in research in politeness/offensiveness and mental health.

[1]  Johannes Zimmermann,et al.  Me, myself, and I: self-referent word use as an indicator of self-focused attention in relation to depression and anxiety , 2015, Front. Psychol..

[2]  Marcos Zampieri,et al.  Offensive Language Identification in Greek , 2020, LREC.

[3]  Animesh Mukherjee,et al.  HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection , 2020, AAAI.

[4]  Ingmar Weber,et al.  Understanding Abuse: A Typology of Abusive Language Detection Subtasks , 2017, ALW@ACL.

[5]  Jonathan Culpeper Impoliteness: Using Language to Cause Offence , 2011 .

[6]  Preslav Nakov,et al.  Predicting the Type and Target of Offensive Posts in Social Media , 2019, NAACL.

[7]  Jun-Ming Xu,et al.  Learning from Bullying Traces in Social Media , 2012, NAACL.

[8]  J. Pennebaker,et al.  Language use of depressed and depression-vulnerable college students , 2004 .

[9]  James J. Gross,et al.  Emotion and emotion regulation , 1999 .

[10]  Cristian Danescu-Niculescu-Mizil,et al.  Conversations Gone Awry: Detecting Early Signs of Conversational Failure , 2018, ACL.

[11]  Preslav Nakov,et al.  SemEval-2019 Task 6: Identifying and Categorizing Offensive Language in Social Media (OffensEval) , 2019, *SEMEVAL.

[12]  P. Resnik,et al.  CLPsych 2019 Shared Task: Predicting the Degree of Suicide Risk in Reddit Posts , 2019, Proceedings of the Sixth Workshop on Computational Linguistics and Clinical Psychology.

[13]  R. Norel,et al.  Identifying signals associated with psychiatric illness utilizing language and images posted to Facebook , 2020, npj Schizophrenia.

[14]  Zhong Wang,et al.  Corpus approaches to discourse: a critical review , 2020, Critical Discourse Studies.

[15]  Munmun De Choudhury,et al.  Prevalence and Psychological Effects of Hateful Speech in Online College Communities , 2019, WebSci.

[16]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[17]  Saif Mohammad,et al.  Word Affect Intensities , 2017, LREC.

[18]  Ioana R. Podina,et al.  A comprehensive meta-analysis of interpretation biases in depression. , 2017, Clinical psychology review.

[19]  Stevie Chancellor,et al.  Methods in predictive techniques for mental health status on social media: a critical review , 2020, npj Digital Medicine.

[20]  Nazli Goharian,et al.  Depression and Self-Harm Risk Assessment in Online Forums , 2017, EMNLP.

[21]  Dolf Trieschnigg,et al.  Improving Cyberbullying Detection with User Context , 2013, ECIR.

[22]  Fabio Crestani,et al.  A Test Collection for Research on Depression and Language Use , 2016, CLEF.

[23]  A. Beck,et al.  Advances in cognitive theory and therapy: the generic cognitive model. , 2014, Annual review of clinical psychology.

[24]  Marcos Zampieri,et al.  MUDES: Multilingual Detection of Offensive Spans , 2021, NAACL.

[25]  Çağrı Çöltekin,et al.  A Corpus of Turkish Offensive Language on Social Media , 2020, LREC.

[26]  Ted Dunning,et al.  Accurate Methods for the Statistics of Surprise and Coincidence , 1993, CL.

[27]  Mark Dredze,et al.  Shared Task : Depression and PTSD on Twitter , 2015 .

[28]  Rafael A. Calvo,et al.  CLPsych 2016 Shared Task: Triaging content in online peer-support forums , 2016, CLPsych@HLT-NAACL.

[29]  Eric Horvitz,et al.  Predicting Depression via Social Media , 2013, ICWSM.

[30]  S. Speer Reconsidering self‐deprecation as a communication practice , 2019, The British journal of social psychology.

[31]  Liang Zou,et al.  NULI at SemEval-2019 Task 6: Transfer Learning for Offensive Language Detection using Bidirectional Transformers , 2019, *SEMEVAL.

[32]  Shervin Malmasi,et al.  Predicting Post Severity in Mental Health Forums , 2016, CLPsych@HLT-NAACL.

[33]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[34]  Prasenjit Majumder,et al.  Filtering Aggression from the Multilingual Social Media Feed , 2018, TRAC@COLING 2018.

[35]  Paolo Rosso,et al.  SemEval-2019 Task 5: Multilingual Detection of Hate Speech Against Immigrants and Women in Twitter , 2019, *SEMEVAL.

[36]  Nazli Goharian,et al.  Hate speech detection: Challenges and solutions , 2019, PloS one.

[37]  H. Raat,et al.  Cyber and Traditional Bullying Victimization as a Risk Factor for Mental Health Problems and Suicidal Ideation in Adolescents , 2014, PloS one.

[38]  Maarten Sap,et al.  The role of personality, age, and gender in tweeting about mental illness , 2015, CLPsych@HLT-NAACL.

[39]  Marcos Zampieri,et al.  Multilingual Offensive Language Identification with Cross-lingual Embeddings , 2020, EMNLP.

[40]  Junyi Jessy Li,et al.  Why Swear? Analyzing and Inferring the Intentions of Vulgar Expressions , 2018, EMNLP.

[41]  Shervin Malmasi,et al.  Evaluating Aggression Identification in Social Media , 2020, TRAC.

[42]  Swearing as a Response to Pain: Assessing Hypoalgesic Effects of Novel “Swear” Words , 2019, Frontiers in Psychology.

[43]  Saif Mohammad,et al.  CROWDSOURCING A WORD–EMOTION ASSOCIATION LEXICON , 2013, Comput. Intell..

[44]  Shervin Malmasi,et al.  Challenges in discriminating profanity from hate speech , 2017, J. Exp. Theor. Artif. Intell..

[45]  Fabio Crestani,et al.  A Dataset for Research on Depression in Social Media , 2020, UMAP.

[46]  A. Kilgarriff Simple Maths for Keywords , 2009 .

[47]  Alexander F. Gelbukh,et al.  Aggression Detection in Social Media: Using Deep Neural Networks, Data Augmentation, and Pseudo Labeling , 2018, TRAC@COLING 2018.

[48]  Luke S. Zettlemoyer,et al.  Deep Contextualized Word Representations , 2018, NAACL.

[49]  Mark Dredze,et al.  Discovering Shifts to Suicidal Ideation from Mental Health Content in Social Media , 2016, CHI.

[50]  R. Mathew,et al.  Sexual dysfunctions in depression , 1982, Archives of sexual behavior.