The Grievance Dictionary: Understanding threatening language use

This paper introduces the Grievance Dictionary, a psycholinguistic dictionary that can be used to automatically analyse language use in the context of grievance-fuelled violence threat assessment. We describe the development of the dictionary, which was informed by suggestions from experienced threat assessment practitioners. These suggestions, together with subsequent human and computational word list generation, resulted in a dictionary of 20,502 words annotated by 2,318 participants. The dictionary was validated by applying it to texts written by violent and non-violent individuals, showing strong evidence for a difference between the two populations in several dictionary categories. Further classification tasks showed promising performance, but improvements are still needed. Finally, we provide instructions and suggestions for the use of the Grievance Dictionary by security professionals and (violence) researchers.
