The Linguistic Ideologies of Deep Abusive Language Classification

This paper brings together theories from sociolinguistics and linguistic anthropology to critically evaluate the so-called “language ideologies”—the set of beliefs and ways of speaking about language—in the practices of abusive language classification in modern machine learning-based NLP. This argument is made at both a conceptual and empirical level, as we review approaches to abusive language from different fields, and use two neural network methods to analyze three datasets developed for abusive language classification tasks (drawn from Wikipedia, Facebook, and StackOverflow). By evaluating and comparing these results, we argue for the importance of incorporating theories of pragmatics and metapragmatics into both the design of classification tasks as well as in ML architectures.

[1]  R. Stuart Geiger,et al.  Beyond opening up the black box: Investigating the role of algorithmic systems in Wikipedian organizational culture , 2017, Big Data Soc..

[2]  Yoram Singer,et al.  Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..

[3]  L. Wittgenstein Philosophical investigations = Philosophische Untersuchungen , 1958 .

[4]  木村 和夫 Pragmatics , 1997, Language Teaching.

[5]  Jonathan Culpeper,et al.  Towards an anatomy of impoliteness. , 1996 .

[6]  Pascale Fung,et al.  One-step and Two-step Classification for Abusive Language Detection on Twitter , 2017, ALW@ACL.

[7]  John R. Searle,et al.  Speech Acts: An Essay in the Philosophy of Language , 1970 .

[8]  Björn Ross,et al.  Measuring the Reliability of Hate Speech Annotations: The Case of the European Refugee Crisis , 2016, ArXiv.

[9]  Csr Young,et al.  How to Do Things With Words , 2009 .

[10]  J. Gumperz Discourse strategies: Introduction , 1982 .

[11]  Alexander Brown,et al.  What is hate speech? Part 1: The Myth of Hate , 2017 .

[12]  E. Goffman Interaction Ritual. Essays on Face-to-Face Behaviour. London (Allen Lane) 1967. , 1967 .

[13]  J. Nagle The Idea of Pollution , 2009 .

[14]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[15]  Tomas Mikolov,et al.  Bag of Tricks for Efficient Text Classification , 2016, EACL.

[16]  Michael Wiegand,et al.  A Survey on Hate Speech Detection using Natural Language Processing , 2017, SocialNLP@EACL.

[17]  Mihailo Markovic,et al.  The language of ideology , 1984, Synthese.

[18]  Joel R. Tetreault,et al.  Abusive Language Detection in Online User Content , 2016, WWW.

[19]  Ye Tian,et al.  Dynamic Pragmatic View of Negation Processing , 2016 .

[20]  A. Agha Language and Social Relations: References , 2006 .

[21]  J. Gumperz The Speech Community , 2009 .

[22]  S. Mills Gender and Politeness , 2003 .

[23]  Eugene Charniak,et al.  Statistical language learning , 1997 .

[24]  Ingmar Weber,et al.  Understanding Abuse: A Typology of Abusive Language Detection Subtasks , 2017, ALW@ACL.

[25]  J. Butler Excitable Speech. A Politics of the Performative , 1997 .

[26]  Michael Silverstein,et al.  Metapragmatic discourse and metapragmatic function , 1993 .

[27]  Derek Ruths,et al.  A Web of Hate: Tackling Hateful Speech in Online Social Spaces , 2017, ArXiv.

[28]  Ellen Spertus,et al.  Smokey: Automatic Recognition of Hostile Messages , 1997, AAAI/IAAI.

[29]  Shivendra K. Verma Code-Switching: Hindi-English , 1975 .

[30]  Jure Leskovec,et al.  Querying Complex Networks in Vector Space , 2018, ArXiv.

[31]  Jennifer Hornsby,et al.  Free Speech and Illocution , 1998, Legal Theory.

[32]  Siobhan Chapman Logic and Conversation , 2005 .

[33]  Penelope Brown,et al.  Politeness: Some Universals in Language Usage , 1989 .

[34]  Julia Hirschberg,et al.  Detecting Hate Speech on the World Wide Web , 2012 .

[35]  Mari J. Matsuda Public Response to Racist Speech: Considering the Victim''s Story , 1993 .

[36]  Christopher D. Manning Computational Linguistics and Deep Learning , 2015, Computational Linguistics.

[37]  Yoshua Bengio,et al.  On the Properties of Neural Machine Translation: Encoder–Decoder Approaches , 2014, SSST@EMNLP.

[38]  Alexander Brown,et al.  What is Hate Speech? Part 2: Family Resemblances , 2017 .

[39]  JANE H. Hill The everyday language of white racism , 2008 .

[40]  J. Holmes,et al.  The Community of Practice: Theories and methodologies in language and gender research , 1999, Language in Society.

[41]  Alessandro Duranti,et al.  Linguistic Anthropology: Linguistic diversity , 1997 .

[42]  Joel R. Tetreault,et al.  Do Characters Abuse More Than Words? , 2016, SIGDIAL Conference.

[43]  Yoshua Bengio,et al.  Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.

[44]  E. Goffman Interaction Ritual: Essays on Face-To-Face Behavior , 1967 .

[45]  Fernando C Pereira Formal grammar and information theory: together again? , 2000, Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[46]  Mary Bucholtz,et al.  “Why be normal?”: Language and identity practices in a community of nerd girls , 1999, Language in Society.

[47]  Jure Leskovec,et al.  Discovering value from community activity on focused question answering sites: a case study of stack overflow , 2012, KDD.

[48]  M. Silverstein Encountering Language and Languages of Encounter in North American Ethnohistory , 1996 .

[49]  Athena Vakali,et al.  A Unified Deep Learning Architecture for Abuse Detection , 2018, WebSci.

[50]  Ritesh Kumar,et al.  Aggression-annotated Corpus of Hindi-English Code-mixed Data , 2018, LREC.

[51]  Lucas Dixon,et al.  Ex Machina: Personal Attacks Seen at Scale , 2016, WWW.

[52]  John Pavlopoulos,et al.  Deep Learning for User Comment Moderation , 2017, ALW@ACL.

[53]  R. Bauman,et al.  POETICS AND PERFORMANCE AS CRITICAL PERSPECTIVES ON LANGUAGE AND SOCIAL LIFE , 1990 .