Self-Supervised Euphemism Detection and Identification for Content Moderation

Fringe groups and organizations have a long history of using euphemisms—ordinary-sounding words with a secret meaning—to conceal what they are discussing. Nowadays, one common use of euphemisms is to evade content moderation policies enforced by social media platforms. Existing tools for enforcing policy automatically rely on keyword searches for words on a "ban list", but these are notoriously imprecise: even when limited to swearwords, they can still cause embarrassing false positives [1]. When a commonly used ordinary word acquires a euphemistic meaning, adding it to a keyword-based ban list is hopeless: consider "pot" (storage container or marijuana?) or "heater" (household appliance or firearm?) The current generation of social media companies instead hire staff to check posts manually, but this is expensive, inhumane, and not much more effective. It is usually apparent to a human moderator that a word is being used euphemistically, but they may not know what the secret meaning is, and therefore whether the message violates policy. Also, when a euphemism is banned, the group that used it need only invent another one, leaving moderators one step behind.This paper will demonstrate unsupervised algorithms that, by analyzing words in their sentence-level context, can both detect words being used euphemistically, and identify the secret meaning of each word. Compared to the existing state of the art, which uses context-free word embeddings, our algorithm for detecting euphemisms achieves 30–400% higher detection accuracies of unlabeled euphemisms in a text corpus. Our algorithm for revealing euphemistic meanings of words is the first of its kind, as far as we are aware. In the arms race between content moderators and policy evaders, our algorithms may help shift the balance in the direction of the moderators.

[1]  Carl Yang,et al.  Co-Embedding Network Nodes and Hierarchical Labels with Taxonomy Based Generative Adversarial Networks , 2020, 2020 IEEE International Conference on Data Mining (ICDM).

[2]  Xiang Ren,et al.  SynSetExpan: An Iterative Framework for Joint Entity Set Expansion and Synonym Discovery , 2020, EMNLP.

[3]  Jiawei Han,et al.  CoRel: Seed-Guided Topical Taxonomy Construction by Concept Learning and Relation Transferring , 2020, KDD.

[4]  Yuichi SEI,et al.  Codewords Detection in Microblogs Focusing on Differences in Word Use Between Two Corpora , 2020, 2020 International Conference on Computing, Electronics & Communications Engineering (iCCECE).

[5]  Ellen Riloff,et al.  Recognizing Euphemisms and Dysphemisms Using Sentiment Analysis , 2020, FIGLANG.

[6]  Abdel-rahman Mohamed,et al.  wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations , 2020, NeurIPS.

[7]  Xin Luna Dong,et al.  Octet: Online Catalog Taxonomy Enrichment with Self-Supervision , 2020, KDD.

[8]  Geoffrey E. Hinton,et al.  Big Self-Supervised Models are Strong Semi-Supervised Learners , 2020, NeurIPS.

[9]  Torsten Kröger,et al.  Self-Supervised Learning for Precise Pick-and-Place Without Object Model , 2020, IEEE Robotics and Automation Letters.

[10]  Chris Donahue,et al.  Enabling Language Models to Fill in the Blanks , 2020, ACL.

[11]  Jingbo Shang,et al.  Empower Entity Set Expansion via Language Model Probing , 2020, ACL.

[12]  Jingbo Shang,et al.  NetTaxo: Automated Topic Taxonomy Construction from Text-Rich Network , 2020, WWW.

[13]  Yu Meng,et al.  Guiding Corpus-based Set Expansion by Auxiliary Sets Generation and Co-Expansion , 2020, WWW.

[14]  Chi Wang,et al.  TaxoExpan: Self-supervised Taxonomy Expansion with Position-Enhanced Graph Neural Network , 2020, WWW.

[15]  Derek Hoiem,et al.  Dreaming to Distill: Data-Free Knowledge Transfer via DeepInversion , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Kevin Gimpel,et al.  ALBERT: A Lite BERT for Self-supervised Learning of Language Representations , 2019, ICLR.

[17]  Chao Zhang,et al.  FUSE: Multi-Faceted Set Expansion by Coherent Clustering of Skip-grams , 2019, ECML/PKDD.

[18]  Sergey Levine,et al.  Skew-Fit: State-Covering Self-Supervised Reinforcement Learning , 2019, ICML.

[19]  Yang Xu,et al.  Slang Detection and Identification , 2019, CoNLL.

[20]  Wolfram Burgard,et al.  Self-supervised 3D Shape and Viewpoint Estimation from Single Images for Robotics , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[21]  Alexei A. Efros,et al.  Unsupervised Domain Adaptation through Self-Supervision , 2019, ArXiv.

[22]  Ehsan Adeli,et al.  Self-Supervised Representation Learning via Neighborhood-Relational Encoding , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[23]  Omer Levy,et al.  RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.

[24]  Liang Xiao,et al.  Self-Supervised Domain Adaptation for Computer Vision Tasks , 2019, IEEE Access.

[25]  Zhiwu Lu,et al.  Coarse-to-Fine Grained Classification , 2019, SIGIR.

[26]  Dacheng Tao,et al.  Self-Supervised Representation Learning by Rotation Feature Decoupling , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Graham Neubig,et al.  Learning to Describe Unknown Phrases with Local and Global Contexts , 2019, NAACL.

[28]  Alexander Kolesnikov,et al.  S4L: Self-Supervised Semi-Supervised Learning , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[29]  Alexander Kolesnikov,et al.  Revisiting Self-Supervised Visual Representation Learning , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Qiang Yang,et al.  Exploiting Coarse-to-Fine Task Transfer for Aspect-level Sentiment Classification , 2018 .

[31]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[32]  USE OF AI IN ONLINE CONTENT MODERATION , 2019 .

[33]  Weiwei Liu,et al.  Global for Coarse and Part for Fine: A Hierarchical Action Recognition Framework , 2018, 2018 25th IEEE International Conference on Image Processing (ICIP).

[34]  Brian M. Sadler,et al.  TaxoGen: Constructing Topical Concept Taxonomy by Adaptive Term Embedding and Clustering , 2018, KDD 2018.

[35]  Jiawei Han,et al.  A Spherical Hidden Markov Model for Semantics-Rich Human Mobility Modeling , 2018, AAAI.

[36]  Alberto Rodriguez,et al.  Learning Synergies Between Pushing and Grasping with Self-Supervised Deep Reinforcement Learning , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[37]  Gianluca Stringhini,et al.  What is Gab: A Bastion of Free Speech or an Alt-Right Echo Chamber , 2018, WWW.

[38]  Wanlei Zhou,et al.  Twitter spam detection: Survey of new approaches and comparative study , 2017, Comput. Secur..

[39]  Sergey Levine,et al.  Self-Supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[40]  Xiang Ren,et al.  Empower Sequence Labeling with Task-Aware Neural Language Model , 2017, AAAI.

[41]  Surendra Sedhai,et al.  Semi-Supervised Spam Detection in Twitter Stream , 2017, IEEE Transactions on Computational Social Systems.

[42]  Jiebo Luo,et al.  Determining Code Words in Euphemistic Hate Speech Using Word Embedding Networks , 2018, ALW.

[43]  Haoran Lu,et al.  Reading Thieves' Cant: Automatically Identifying and Understanding Dark Jargons from Cybercrime Marketplaces , 2018, USENIX Security Symposium.

[44]  Yi-Shin Chen,et al.  Surfacing contextual hate speech words within social media , 2017, ArXiv.

[45]  William Yang Wang,et al.  Learning to Explain Non-Standard English Words and Phrases , 2017, IJCNLP.

[46]  Jiawei Han,et al.  SetExpan: Corpus-Based Set Expansion via Context Feature Selection and Rank Ensemble , 2017, ECML/PKDD.

[47]  Roberto Navigli,et al.  Neural Sequence Learning Models for Word Sense Disambiguation , 2017, EMNLP.

[48]  Vern Paxson,et al.  Identifying Products in Online Cybercrime Marketplaces: A Dataset for Fine-grained Domain Adaptation , 2017, EMNLP.

[49]  Iyad Rahwan,et al.  Using millions of emoji occurrences to learn any-domain representations for detecting sentiment, emotion and sarcasm , 2017, EMNLP.

[50]  Guang Liu,et al.  How to Learn Klingon without a Dictionary: Detection and Measurement of Black Keywords Used by the Underground Economy , 2017, 2017 IEEE Symposium on Security and Privacy (SP).

[51]  Vern Paxson,et al.  Tools for Automated Analysis of Cybercriminal Markets , 2017, WWW.

[52]  Roberto Navigli,et al.  Word Sense Disambiguation: A Unified Evaluation Framework and Empirical Comparison , 2017, EACL.

[53]  Bowen Zhou,et al.  A Structured Self-attentive Sentence Embedding , 2017, ICLR.

[54]  Jitendra Malik,et al.  Combining self-supervised learning and imitation for vision-based rope manipulation , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[55]  Jun Zhang,et al.  Twitter spam detection based on deep learning , 2017, ACSW.

[56]  Tomas Mikolov,et al.  Enriching Word Vectors with Subword Information , 2016, TACL.

[57]  Hsinchun Chen,et al.  Chinese underground market jargon analysis based on unsupervised learning , 2016, 2016 IEEE Conference on Intelligence and Security Informatics (ISI).

[58]  Ignacio Iacobacci,et al.  Embeddings for Word Sense Disambiguation: An Evaluation Study , 2016, ACL.

[59]  Jure Leskovec,et al.  Inducing Domain-Specific Sentiment Lexicons from Unlabeled Corpora , 2016, EMNLP.

[60]  Zhe Chen,et al.  EgoSet: Exploiting Word Ego-networks and User-generated Ontology for Multifaceted Set Expansion , 2016, WSDM.

[61]  Nicolas Christin,et al.  Measuring the Longitudinal Evolution of the Online Anonymous Marketplace Ecosystem , 2015, USENIX Security Symposium.

[62]  Xiaofeng Wang,et al.  UIPicker: User-Input Privacy Identification in Mobile Applications , 2015, USENIX Security Symposium.

[63]  Xiangyu Zhang,et al.  SUPOR: Precise and Scalable Sensitive User Input Detection for Android Apps , 2015, USENIX Security Symposium.

[64]  Wei Xu,et al.  Bidirectional LSTM-CRF Models for Sequence Tagging , 2015, ArXiv.

[65]  Jun Zhao,et al.  Recurrent Convolutional Neural Networks for Text Classification , 2015, AAAI.

[66]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[67]  Hwee Tou Ng,et al.  Semi-Supervised Word Sense Disambiguation Using Word Embeddings in General and Specific Domains , 2015, NAACL.

[68]  Hussein Abdo Rababah The Translatability and Use of X-Phemism Expressions (X-Phemization): Euphemisms, Dysphemisms and Orthophemisms in the Medical Discourse , 2014 .

[69]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[70]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[71]  Omer Levy,et al.  Dependency-Based Word Embeddings , 2014, ACL.

[72]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[73]  Vern Paxson,et al.  Trafficking Fraudulent Accounts: The Role of the Underground Market in Twitter Spam and Abuse , 2013, USENIX Security Symposium.

[74]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[75]  Nicolas Christin,et al.  Traveling the silk road: a measurement analysis of a large anonymous online marketplace , 2012, WWW.

[76]  Stefan Savage,et al.  PharmaLeaks: Understanding the Business of Online Pharmaceutical Affiliate Programs , 2012, USENIX Security Symposium.

[77]  Karsten Berns,et al.  Kernel Multilayer Perceptron , 2011, 2011 24th SIBGRAPI Conference on Graphics, Patterns and Images.

[78]  Tyler Moore,et al.  Measuring and Analyzing Search-Redirection Attacks in the Illicit Online Prescription Drug Trade , 2011, USENIX Security Symposium.

[79]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[80]  Keith Allan The connotations of English colour terms: Colour-based X-phemisms , 2009 .

[81]  Eliécer Crespo Fernández The Language of Death: Euphemism and Conceptual Metaphorization in Victorian Obituaries , 2006 .

[82]  Helene Ahl Motivation in adult education: a problem solver or a euphemism for direction and control? , 2006 .

[83]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[84]  Jaana Kekäläinen,et al.  IR evaluation methods for retrieving highly relevant documents , 2000, SIGIR '00.

[85]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[86]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[87]  Kerry L. Pfaff,et al.  Metaphor in using and understanding euphemism and dysphemism , 1997, Applied Psycholinguistics.

[88]  Hugh Rawson Rawson's Dictionary of Euphemisms and Other Doubletalk: Being a Compilation of Linguistic Fig Leaves and Verbal Flourishes for Artful Users of the English Language , 1995 .

[89]  K. Allan,et al.  Euphemism and Dysphemism: Language Used as Shield and Weapon , 1991 .

[90]  D. Hosmer,et al.  Applied Logistic Regression , 1991 .

[91]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[92]  P. Bonacich Factoring and weighting approaches to status scores and clique identification , 1972 .

[93]  P. Bonacich TECHNIQUE FOR ANALYZING OVERLAPPING MEMBERSHIPS , 1972 .