Speak up, Fight Back! Detection of Social Media Disclosures of Sexual Harassment

The #MeToo movement is an ongoing prevalent phenomenon on social media aiming to demonstrate the frequency and widespread of sexual harassment by providing a platform to speak narrate personal experiences of such harassment. The aggregation and analysis of such disclosures pave the way to development of technology-based prevention of sexual harassment. We contend that the lack of specificity in generic sentence classification models may not be the best way to tackle text subtleties that intrinsically prevail in a classification task as complex as identifying disclosures of sexual harassment. We propose the Disclosure Language Model, a three part ULMFiT architecture, consisting of a Language model, a Medium-Specific (Twitter) model and a Task-Specific classifier to tackle this problem and create a manually annotated real-world dataset to test our technique on this, to show that using a Discourse Language Model often yields better classification performance over (i) Generic deep learning based sentence classification models (ii) existing models that rely on handcrafted stylistic features. An extensive comparison with state-of-the-art generic and specific models along with a detailed error analysis presents the case for our proposed methodology.

[1]  Duc-Thuan Vo,et al.  Exploiting Language Models to Classify Events from Twitter , 2015, Comput. Intell. Neurosci..

[2]  Richard Socher,et al.  Regularizing and Optimizing LSTM Language Models , 2017, ICLR.

[3]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[4]  Sebastian Ruder,et al.  Fine-tuned Language Models for Text Classification , 2018, ArXiv.

[5]  Xuejie Zhang,et al.  YNU-HPCC at Semeval-2018 Task 11: Using an Attention-based CNN-LSTM for Machine Comprehension using Commonsense Knowledge , 2018, SemEval@NAACL-HLT.

[6]  Swati Aggarwal,et al.  A Computational Approach to Feature Extraction for Identification of Suicidal Ideation in Tweets , 2018, ACL.

[7]  Roger Zimmermann,et al.  A Multimodal Approach to Predict Social Media Popularity , 2018, 2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR).

[8]  Diyi Yang,et al.  Hierarchical Attention Networks for Document Classification , 2016, NAACL.

[9]  Dr. med. Rajiv Shah,et al.  Multimodal Analysis of User-Generated Multimedia Content , 2017, Socio-Affective Computing.

[10]  Tomas Mikolov,et al.  Bag of Tricks for Efficient Text Classification , 2016, EACL.

[11]  Zhiyuan Liu,et al.  A C-LSTM Neural Network for Text Classification , 2015, ArXiv.

[12]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[13]  Christopher Potts,et al.  Learning Word Vectors for Sentiment Analysis , 2011, ACL.

[14]  Yann LeCun,et al.  Very Deep Convolutional Networks for Text Classification , 2016, EACL.

[15]  Xiong Luo,et al.  An LSTM Approach to Short Text Sentiment Classification with Word Embeddings , 2018, ROCLING/IJCLCLP.

[16]  Cecilia Ovesdotter Alm,et al.  An Analysis of Domestic Abuse Discourse on Reddit , 2015, EMNLP.

[17]  Sebastian Ruder,et al.  Universal Language Model Fine-tuning for Text Classification , 2018, ACL.

[18]  Ramit Sawhney,et al.  Detecting Offensive Tweets in Hindi-English Code-Switched Language , 2018, SocialNLP@ACL.

[19]  Erik Cambria,et al.  International Conference on Advances in Social Networks Analysis and Mining ( ASONAM ) Sounds of Silence Breakers : Exploring Sexual Violence on Twitter , 2018 .

[20]  Allen Schmaltz,et al.  On the Utility of Lay Summaries and AI Safety Disclosures: Toward Robust, Open Research Oversight , 2018, EthNLP@NAACL-HLT.

[21]  Jun Zhao,et al.  Recurrent Convolutional Neural Networks for Text Classification , 2015, AAAI.

[22]  Natalie McClain,et al.  Female Survivors of Child Sexual Abuse: Finding Voice through Research Participation , 2013, Issues in mental health nursing.

[23]  Bun-Hee Lee,et al.  #Me Too Movement; It Is Time That We All Act and Participate in Transformation , 2018, Psychiatry investigation.

[24]  Subbarao Kambhampati,et al.  Tweeting the Mind and Instagramming the Heart: Exploring Differentiated Content Sharing on Social Media , 2016, ICWSM.

[25]  Munmun De Choudhury,et al.  Understanding Social Media Disclosures of Sexual Abuse Through the Lenses of Support Seeking and Anonymity , 2016, CHI.

[26]  Razvan Pascanu,et al.  On the difficulty of training recurrent neural networks , 2012, ICML.

[27]  Tong Zhang,et al.  Deep Pyramid Convolutional Neural Networks for Text Categorization , 2017, ACL.

[28]  Rajib Rana,et al.  Gated Recurrent Unit (GRU) for Emotion Classification from Noisy Speech , 2016, ArXiv.

[29]  Ramit Sawhney,et al.  Exploring and Learning Suicidal Ideation Connotations on Social Media with Deep Learning , 2018, WASSA@EMNLP.

[30]  Alan Ritter,et al.  Unsupervised Modeling of Twitter Conversations , 2010, NAACL.

[31]  Xuanjing Huang,et al.  Recurrent Neural Network for Text Classification with Multi-Task Learning , 2016, IJCAI.

[32]  John R. Talburt,et al.  From Chirps to Whistles: Discovering Event-specific Informative Content from Twitter , 2015, WebSci.

[33]  Lindsay M. Orchowski,et al.  Sexual Violence Is #NotOkay: Social Reactions to Disclosures of Sexual Victimization on Twitter , 2019, Psychology of Violence.

[34]  Vasudeva Varma,et al.  Deep Learning for Hate Speech Detection in Tweets , 2017, WWW.

[35]  Christopher D. Manning,et al.  Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks , 2015, ACL.

[36]  Tara Black,et al.  The utility of Twitter as a tool for increasing reach of research on sexual violence. , 2018, Child abuse & neglect.

[37]  Rajiv Ratn Shah,et al.  Detecting Personal Intake of Medicine from Twitter , 2018, IEEE Intelligent Systems.

[38]  Xiang Zhang,et al.  Character-level Convolutional Networks for Text Classification , 2015, NIPS.

[39]  Leslie N. Smith,et al.  Cyclical Learning Rates for Training Neural Networks , 2015, 2017 IEEE Winter Conference on Applications of Computer Vision (WACV).

[40]  Wael Khreich,et al.  A Survey of Techniques for Event Detection in Twitter , 2015, Comput. Intell..