Hostility Detection in Hindi leveraging Pre-Trained Language Models

Hostile content on social platforms is ever increasing. This has led to the need for proper detection of hostile posts so that appropriate action can be taken to tackle them. Though a lot of work has been done recently in the English Language to solve the problem of hostile content online, similar works in Indian Languages are quite hard to find. This paper presents a transfer learning based approach to classify social media (i.e Twitter, Facebook, etc.) posts in Hindi Devanagari script as Hostile or Non-Hostile. Hostile posts are further analyzed to determine if they are Hateful, Fake, Defamation, and Offensive. This paper harnesses attention based pre-trained models fine-tuned on Hindi data with Hostile-Non hostile task as Auxiliary and fusing its features for further sub-tasks classification. Through this approach, we establish a robust and consistent model without any ensembling or complex pre-processing. We have presented the results from our approach in CONSTRAINT-2021 Shared Task[21] on hostile post detection where our model performs extremely well with 3rd runner up in terms of Weighted Fine-Grained F1 Score.

[1]  Ingmar Weber,et al.  Understanding Abuse: A Typology of Abusive Language Detection Subtasks , 2017, ALW@ACL.

[2]  P. Hrudya,et al.  DHOT-Repository and Classification of Offensive Tweets in the Hindi Language , 2020 .

[3]  Raviraj Joshi,et al.  Deep Learning for Hindi Text Classification: A Comparison , 2019, IHCI.

[4]  Ingmar Weber,et al.  Racial Bias in Hate Speech and Abusive Language Detection Datasets , 2019, Proceedings of the Third Workshop on Abusive Language Online.

[5]  Ayush Kaushal,et al.  Winners at W-NUT 2020 Shared Task-3: Leveraging Event Specific and Chunk Span information for Extracting COVID Entities from Tweets , 2020, W-NUT@EMNLP.

[6]  Savvas Zannettou,et al.  "And We Will Fight For Our Race!" A Measurement Study of Genetic Testing Conversations on Reddit and 4chan , 2019, ICWSM.

[7]  K. P. Soman,et al.  Detection of Hate Speech Text in Hindi-English Code-mixed Data , 2020 .

[8]  Alec Radford,et al.  Improving Language Understanding by Generative Pre-Training , 2018 .

[9]  Thomas Wolf,et al.  HuggingFace's Transformers: State-of-the-art Natural Language Processing , 2019, ArXiv.

[10]  Ingmar Weber,et al.  Automated Hate Speech Detection and the Problem of Offensive Language , 2017, ICWSM.

[11]  Krishnaprasad Thirunarayan,et al.  ALONE: A Dataset for Toxic Behavior among Adolescents on Twitter , 2020, SocInfo.

[12]  Shervin Malmasi,et al.  Challenges in discriminating profanity from hate speech , 2017, J. Exp. Theor. Artif. Intell..

[13]  Andreas Vlachos,et al.  FEVER: a Large-scale Dataset for Fact Extraction and VERification , 2018, NAACL.

[14]  Asif Ekbal,et al.  Hostility Detection Dataset in Hindi , 2020, ArXiv.

[15]  Joel R. Tetreault,et al.  Abusive Language Detection in Online User Content , 2016, WWW.

[16]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[17]  Kumar Shridhar,et al.  Indic-Transformers: An Analysis of Transformer Language Models for Indian Languages , 2020, ArXiv.

[18]  Bernard J. Jansen,et al.  A Multi-Platform Arabic News Comment Dataset for Offensive Language Detection , 2020, LREC.

[19]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[20]  Virgílio A. F. Almeida,et al.  Analyzing Right-wing YouTube Channels: Hate, Violence and Discrimination , 2018, WebSci.

[21]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[22]  Thamar Solorio,et al.  Aggression and Misogyny Detection using BERT: A Multi-Task Approach , 2020, TRAC.

[23]  Asif Ekbal,et al.  Overview of CONSTRAINT 2021 Shared Tasks: Detecting English COVID-19 Fake News and Hindi Hostile Posts , 2021, CONSTRAINT@AAAI.

[24]  Ayush Kaushal,et al.  Leveraging Event Specific and Chunk Span features to Extract COVID Events from tweets , 2020, ArXiv.

[25]  Md Saiful Islam,et al.  BanFakeNews: A Dataset for Detecting Fake News in Bangla , 2020, LREC.

[26]  Omer Levy,et al.  RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.

[27]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[28]  Mitesh M. Khapra,et al.  iNLPSuite: Monolingual Corpora, Evaluation Benchmarks and Pre-trained Multilingual Language Models for Indian Languages , 2020, FINDINGS.