DLJUST at SemEval-2021 Task 7: Hahackathon: Linking Humor and Offense

Humor detection and rating poses interesting linguistic challenges to NLP; it is highly subjective depending on the perceptions of a joke and the context in which it is used. This paper utilizes and compares transformers models; BERT base and Large, BERTweet, RoBERTa base and Large, and RoBERTa base irony, for detecting and rating humor and offense. The proposed models, where given a text in cased and uncased type obtained from SemEval-2021 Task7: HaHackathon: Linking Humor and Offense Across Different Age Groups. The highest scored model for the first subtask: Humor Detection, is BERTweet base cased model with 0.9540 F1-score, for the second subtask: Average Humor Rating Score, it is BERT Large cased with the minimum RMSE of 0.5555, for the fourth subtask: Average Offensiveness Rating Score, it is BERTweet base cased model with minimum RMSE of 0.4822.

[1]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[2]  Leonardo Neves,et al.  TweetEval: Unified Benchmark and Comparative Evaluation for Tweet Classification , 2020, FINDINGS.

[3]  Mahmoud Hammad,et al.  MLEngineer at SemEval-2020 Task 7: BERT-Flair Based Humor Detection Model (BFHumor) , 2020, SemEval@COLING.

[4]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[5]  Sanja Fidler,et al.  Aligning Books and Movies: Towards Story-Like Visual Explanations by Watching Movies and Reading Books , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[6]  Wanli Liu,et al.  A BERT-based Approach for Automatic Humor Detection and Scoring , 2019, IberLEF@SEPLN.

[7]  Yurii Nesterov,et al.  Introductory Lectures on Convex Optimization - A Basic Course , 2014, Applied Optimization.

[8]  Ari Rappoport,et al.  Semi-Supervised Recognition of Sarcasm in Twitter and Amazon , 2010, CoNLL.

[9]  Pieter Delobelle,et al.  Dutch Humor Detection by Generating Negative Examples , 2020, ArXiv.

[10]  Von-Wun Soo,et al.  Humor Recognition Using Deep Learning , 2018, NAACL.

[11]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[12]  Dat Quoc Nguyen,et al.  BERTweet: A pre-trained language model for English Tweets , 2020, EMNLP.

[13]  Pramodith Ballapuram LMML at SemEval-2020 Task 7: Siamese Transformers for Rating Humor in Edited News Headlines , 2020, SemEval@COLING.

[14]  Walid Magdy,et al.  SemEval 2021 Task 7: HaHackathon, Detecting and Rating Humor and Offense , 2021, SEMEVAL.

[15]  Issa Annamoradnejad,et al.  ColBERT: Using BERT Sentence Embedding for Humor Detection , 2020, ArXiv.

[16]  Yiming Yang,et al.  XLNet: Generalized Autoregressive Pretraining for Language Understanding , 2019, NeurIPS.

[17]  Suraj Tripathi,et al.  Deep Learning Techniques for Humor Detection in Hindi-English Code-Mixed Tweets , 2019, WASSA@NAACL-HLT.

[18]  Alon Rozental,et al.  Amobee at SemEval-2020 Task 7: Regularization of Language Model Based Classifiers , 2020, SemEval@COLING.

[19]  Thomas Wolf,et al.  HuggingFace's Transformers: State-of-the-art Natural Language Processing , 2019, ArXiv.

[20]  Nikos Pelekis,et al.  DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis , 2017, *SEMEVAL.