Tsia at SemEval-2021 Task 7: Detecting and Rating Humor and Offense

This paper describes our contribution to SemEval-2021 Task 7: Detecting and Rating Humor and Offense. The task consists of two sub-tasks, sub-task 1 and sub-task 2, and sub-task 1 is further divided into sub-tasks 1a, 1b, and 1c. Sub-task 1a asks whether a text would be considered humorous. Sub-task 1c asks, for texts classed as humorous, whether the humor rating would be considered controversial, i.e., whether the variance of the ratings among annotators is higher than the median. We combine three pre-trained models with a CNN to complete these two classification sub-tasks. Sub-task 1b is to predict the degree of humor, and sub-task 2 is to predict how offensive a text is, with values between 0 and 5; we treat both of these sub-tasks as regression problems. We analyze the performance of our method and demonstrate the contribution of each component of our architecture, achieving good results by combining multiple pre-trained models and optimization methods.
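As a rough illustration of the "pre-trained model with CNN" architecture the abstract describes, the sketch below wires a Hugging Face encoder to a convolutional classification head in PyTorch. It is not the authors' released code; the model name, filter sizes, and head dimensions are illustrative assumptions, and for the regression sub-tasks the output size would simply be set to 1.

```python
# Hedged sketch: pre-trained encoder + CNN head for humor classification.
# Model name, kernel sizes, and filter counts are assumptions, not the
# paper's exact configuration.
import torch
import torch.nn as nn
from transformers import AutoModel

class EncoderCNNClassifier(nn.Module):
    def __init__(self, model_name="bert-base-uncased", num_labels=2,
                 num_filters=128, kernel_sizes=(3, 4, 5)):
        super().__init__()
        self.encoder = AutoModel.from_pretrained(model_name)
        hidden = self.encoder.config.hidden_size
        # One 1-D convolution per kernel size over the token dimension.
        self.convs = nn.ModuleList(
            nn.Conv1d(hidden, num_filters, k) for k in kernel_sizes
        )
        self.classifier = nn.Linear(num_filters * len(kernel_sizes), num_labels)

    def forward(self, input_ids, attention_mask):
        # (batch, seq_len, hidden) -> (batch, hidden, seq_len) for Conv1d.
        tokens = self.encoder(input_ids,
                              attention_mask=attention_mask).last_hidden_state
        tokens = tokens.transpose(1, 2)
        # Max-pool each feature map over time, then concatenate and classify.
        pooled = [torch.relu(conv(tokens)).max(dim=2).values
                  for conv in self.convs]
        return self.classifier(torch.cat(pooled, dim=1))
```

Ensembling the three encoders mentioned in the abstract (e.g., BERT, RoBERTa, ALBERT) could then be done by averaging the logits of several such models, one per encoder.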
