EPIC: Multi-Perspective Annotation of a Corpus of Irony

We present EPIC (English Perspectivist Irony Corpus), the first annotated corpus for irony analysis based on the principles of data perspectivism. The corpus contains short conversations from social media in five regional varieties of English, annotated by contributors from the five countries corresponding to those varieties. We analyse the resource along the perspectives induced by the diversity of the annotators in terms of origin, age, and gender, and examine the relationship between these dimensions, irony, and the topics of conversation. We validate EPIC by creating perspective-aware models that encode the perspectives of annotators grouped according to their demographic characteristics. Firstly, the performance of the perspectivist models confirms that different annotators induce very different models. Secondly, in the classification of ironic and non-ironic texts, perspectivist models prove to be generally more confident than non-perspectivist ones. Furthermore, comparing the performance on a perspective-based test set with that achieved on a gold standard test set, we observe that perspectivist models tend to detect the positive class more precisely, showing their ability to capture the different perceptions of irony. These models also yield insights into how the perception of irony varies across groups of annotators, such as different generations and nationalities.
