UMUTeam at EmoEvalEs 2021: Emotion Analysis for Spanish based on Explainable Linguistic Features and Transformers

Emotion Analysis extends the idea of Sentiment Analysis by shifting from plain positive or negative sentiments to a rich variety of emotions to get better understanding of the users’ thoughts and appraisals. The move from Sentiment Analysis to Emotion Analysis requires, however, better feature engineering techniques when it comes to capturing complex language phenomena, which have to do with figurative language and the way of expressing oneself. In this manuscript we detail the participation of the UMUTeam in EmoEvalEs’2021 shared task from IberLEF, concerning the identification of emotions in Spanish. Our proposal is grounded on the combination of explainable linguistic features and state-of-the-art transformers based on the Spanish version of BERT. We achieved the 6th position in the official leader board with an accuracy of 68.5990%, only 4.1667% below the best result. In addition, we apply model agnostic techniques for explainable artificial intelligence to achieve insights from the linguistic features. We observed a correlation between psycho-linguistic processes and perceptual feel with the emotions evaluated and, specifically, with documents labelled as sadness.

[1]  Rafael Valencia-García,et al.  Ontology-driven aspect-based sentiment analysis classification: An infodemiological case study regarding infectious diseases in Latin America , 2020, Future Generation Computer Systems.

[2]  Rafael Valencia-García,et al.  Review of English literature on figurative language applied to social networks , 2019, Knowledge and Information Systems.

[3]  Miguel Ángel Rodríguez-García,et al.  Automatic detection of satire in Twitter: A psycholinguistic-based approach , 2017, Knowl. Based Syst..

[4]  Christopher D. Manning,et al.  Stanza: A Python Natural Language Processing Toolkit for Many Human Languages , 2020, ACL.

[5]  Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2021) co-located with the Conference of the Spanish Society for Natural Language Processing (SEPLN 2021), XXXVII International Conference of the Spanish Society for Natural Language Processing., Málaga, Spain, September, 2021 , 2021, IberLEF@SEPLN.

[6]  Luis Alfonso Ureña López,et al.  Improved emotion recognition in Spanish social media through incorporation of lexical knowledge , 2020, Future Gener. Comput. Syst..

[7]  J. Pennebaker,et al.  The Psychological Meaning of Words: LIWC and Computerized Text Analysis Methods , 2010 .

[8]  Roman Klinger,et al.  A Survey on Sentiment and Emotion Analysis for Computational Literary Studies , 2018, ArXiv.

[9]  Carlo Strapparava,et al.  EmoEvent: A Multilingual Emotion Corpus based on different Events , 2020, LREC.

[10]  Osnat Mokryn,et al.  Sharing emotions: determining films’ evoked emotional experience from their online reviews , 2020, Information Retrieval Journal.

[11]  José Manuel Perea Ortega,et al.  Semantic orientation for polarity classification in Spanish reviews , 2013, Expert Syst. Appl..

[12]  Miguel Ángel García Cumbreras,et al.  Overview of TASS 2020: Introducing Emotion Detection , 2020, IberLEF@SEPLN.

[13]  Peter D. Turney,et al.  Emotions Evoked by Common Words and Phrases: Using Mechanical Turk to Create an Emotion Lexicon , 2010, HLT-NAACL 2010.

[14]  Luis Alfonso Ureña López,et al.  Overview of the EmoEvalEs task on emotion detection for Spanish at IberLEF 2021 , 2021, Proces. del Leng. Natural.

[15]  Alexander F. Gelbukh,et al.  Empirical Study of Machine Learning Based Approach for Opinion Mining in Tweets , 2012, MICAI.

[16]  Mario Andrés Paredes-Valverde,et al.  Sentiment Analysis in Spanish for Improvement of Products and Services: A Deep Learning Approach , 2017, Sci. Program..

[17]  Iryna Gurevych,et al.  Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks , 2019, EMNLP.

[18]  Rafael Valencia-García,et al.  UMUTeam at TASS 2020: Combining Linguistic Features and Machine-learning Models for Sentiment Classification , 2020, IberLEF@SEPLN.

[19]  Rafael Valencia-García,et al.  Detecting misogyny in Spanish tweets. An approach based on linguistics features and word embeddings , 2021, Future Gener. Comput. Syst..

[20]  Jon Rokne,et al.  Emotion detection from text and speech: a survey , 2018, Social Network Analysis and Mining.

[21]  Paul Ekman,et al.  Lie Catching and Microexpressions , 2009 .

[22]  J. Russell A circumplex model of affect. , 1980 .

[23]  Lysandre Debut,et al.  HuggingFace's Transformers: State-of-the-art Natural Language Processing , 2019, ArXiv.