ArSEL: A Large Scale Arabic Sentiment and Emotion Lexicon

With the advancement of Web 2.0, social networks experienced a great increase in the number of active users reaching 2 billion active users on Facebook at the end of 2017. Consequently, the size of text data on the Internet increased tremendously. This textual data is rich in knowledge, which attracted many data scientists as well as computational linguists to develop resources and models to automatically process the data and extract useful information. One major interest is sentiment and emotion classification from text. In fact, learning the opinion and emotions of people is important for businesses, marketers, government, politicians, etc. While focus had been given to sentiment analysis, recently emotion analysis has captured great interest as well. Several resources were developed for emotion analysis from text for English, however, very few targeted Arabic text. We present in this paper, ArSEL, the first large scale Arabic Sentiment and Emotion Lexicon. ArSEL is built in a way to augment the publicly available Arabic Sentiment Lexicon, ArSenL, and to generate a large scale lexicon that includes emotion and sentiment labels for almost every lemma in ArSenL. We also show the efficiency of using ArSEL in emotion regression and classification tasks using an Arabic translated version of annotated data from SemEval 2007 “Affective Task” as well as SemEval 2018 Task1 “Affect in Tweets” Arabic dataset. Coverages of 91% and 84% are achieved on the two datasets respectively. An improvement of 30% compared to majority baseline is achieved in terms of average F1 measure for emotion classification on SemEval 2018 Arabic dataset. ArSEL is publicly available on http://oma-project.com.

[1]  Nizar Habash,et al.  Introduction to Arabic Natural Language Processing , 2010, Introduction to Arabic Natural Language Processing.

[2]  Nizar Habash,et al.  A Large Scale Arabic Sentiment Lexicon for Arabic Opinion Mining , 2014, ANLP@EMNLP.

[3]  Nizar Habash,et al.  MADAMIRA: A Fast, Comprehensive Tool for Morphological Analysis and Disambiguation of Arabic , 2014, LREC.

[4]  R. Plutchik A GENERAL PSYCHOEVOLUTIONARY THEORY OF EMOTION , 1980 .

[5]  Hazem M. Hajj,et al.  Deep Learning Models for Sentiment Analysis in Arabic , 2015, ANLP@ACL.

[6]  Amit P. Sheth,et al.  Harnessing Twitter "Big Data" for Automatic Emotion Identification , 2012, 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing.

[7]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[8]  Nizar Habash,et al.  OMAM at SemEval-2017 Task 4: English Sentiment Analysis with Conditional Random Fields , 2017, SemEval@ACL.

[9]  Hazem M. Hajj,et al.  A Framework for Emotion Recognition from Human Computer Interaction in Natural Setting , 2016 .

[10]  Mitsuru Ishizuka,et al.  Textual Affect Sensing for Sociable and Expressive Online Communication , 2007, ACII.

[11]  Muhammad Abdul-Mageed,et al.  Modeling Arabic subjectivity and sentiment in lexical space , 2017, Inf. Process. Manag..

[12]  Hazem M. Hajj,et al.  A novel approach for emotion classification based on fusion of text and speech , 2012, 2012 19th International Conference on Telecommunications (ICT).

[13]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[14]  Hugo Liu,et al.  ConceptNet — A Practical Commonsense Reasoning Tool-Kit , 2004 .

[15]  Lei Zhang,et al.  A Survey of Opinion Mining and Sentiment Analysis , 2012, Mining Text Data.

[16]  Stewart Massie,et al.  Generating a Word-Emotion Lexicon from #Emotional Tweets , 2014, *SEMEVAL.

[17]  Hazem M. Hajj,et al.  AROMA: A Recursive Deep Learning Model for Opinion Mining in Arabic as a Low Resource Language , 2017, ACM Trans. Asian Low Resour. Lang. Inf. Process..

[18]  Hazem M. Hajj,et al.  Facial Action Unit and Emotion Recognition with Head Pose Variations , 2012, ADMA.

[19]  Saif Mohammad,et al.  NRC-Canada: Building the State-of-the-Art in Sentiment Analysis of Tweets , 2013, *SEMEVAL.

[20]  Gerald M. Knapp,et al.  Multimodal Affect Analysis for Product Feedback Assessment , 2017, ArXiv.

[21]  Erik Cambria,et al.  SenticNet 2: A Semantic and Affective Resource for Opinion Mining and Sentiment Analysis , 2012, FLAIRS.

[22]  Carlo Strapparava,et al.  WordNet Affect: an Affective Extension of WordNet , 2004, LREC.

[23]  Stewart Massie,et al.  Lexicon Generation for Emotion Detection from Text , 2017, IEEE Intelligent Systems.

[24]  Carlo Strapparava,et al.  SemEval-2007 Task 14: Affective Text , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[25]  A. Goldman,et al.  Simulationist models of face-based emotion recognition , 2005, Cognition.

[26]  J.R.L. Bernard,et al.  The Macquarie thesaurus : the book of words , 1986 .

[27]  Nizar Habash,et al.  A Sentiment Treebank and Morphologically Enriched Recursive Deep Models for Effective Sentiment Analysis in Arabic , 2017, ACM Trans. Asian Low Resour. Lang. Inf. Process..

[28]  Christiane Fellbaum,et al.  Introducing the Arabic WordNet project , 2006 .

[29]  Marco Guerini,et al.  Depeche Mood: a Lexicon for Emotion Analysis from Crowd Annotated News , 2014, ACL.

[30]  C. Nass,et al.  Emotion in human-computer interaction , 2002 .

[31]  Joseph S. Valacich,et al.  How Is Your User Feeling? Inferring Emotion Through Human-Computer interaction Devices , 2017, MIS Q..

[32]  Saif Mohammad,et al.  CROWDSOURCING A WORD–EMOTION ASSOCIATION LEXICON , 2013, Comput. Intell..

[33]  R. Plutchik The psychology and biology of emotion , 1994 .

[34]  Andrea Esuli,et al.  SentiWordNet: A High-Coverage Lexical Resource for Opinion Mining , 2006 .

[35]  Pascal Poncelet,et al.  FEEL: a French Expanded Emotion Lexicon , 2016, Language Resources and Evaluation.

[36]  Hazem M. Hajj,et al.  A Multiresolution Approach to Recommender Systems , 2014, SNAKDD'14.

[37]  Preslav Nakov,et al.  SemEval-2016 Task 4: Sentiment Analysis in Twitter , 2016, *SEMEVAL.

[38]  George N. Votsis,et al.  Emotion recognition in human-computer interaction , 2001, IEEE Signal Process. Mag..

[39]  Hazem M. Hajj,et al.  Emotion Recognition from Text Based on Automatically Generated Rules , 2014, 2014 IEEE International Conference on Data Mining Workshop.

[40]  Dipankar Das,et al.  Fuzzy Clustering for Semi-supervised Learning - Case Study: Construction of an Emotion Lexicon , 2012, MICAI.

[41]  Kathrin Knautz,et al.  MEMOSE: search engine for emotions in multimedia documents , 2010, SIGIR.

[42]  Saif Mohammad,et al.  Tracking Sentiment in Mail: How Genders Differ on Emotional Axes , 2011, WASSA@ACL.

[43]  L. Rothkrantz,et al.  Toward an affect-sensitive multimodal human-computer interaction , 2003, Proc. IEEE.

[44]  Hazem M. Hajj,et al.  A hybrid approach with collaborative filtering for recommender systems , 2013, 2013 9th International Wireless Communications and Mobile Computing Conference (IWCMC).

[45]  Martin Wegrzyn,et al.  Mapping the emotional face. How individual face parts contribute to successful emotion recognition , 2017, PloS one.

[46]  Houfeng Wang,et al.  Build Chinese Emotion Lexicons Using A Graph-based Algorithm and Multiple Resources , 2010, COLING.

[47]  Hatice Gunes,et al.  Bi-modal emotion recognition from expressive face and body gestures , 2007, J. Netw. Comput. Appl..

[48]  Muhammad Abdul-Mageed,et al.  SANA: A Large Scale Multi-Genre, Multi-Dialect Lexicon for Arabic Subjectivity and Sentiment Analysis , 2014, LREC.

[49]  Nicu Sebe,et al.  Multimodal Human Computer Interaction: A Survey , 2005, ICCV-HCI.

[50]  P. Ekman An argument for basic emotions , 1992 .

[51]  Torky I. Sultan,et al.  A Computational Approach for Analyzing and Detecting Emotions in Arabic Text , 2022 .

[52]  Saif Mohammad,et al.  Word Affect Intensities , 2017, LREC.

[53]  Dipankar Das,et al.  A Classifier Based Approach to Emotion Lexicon Construction , 2012, NLDB.

[54]  Hazem M. Hajj,et al.  A Light Lexicon-based Mobile Application for Sentiment Mining of Arabic Tweets , 2015, ANLP@ACL.

[55]  Mitsuru Ishizuka,et al.  SentiFul: A Lexicon for Sentiment Analysis , 2011, IEEE Transactions on Affective Computing.

[56]  Zhigang Deng,et al.  Analysis of emotion recognition using facial expressions, speech and multimodal information , 2004, ICMI '04.

[57]  Samhaa R. El-Beltagy,et al.  NileTMRG at SemEval-2017 Task 4: Arabic Sentiment Analysis , 2017, *SEMEVAL.

[58]  Muhammad Abdul-Mageed,et al.  EmoNet: Fine-Grained Emotion Detection with Gated Recurrent Neural Networks , 2017, ACL.

[59]  Nizar Habash,et al.  A Characterization Study of Arabic Twitter Data with a Benchmarking for State-of-the-Art Opinion Mining Models , 2017, WANLP@EACL.

[60]  Nizar Habash,et al.  An Efficient Model For Sentiment Classification Of Arabic Tweets On Mobiles , 2014 .

[61]  R. Pieters,et al.  Angry customers don't come back, they get back: The experience and behavioral implications of anger and dissatisfaction in services , 2003 .

[62]  Iyad Rahwan,et al.  Using millions of emoji occurrences to learn any-domain representations for detecting sentiment, emotion and sarcasm , 2017, EMNLP.

[63]  Maite Taboada,et al.  Lexicon-Based Methods for Sentiment Analysis , 2011, CL.

[64]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[65]  Hsin-Hsi Chen,et al.  Building Emotion Lexicon from Weblog Corpora , 2007, ACL.

[66]  Marshall S. Smith,et al.  The general inquirer: A computer approach to content analysis. , 1967 .

[67]  Hazem M. Hajj,et al.  Recommender Systems Using Harmonic Analysis , 2014, 2014 IEEE International Conference on Data Mining Workshop.