A comparative study of effective approaches for Arabic sentiment analysis

Abstract Sentiment analysis (SA) is a natural language processing (NLP) application that aims to analyse and identify sentiment within a piece of text. Arabic SA started to receive more attention in the last decade with many approaches showing some effectiveness for detecting sentiment on multiple datasets. While there have been some surveys summarising some of the approaches for Arabic SA in literature, most of these approaches are reported on different datasets, which makes it difficult to identify the most effective approaches among those. In addition, those approaches do not cover the recent advances in NLP that use transformers. This paper presents a comprehensive comparative study on the most effective approaches used for Arabic sentiment analysis. We re-implement most of the existing approaches for Arabic SA and test their effectiveness on three of the most popular benchmark datasets for Arabic SA. Further, we examine the use of transformer-based language models for Arabic SA and show their superior performance compared to the existing approaches, where the best model achieves F-score scores of 0.69, 0.76, and 0.92 on the SemEval, ASTD, and ArSAS benchmark datasets. We also apply an extensive analysis of the possible reasons for failures, which show the limitations of the existing annotated Arabic SA datasets, and the challenge of sarcasm that is prominent in Arabic dialects. Finally, we highlight the main gaps in Arabic sentiment analysis research and suggest the most in-need future research directions in this area.

[1]  Mahmoud Al-Ayyoub,et al.  Deep Recurrent neural network vs. support vector machine for aspect-based sentiment analysis of Arabic hotels' reviews , 2017, J. Comput. Sci..

[2]  Preslav Nakov,et al.  SemEval-2014 Task 9: Sentiment Analysis in Twitter , 2014, *SEMEVAL.

[3]  Lixin Tao,et al.  Word embeddings for Arabic sentiment analysis , 2016, 2016 IEEE International Conference on Big Data (Big Data).

[4]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[5]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[6]  Muhammad Abdul-Mageed,et al.  Subjectivity and Sentiment Analysis of Modern Standard Arabic , 2011, ACL.

[7]  Lei Zhang,et al.  Sentiment Analysis and Opinion Mining , 2017, Encyclopedia of Machine Learning and Data Mining.

[8]  Samhaa R. El-Beltagy,et al.  Combining Lexical Features and a Supervised Learning Approach for Arabic Sentiment Analysis , 2016, CICLing.

[9]  Preslav Nakov,et al.  SemEval-2013 Task 2: Sentiment Analysis in Twitter , 2013, *SEMEVAL.

[10]  Nikos Pelekis,et al.  DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis , 2017, *SEMEVAL.

[11]  Muhammad Abdul-Mageed,et al.  Modeling Arabic subjectivity and sentiment in lexical space , 2017, Inf. Process. Manag..

[12]  Siddharth Batra,et al.  Entity Based Sentiment Analysis on Twitter , 2010 .

[13]  Luke S. Zettlemoyer,et al.  Deep Contextualized Word Representations , 2018, NAACL.

[14]  Mahmoud Al-Ayyoub,et al.  Using long short-term memory deep neural networks for aspect-based sentiment analysis of Arabic reviews , 2018, International Journal of Machine Learning and Cybernetics.

[15]  Osmar R. Zaïane,et al.  Current State of Text Sentiment Analysis from Opinion to Emotion Mining , 2017, ACM Comput. Surv..

[16]  Taku Kudo,et al.  Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates , 2018, ACL.

[17]  Nadir Durrani,et al.  Farasa: A Fast and Furious Segmenter for Arabic , 2016, NAACL.

[18]  Mahmoud Al-Ayyoub,et al.  Lexicon-based sentiment analysis of Arabic tweets , 2015, Int. J. Soc. Netw. Min..

[19]  Samhaa R. El-Beltagy,et al.  NileTMRG at SemEval-2017 Task 4: Arabic Sentiment Analysis , 2017, *SEMEVAL.

[20]  Preslav Nakov,et al.  SemEval-2015 Task 10: Sentiment Analysis in Twitter , 2015, *SEMEVAL.

[21]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[22]  Imane Guellil,et al.  Arabic sentiment analysis: studies, resources, and tools , 2019, Social Network Analysis and Mining.

[23]  Reena Mahe,et al.  Study of Different Levels for Sentiment Analysis , 2015 .

[24]  Walid Magdy,et al.  From Arabic Sentiment Analysis to Sarcasm Detection: The ArSarcasm Dataset , 2020, OSACT.

[25]  Hazem Hajj,et al.  AraBERT: Transformer-based Model for Arabic Language Understanding , 2020, OSACT.

[26]  Preslav Nakov,et al.  SemEval-2016 Task 4: Sentiment Analysis in Twitter , 2016, *SEMEVAL.

[27]  Hazem M. Hajj,et al.  Deep Learning Models for Sentiment Analysis in Arabic , 2015, ANLP@ACL.

[28]  Abdel-Rahman Hedar,et al.  Sentiment Analysis of Arabic Slang Comments on Facebook , 2014, BIOINFORMATICS 2014.

[29]  Walid Magdy,et al.  Exploring Author Context for Detecting Intended vs Perceived Sarcasm , 2019, ACL.

[30]  Walid Magdy,et al.  Mazajak: An Online Arabic Sentiment Analyser , 2019, WANLP@ACL 2019.

[31]  Thomas Wolf,et al.  HuggingFace's Transformers: State-of-the-art Natural Language Processing , 2019, ArXiv.

[32]  Dirk Hovy,et al.  Putting Sarcasm Detection into Context: The Effects of Class Imbalance and Manual Labelling on Supervised Machine Classification of Twitter Conversations , 2016, ACL.

[33]  David Bamman,et al.  Contextualized Sarcasm Detection on Twitter , 2015, ICWSM.

[34]  Mathieu Cliche,et al.  BB_twtr at SemEval-2017 Task 4: Twitter Sentiment Analysis with CNNs and LSTMs , 2017, *SEMEVAL.

[35]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[36]  AbdelRahim A. Elmadany,et al.  ArSAS : An Arabic Speech-Act and Sentiment Corpus of Tweets , 2018 .

[37]  Samhaa R. El-Beltagy,et al.  AraVec: A set of Arabic Word Embedding Models for use in Arabic NLP , 2017, ACLING.

[38]  Sebastian Ruder,et al.  Universal Language Model Fine-tuning for Text Classification , 2018, ACL.

[39]  Muhammad Abdul-Mageed,et al.  SAMAR: Subjectivity and sentiment analysis for Arabic social media , 2014, Comput. Speech Lang..

[40]  Amir F. Atiya,et al.  ASTD: Arabic Sentiment Tweets Dataset , 2015, EMNLP.

[41]  Saif Mohammad,et al.  NRC-Canada: Building the State-of-the-Art in Sentiment Analysis of Tweets , 2013, *SEMEVAL.

[42]  Nizar Habash,et al.  Introduction to Arabic Natural Language Processing , 2010, Introduction to Arabic Natural Language Processing.

[43]  Walid Magdy,et al.  Language processing for arabic microblog retrieval , 2012, CIKM.

[44]  Nora Al-Twairesh,et al.  Surface and Deep Features Ensemble for Sentiment Analysis of Arabic Tweets , 2019, IEEE Access.

[45]  Hazem M. Hajj,et al.  hULMonA: The Universal Language Model in Arabic , 2019, WANLP@ACL 2019.

[46]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[47]  Patrizia Grifoni,et al.  Approaches, Tools and Applications for Sentiment Analysis Implementation , 2015 .

[48]  Ming Zhou,et al.  Coooolll: A Deep Learning System for Twitter Sentiment Classification , 2014, *SEMEVAL.

[49]  Johanna D. Moore,et al.  Twitter Sentiment Analysis: The Good the Bad and the OMG! , 2011, ICWSM.

[50]  Antonio Moreno,et al.  SiTAKA at SemEval-2017 Task 4: Sentiment Analysis in Twitter Based on a Rich Set of Features , 2017, *SEMEVAL.

[51]  Abed Allah Khamaiseh,et al.  A comprehensive survey of arabic sentiment analysis , 2019, Inf. Process. Manag..

[52]  Nizar Habash,et al.  A Survey of Opinion Mining in Arabic , 2019, ACM Trans. Asian Low Resour. Lang. Inf. Process..

[53]  Khaled Shaalan,et al.  A Survey of Arabic Text Mining , 2018 .

[54]  Nizar Habash,et al.  MADAMIRA: A Fast, Comprehensive Tool for Morphological Analysis and Disambiguation of Arabic , 2014, LREC.

[55]  Farah Benamara,et al.  SOUKHRIA: Towards an Irony Detection System for Arabic in Social Media , 2017, ACLING.

[56]  Tomas Mikolov,et al.  Enriching Word Vectors with Subword Information , 2016, TACL.

[57]  Mahmoud Al-Ayyoub,et al.  Arabic sentiment analysis: Lexicon-based and corpus-based , 2013, 2013 IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies (AEECT).

[58]  José-Ángel González,et al.  ELiRF-UPV at SemEval-2017 Task 4: Sentiment Analysis using Deep Learning , 2017, SemEval@ACL.

[59]  Benoît Favre,et al.  SENSEI-LIF at SemEval-2016 Task 4: Polarity embedding fusion for robust sentiment analysis , 2016, SemEval@NAACL-HLT.

[60]  Haris Papageorgiou,et al.  SemEval-2016 Task 5: Aspect Based Sentiment Analysis , 2016, *SEMEVAL.

[61]  Shuai Wang,et al.  Deep learning for sentiment analysis: A survey , 2018, WIREs Data Mining Knowl. Discov..

[62]  Richard Socher,et al.  Regularizing and Optimizing LSTM Language Models , 2017, ICLR.

[63]  Matthew England,et al.  A Combined CNN and LSTM Model for Arabic Sentiment Analysis , 2018, CD-MAKE.

[64]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[65]  Matthias Hagen,et al.  Webis: An Ensemble for Twitter Sentiment Detection , 2015, *SEMEVAL.

[66]  Ines Abbes,et al.  DAICT: A Dialectal Arabic Irony Corpus Extracted from Twitter , 2020, LREC.

[67]  Ellen Riloff,et al.  Sarcasm as Contrast between a Positive Sentiment and Negative Situation , 2013, EMNLP.

[68]  Aurélien Lucchi,et al.  SwissCheese at SemEval-2016 Task 4: Sentiment Classification Using an Ensemble of Convolutional Neural Networks with Distant Supervision , 2016, *SEMEVAL.

[69]  Amir F. Atiya,et al.  LABR: A Large Scale Arabic Book Reviews Dataset , 2013, ACL.

[70]  Nagwa M. El-Makky,et al.  Sentiment Analysis of Arabic Tweets using Deep Learning , 2018, ACLING.

[71]  Tomoko Ohkuma,et al.  TeamX: A Sentiment Analyzer with Enhanced Lexicon Mapping and Weighting Scheme for Unbalanced Data , 2014, *SEMEVAL.

[72]  Weitong Chen,et al.  A survey of sentiment analysis in social media , 2018, Knowledge and Information Systems.

[73]  Samhaa R. El-Beltagy,et al.  NileULex: A Phrase and Word Level Sentiment Lexicon for Egyptian and Modern Standard Arabic , 2016, LREC.

[74]  Patrick Paroubek,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2010, LREC.

[75]  Matthew England,et al.  Arabic language sentiment analysis on health services , 2017, 2017 1st International Workshop on Arabic Script Analysis and Recognition (ASAR).

[76]  Pengfei Duan,et al.  Word Embeddings and Convolutional Neural Network for Arabic Sentiment Classification , 2016, COLING.

[77]  Ayoub Ait Lahcen,et al.  ASA: A framework for Arabic sentiment analysis , 2020, J. Inf. Sci..

[78]  Anazida Zainal,et al.  A Review on Challenging Issues in Arabic Sentiment Analysis , 2016, J. Comput. Sci..

[79]  Muhammad Abdul-Mageed Not All Segments are Created Equal: Syntactically Motivated Sentiment Analysis in Lexical Space , 2017, WANLP@EACL.

[80]  Muhammad Abdul-Mageed,et al.  AWATIF: A Multi-Genre Corpus for Modern Standard Arabic Subjectivity and Sentiment Analysis , 2012, LREC.

[81]  Luigi Di Caro,et al.  Sentiment analysis via dependency parsing , 2013, Comput. Stand. Interfaces.

[82]  Doaa Mohey El Din Mohamed Hussein,et al.  A survey on sentiment analysis challenges , 2016, Journal of King Saud University - Engineering Sciences.