Enhancing Aspect-Based Sentiment Analysis of Arabic Hotels' reviews using morphological, syntactic and semantic features

Abstract This research presents an enhanced approach for Aspect-Based Sentiment Analysis (ABSA) of Hotels’ Arabic reviews using supervised machine learning. The proposed approach employs a state-of-the-art research of training a set of classifiers with morphological, syntactic, and semantic features to address the research tasks namely: (a) T1:Aspect Category Identification, (b) T2:Opinion Target Expression (OTE) Extraction, and (c) T3: Sentiment Polarity Identification. Employed classifiers include Naive Bayes, Bayes Networks, Decision Tree, K-Nearest Neighbor (K-NN), and Support-Vector Machine (SVM).The approach was evaluated using a reference dataset based on Semantic Evaluation 2016 workshop (SemEval-2016: Task-5). Results show that the supervised learning approach outperforms related work evaluated using the same dataset. More precisely, evaluation results show that all classifiers in the proposed approach outperform the baseline approach, and the overall enhancement for the best performing classifier (SVM) is around 53% for T1, around 59% for T2, and around 19% in T3.

[1]  Khurshid Ahmad,et al.  Multi-lingual Sentiment Analysis of Financial News Streams , 2007 .

[2]  Mahmoud Al-Ayyoub,et al.  Arabic sentiment analysis: Lexicon-based and corpus-based , 2013, 2013 IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies (AEECT).

[3]  Udo Kruschwitz,et al.  AraNLP: a Java-based Library for the Processing of Arabic Text , 2014, LREC.

[4]  Hend Suliman Al-Khalifa,et al.  Subjectivity and sentiment analysis of Arabic: Trends and challenges , 2014, 2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA).

[5]  Mahmoud Al-Ayyoub,et al.  Enhancing the determination of aspect categories and their polarities in Arabic reviews using lexicon-based approaches , 2015, 2015 IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies (AEECT).

[6]  Eric Chang,et al.  Red Opal: product-feature scoring from reviews , 2007, EC '07.

[7]  Muhannad Quwaider,et al.  Human Annotated Arabic Dataset of Book Reviews for Aspect Based Sentiment Analysis , 2015, 2015 3rd International Conference on Future Internet of Things and Cloud.

[8]  R. M. Chandrasekaran,et al.  A sampling based sentiment mining approach for e-commerce applications , 2017, Inf. Process. Manag..

[9]  Antonio Ruiz-Martínez,et al.  Feature-based opinion mining in financial news: An ontology-driven approach , 2017, J. Inf. Sci..

[10]  C. Metz Basic principles of ROC analysis. , 1978, Seminars in nuclear medicine.

[11]  Peter D. Turney,et al.  Emotions Evoked by Common Words and Phrases: Using Mechanical Turk to Create an Emotion Lexicon , 2010, HLT-NAACL 2010.

[12]  Xiaojun Wan,et al.  Opinion Target Extraction in Chinese News Comments , 2010, COLING.

[13]  Mahmoud Al-Ayyoub,et al.  An extended analytical study of Arabic sentiments , 2014, Int. J. Big Data Intell..

[14]  Muhammad Abdul-Mageed,et al.  SAMAR: Subjectivity and sentiment analysis for Arabic social media , 2014, Comput. Speech Lang..

[15]  John Blitzer,et al.  Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification , 2007, ACL.

[16]  German Rigau,et al.  IXA pipeline: Efficient and Ready to Use Multilingual NLP tools , 2014, LREC.

[17]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[18]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[19]  Claire Cardie,et al.  OpinionFinder: A System for Subjectivity Analysis , 2005, HLT.

[20]  Robert L. Mercer,et al.  Class-Based n-gram Models of Natural Language , 1992, CL.

[21]  Verena Rieser,et al.  An Arabic Twitter Corpus for Subjectivity and Sentiment Analysis , 2014, LREC.

[22]  Madhavi Devaraj,et al.  Analytical mapping of opinion mining and sentiment analysis research during 2000-2015 , 2017, Inf. Process. Manag..

[23]  Michael Collins,et al.  Discriminative Training Methods for Hidden Markov Models: Theory and Experiments with Perceptron Algorithms , 2002, EMNLP.

[24]  Alaa M. El-Halees,et al.  Arabic Opinion Mining Using Combined Classification Approach , 2011 .

[25]  Edward A. Fox,et al.  Automated arabic text classification with P‐Stemmer, machine learning, and a tailored news article taxonomy , 2016, J. Assoc. Inf. Sci. Technol..

[26]  Maryna Chernyshevich,et al.  IHS R&D Belarus: Cross-domain Extraction of Product Features using Conditional Random Fields , 2014 .

[27]  Luis Alfonso Ureña López,et al.  OCA: Opinion corpus for Arabic , 2011, J. Assoc. Inf. Sci. Technol..

[28]  Amir F. Atiya,et al.  LABR: A Large Scale Arabic Book Reviews Dataset , 2013, ACL.

[29]  Muhammad Abdul-Mageed,et al.  Subjectivity and Sentiment Analysis of Modern Standard Arabic , 2011, ACL.

[30]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[31]  Lei Zhang,et al.  Sentiment Analysis and Opinion Mining , 2017, Encyclopedia of Machine Learning and Data Mining.

[32]  Mahmoud Al-Ayyoub,et al.  Automatic Lexicon Construction for Arabic Sentiment Analysis , 2014, 2014 International Conference on Future Internet of Things and Cloud.

[33]  Saif Mohammad,et al.  Sentiment Analysis of Short Informal Texts , 2014, J. Artif. Intell. Res..

[34]  Muhammad Abdul-Mageed,et al.  Subjectivity and Sentiment Analysis of Arabic: A Survey , 2012, AMLTA.

[35]  Claire Cardie,et al.  Annotating Expressions of Opinions and Emotions in Language , 2005, Lang. Resour. Evaluation.

[36]  José Saias,et al.  Sentiue: Target and Aspect based Sentiment Analysis in SemEval-2015 Task 12 , 2015, *SEMEVAL.

[37]  Alexander Clark,et al.  Combining Distributional and Morphological Information for Part of Speech Induction , 2003, EACL.

[38]  Xiaoyan Zhu,et al.  Movie review mining and summarization , 2006, CIKM '06.

[39]  Muhammad Abdul-Mageed,et al.  Subjectivity and Sentiment Annotation of Modern Standard Arabic Newswire , 2011, Linguistic Annotation Workshop.

[40]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[41]  Chong Long,et al.  A Review Selection Approach for Accurate Feature Rating Estimation , 2010, COLING.

[42]  Mahmoud Al-Ayyoub,et al.  Using Enhanced Lexicon-Based Approaches for the Determination of Aspect Categories and Their Polarities in Arabic Reviews , 2016, Int. J. Inf. Technol. Web Eng..

[43]  Rehab Duwairi,et al.  A study of the effects of preprocessing strategies on sentiment analysis for Arabic text , 2014, J. Inf. Sci..

[44]  Janyce Wiebe,et al.  Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2005, HLT.

[45]  Rodrigo Agerri,et al.  EliXa: A Modular and Flexible ABSA Platform , 2015, *SEMEVAL.

[46]  Kyung Hyan Yoo,et al.  What Motivates Consumers to Write Online Travel Reviews? , 2008, J. Inf. Technol. Tour..

[47]  Aqil M. Azmi,et al.  Arabic tweets sentiment analysis – a hybrid scheme , 2016, J. Inf. Sci..

[48]  Zhihua Zhang,et al.  ECNU: Extracting Effective Features from Multiple Sequential Sentences for Target-dependent Sentiment Analysis in Reviews , 2015, *SEMEVAL.

[49]  Luis Alfonso Ureña López,et al.  Bilingual Experiments with an Arabic-English Corpus for Opinion Mining , 2011, RANLP.

[50]  Suresh Manandhar,et al.  SemEval-2015 Task 12: Aspect Based Sentiment Analysis , 2015, *SEMEVAL.

[51]  Mahmoud Al-Ayyoub,et al.  An Aspect-Based Sentiment Analysis Approach to Evaluating Arabic News Affect on Readers , 2016, J. Univers. Comput. Sci..

[52]  Muhammad Abdul-Mageed,et al.  AWATIF: A Multi-Genre Corpus for Modern Standard Arabic Subjectivity and Sentiment Analysis , 2012, LREC.

[53]  Joachim Wagner,et al.  DCU: Aspect-based Polarity Classification for SemEval Task 4 , 2014, *SEMEVAL.

[54]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[55]  Christopher Potts,et al.  Learning Word Vectors for Sentiment Analysis , 2011, ACL.

[56]  Mohamed G. Elfeky,et al.  Mining Arabic Business Reviews , 2010, 2010 IEEE International Conference on Data Mining Workshops.

[57]  Suresh Manandhar,et al.  SemEval-2014 Task 4: Aspect Based Sentiment Analysis , 2014, *SEMEVAL.

[58]  John G. Breslin,et al.  INSIGHT-1 at SemEval-2016 Task 5: Deep Learning for Multilingual Aspect-based Sentiment Analysis , 2016, *SEMEVAL.

[59]  Björn W. Schuller,et al.  New Avenues in Opinion Mining and Sentiment Analysis , 2013, IEEE Intelligent Systems.

[60]  Saif Mohammad,et al.  NRC-Canada-2014: Detecting Aspects and Sentiment in Customer Reviews , 2014, *SEMEVAL.

[61]  Hazem M. Hajj,et al.  Sentence-Level and Document-Level Sentiment Mining for Arabic Texts , 2010, 2010 IEEE International Conference on Data Mining Workshops.

[62]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[63]  Mahmoud Al-Ayyoub,et al.  An enhanced framework for aspect-based sentiment analysis of Hotels' reviews: Arabic reviews case study , 2016, 2016 11th International Conference for Internet Technology and Secured Transactions (ICITST).

[64]  Erik F. Tjong Kim Sang,et al.  Representing Text Chunks , 1999, EACL.

[65]  Luis Alfonso Ureña López,et al.  Combining resources to improve unsupervised sentiment analysis at aspect-level , 2016, J. Inf. Sci..

[66]  Nizar Habash,et al.  MADAMIRA: A Fast, Comprehensive Tool for Morphological Analysis and Disambiguation of Arabic , 2014, LREC.

[67]  Zhiqiang Toh,et al.  DLIREC: Aspect Term Extraction and Term Polarity Classification System , 2014, *SEMEVAL.

[68]  Haris Papageorgiou,et al.  SemEval-2016 Task 5: Aspect Based Sentiment Analysis , 2016, *SEMEVAL.

[69]  Roberto Basili,et al.  UNITOR: Aspect Based Sentiment Analysis with Structured Learning , 2014, *SEMEVAL.

[70]  Nizar Habash,et al.  Introduction to Arabic Natural Language Processing , 2010, Introduction to Arabic Natural Language Processing.

[71]  L. Hamandi,et al.  Classifying sentiment in arabic social networks: Naïve search versus Naïve bayes , 2012, 2012 2nd International Conference on Advances in Computational Tools for Engineering Applications (ACTEA).

[72]  Mahmoud Al-Ayyoub,et al.  Using Aspect-Based Sentiment Analysis to Evaluate Arabic News Affect on Readers , 2015, 2015 IEEE/ACM 8th International Conference on Utility and Cloud Computing (UCC).

[73]  Mahmoud Al-Ayyoub,et al.  Framework for affective news analysis of Arabic news: 2014 Gaza attacks case study , 2016, 2016 7th International Conference on Information and Communication Systems (ICICS).