Political Arabic Articles Orientation Using Rough Set Theory With Sentiment Lexicon

Sentiment analysis is an emerging research field that can be integrated with other domains, including data mining, natural language processing and machine learning. For political articles, it is difficult to understand and summarize the current state of opinion or the overall views because of the diversity and volume of information on social media. A number of sentiment analysis studies have been conducted, especially on English texts, while the Arabic language has received less attention in the literature. In this study, we propose a model for detecting the orientation of political articles in the Arabic language. We introduce the key assumptions of the model, present and discuss the obtained results, and highlight the issues that still need to be explored to further our understanding of subjective sentences. The main purpose of applying this new approach, which is based on Rough Set (RS) theory, is to increase the accuracy with which models recognize the orientation of the articles. We present extensive simulation results, which demonstrate the superiority of the proposed model over other algorithms. The performance of the proposed approach is shown to improve significantly when discriminating features are added. To summarize, the proposed approach achieves an accuracy of 85.483% when evaluating the orientation of political Arabic datasets, compared to 72.58% and 64.516% for the Support Vector Machines and Naïve Bayes methods, respectively.
