Rough Set Theory for Arabic Sentiment Classification

Recently, the web has been a major place where people interact and express their views and sentiments. Researchers were attracted to conduct further analysis on this rich content known as Sentiment Analysis, Sentiment Classification or Opinion Mining. Rough Set Theory is a mathematical tool that can be used for classification and analysis of uncertain, incomplete or vague information. It can be used to significantly reduce the dimensionality of the data without much loss in information content, which is achieved using the concept of Reduct. This paper focuses on investigating the use of the Rough Set theory approach for Arabic Sentiment Classification. This paper compares some approaches that have been proposed to find Reducts to classify Arabic tweeting reviews. The Rosetta toolkit is used for testing where two main Reduct approaches were applied: Johnson Reducer and Genetic-based reducer. We compared the results of the approaches using cross validation evaluation method. The results showed that Genetic reducer achieved 57% of accuracy, which outperformed Johnson Reducer. The paper concludes that the Rough Set based approach is applicable for sentiment analysis of Arabic text but further investigation is required to evaluate other Reduct generation methods.

[1]  Hamidah Ibrahim,et al.  Approximate reduct computation by rough sets based attribute weighting , 2005, 2005 IEEE International Conference on Granular Computing.

[2]  Lina Zhou,et al.  Movie Review Mining: a Comparison between Supervised and Unsupervised Classification Approaches , 2005, Proceedings of the 38th Annual Hawaii International Conference on System Sciences.

[3]  Qiang Shen,et al.  Rough set-aided keyword reduction for text categorization , 2001, Appl. Artif. Intell..

[4]  A. Shoukry,et al.  Sentence-level Arabic sentiment analysis , 2012, 2012 International Conference on Collaboration Technologies and Systems (CTS).

[5]  Qiang Shen,et al.  A modular approach to generating fuzzy rules with reduced attributes for the monitoring of complex systems , 2000 .

[6]  Aqil M. Azmi,et al.  Aara’– a system for mining the polarity of Saudi public opinion through e-newspaper comments , 2014, J. Inf. Sci..

[7]  Namita Mittal,et al.  Sentiment Classification using Rough Set based Hybrid Feature Selection , 2013, WASSA@NAACL-HLT.

[8]  Janusz Zalewski,et al.  Rough sets: Theoretical aspects of reasoning about data , 1996 .

[9]  Rehab Duwairi,et al.  A study of the effects of preprocessing strategies on sentiment analysis for Arabic text , 2014, J. Inf. Sci..

[10]  Saifee Vohra,et al.  A COMPARATIVE STUDY OF SENTIMENT ANALYSIS TECHNIQUES 1 , 2013 .

[11]  Thangavel,et al.  Unsupervised Quick Reduct Algorithm Using Rough Set Theory , 2011 .

[12]  Qiang Shen,et al.  A Rough Set-Aided System for Sorting WWW Bookmarks , 2001, Web Intelligence.

[13]  Bing Liu,et al.  Sentiment Analysis and Opinion Mining , 2012, Synthesis Lectures on Human Language Technologies.

[14]  Shubhamoy Dey,et al.  Performance Investigation of Feature Selection Methods and Sentiment Lexicons for Sentiment Analysis , 2012 .

[15]  B. Anitha,et al.  Sentiment Classification Approaches – A Review , 2013 .

[16]  Ashwin Kothari,et al.  FPGA Implementation of a Reduct Generation Algorithm based on Rough Set Theory , 2013 .

[17]  Andrzej Skowron,et al.  Chapter 19 the Design and Implementation of a Knowledge Discovery Toolkit Based on Rough Sets { the Rosetta System , 1998 .

[18]  Naomie Salim,et al.  Opinion analysis for twitter and arabic tweets: a systematic literature review , 2013 .

[19]  Namita Mittal,et al.  Optimal Feature Selection for Sentiment Analysis , 2013, CICLing.

[20]  Luis Alfonso Ureña López,et al.  OCA: Opinion corpus for Arabic , 2011, J. Assoc. Inf. Sci. Technol..

[21]  Hesham Arafat,et al.  Different Feature Selection for Sentiment Classification , 2014 .

[22]  S. R. El-Beltagy,et al.  Open issues in the sentiment analysis of Arabic social media: A case study , 2013, 2013 9th International Conference on Innovations in Information Technology (IIT).

[23]  Jyoti Rough Set Theory and Its Applications , 2013 .

[24]  P.A. Crossley,et al.  Application of Genetic Algorithm and Rough Set Theory for Knowledge Extraction , 2007, 2007 IEEE Lausanne Power Tech.

[25]  Alaa M. El-Halees,et al.  Arabic Opinion Mining Using Combined Classification Approach , 2011 .

[26]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .