On the Feature Selection and Classification Based on Information Gain for Document Sentiment Analysis

Sentiment analysis in a movie review is the needs of today lifestyle. Unfortunately, enormous features make the sentiment of analysis slow and less sensitive. Finding the optimum feature selection and classification is still a challenge. In order to handle an enormous number of features and provide better sentiment classification, an information-based feature selection and classification are proposed. The proposed method reduces more than 90% unnecessary features while the proposed classification scheme achieves 96% accuracy of sentiment classification. From the experimental results, it can be concluded that the combination of proposed feature selection and classification achieves the best performance so far.

[1]  Sotiris Kotsiantis,et al.  Text Classification Using Machine Learning Techniques , 2005 .

[2]  Hsinchun Chen,et al.  Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums , 2008, TOIS.

[3]  Erik Cambria,et al.  Affective Computing and Sentiment Analysis , 2016, IEEE Intelligent Systems.

[4]  Namita Mittal,et al.  Prominent Feature Extraction for Sentiment Analysis , 2015, Socio-Affective Computing.

[5]  Ruli Manurung,et al.  Machine Learning-based Sentiment Analysis of Automatic Indonesian Translations of English Movie Reviews , 2008 .

[6]  Timothy O'Keefe Feature Selection and Weighting Methods in Sentiment Analysis , 2009 .

[7]  Roberto Battiti,et al.  Using mutual information for selecting features in supervised neural net learning , 1994, IEEE Trans. Neural Networks.

[8]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[9]  Namita Mittal,et al.  Text Classification Using Machine Learning Methods-A Survey , 2012, SocProS.

[10]  Lina Zhou,et al.  Movie Review Mining: a Comparison between Supervised and Unsupervised Classification Approaches , 2005, Proceedings of the 38th Annual Hawaii International Conference on System Sciences.

[11]  Nasser Yazdani,et al.  Mutual information-based feature selection for intrusion detection systems , 2011, J. Netw. Comput. Appl..

[12]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[13]  Cícero Nogueira dos Santos,et al.  Deep Convolutional Neural Networks for Sentiment Analysis of Short Texts , 2014, COLING.

[14]  Mohammed Al Achhab,et al.  Comparison of Feature Selection Methods for Sentiment Analysis , 2018, BDCA.