Sentiment Classification through Combining Classifiers with Multiple Feature Sets

Sentiment classification aims at assigning a document to a predefined category according to the polarity of its subjective information (e.g. 'thumbs up' or 'thumbs down'). In this paper, we present a classifier combination approach to this task. First, different classifiers are generated through training the review data with different features: unigram and some POS features. Then, classifier selection method is used to select a part of the classifiers for the next-step combination. Finally, these selected classifiers are combined using several combining rules. The experimental results show that all the combination approaches with different combining rules outperform individual classifiers and the sum rule achieves the best performance with an improvement of 2.56% over the best individual classifier.

[1]  Vibhu O. Mittal,et al.  Comparative Experiments on Sentiment Classification for Online Product Reviews , 2006, AAAI.

[2]  Fabio Roli,et al.  A theoretical and experimental analysis of linear combiners for multiple classifier systems , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Jiri Matas,et al.  On Combining Classifiers , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Bogdan Gabrys,et al.  Classifier selection for majority voting , 2005, Inf. Fusion.

[5]  Tong Zhang,et al.  Named Entity Recognition through Classifier Combination , 2003, CoNLL.

[6]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[7]  Alistair Kennedy,et al.  SENTIMENT CLASSIFICATION of MOVIE REVIEWS USING CONTEXTUAL VALENCE SHIFTERS , 2006, Comput. Intell..

[8]  Tetsunori Kobayashi,et al.  Media-integrated biometric person recognition based on the Dempster-Shafer theory , 2002, Object recognition supported by user interaction for service robots.

[9]  Ian H. Witten,et al.  Issues in Stacked Generalization , 2011, J. Artif. Intell. Res..

[10]  Matti Aksela,et al.  Comparison of Classifier Selection Methods for Improving Committee Performance , 2003, Multiple Classifier Systems.

[11]  Dan Klein,et al.  Combining Heterogeneous Classifiers for Word-Sense Disambiguation , 2001, *SEMEVAL.

[12]  Siddharth Patwardhan,et al.  Feature Subsumption for Opinion Analysis , 2006, EMNLP.

[13]  Anil K. Jain,et al.  Large-scale evaluation of multimodal biometric authentication using state-of-the-art systems , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  W. Bruce Croft,et al.  Combining classifiers in text categorization , 1996, SIGIR '96.

[15]  Robert E. Schapire,et al.  A Brief Introduction to Boosting , 1999, IJCAI.

[16]  Wei Tang,et al.  Ensembling neural networks: Many could be better than all , 2002, Artif. Intell..

[17]  Eduard Hovy,et al.  Identifying Opinion Holders for Question Answering in Opinion Texts , 2005 .

[18]  Hsin-Hsi Chen,et al.  Opinion Extraction, Summarization and Tracking in News and Blog Corpora , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[19]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[20]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.