Improving Sentiment Classification Using Feature Highlighting and Feature Bagging

Sentiment classification is an important data mining task. Previous researches tried various machine learning techniques while didn't make fully use of the difference among features. This paper proposes a novel method for improving sentiment classification by fully exploring the different contribution of features. The method consists of two parts. First, we highlight sentimental features by increasing their weight. Second, we use bagging to construct multiple classifiers on different feature spaces and combine them into an aggregating classifier. Extensive experiments show that the method can evidently improve the performance of sentiment classification.

[1]  Bing Liu,et al.  Mining Opinion Features in Customer Reviews , 2004, AAAI.

[2]  Rui Xia,et al.  A POS-based Ensemble Model for Cross-domain Sentiment Classification , 2011, IJCNLP.

[3]  Claire Cardie,et al.  Multi-Level Structured Models for Document-Level Sentiment Classification , 2010, EMNLP.

[4]  Soo-Min Kim,et al.  Determining the Sentiment of Opinions , 2004, COLING.

[5]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[6]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[7]  Rui Xia,et al.  Ensemble of feature sets and classification algorithms for sentiment classification , 2011, Inf. Sci..

[8]  Rui Xia,et al.  Exploring the Use of Word Relation Features for Sentiment Classification , 2010, COLING.

[9]  Xia Wang,et al.  Sentiment Classification through Combining Classifiers with Multiple Feature Sets , 2007, 2007 International Conference on Natural Language Processing and Knowledge Engineering.

[10]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[11]  Yoav Freund,et al.  Boosting a weak learning algorithm by majority , 1990, COLT '90.

[12]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[13]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[14]  Chris H. Q. Ding,et al.  Knowledge transformation for cross-domain sentiment classification , 2009, SIGIR.

[15]  Prem Melville,et al.  Sentiment analysis of blogs by combining lexical knowledge with text classification , 2009, KDD.

[16]  Bo Pang,et al.  Using Very Simple Statistics for Review Search: An Exploration , 2008, COLING.

[17]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[18]  R. Schapire The Strength of Weak Learnability , 1990, Machine Learning.

[19]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[20]  Kazutaka Shimada,et al.  Movie Review Classification Based on a Multiple Classifier , 2007, PACLIC.