论文信息 - An empirical study of unsupervised sentiment classification of Chinese reviews

An empirical study of unsupervised sentiment classification of Chinese reviews

Abstract This paper is an empirical study of unsupervised sentiment classification of Chinese reviews. The focus is on exploring the ways to improve the performance of the unsupervised sentiment classification based on limited existing sentiment resources in Chinese. On the one hand, all available Chinese sentiment lexicons — individual and combined — are evaluated under our proposed framework. On the other hand, the domain dependent sentiment noise words are identified and removed using unlabeled data, to improve the classification performance. To the best of our knowledge, this is the first such attempt. Experiments have been conducted on three open datasets in two domains, and the results show that the proposed algorithm for sentiment noise words removal can improve the classification performance significantly.

Hua Xu | Zhongwu Zhai | Peifa Jia

[1] Bo Pang,et al. A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[2] Philip S. Yu,et al. A holistic lexicon-based approach to opinion mining , 2008, WSDM '08.

[3] Lillian Lee,et al. Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[4] Christopher J. Fox,et al. A stop list for general text , 1989, SIGF.

[5] W. John Wilbur,et al. The automatic identification of stop words , 1992, J. Inf. Sci..

[6] Bing Liu,et al. Sentiment Analysis and Subjectivity , 2010, Handbook of Natural Language Processing.

[7] Soo-Min Kim,et al. Determining the Sentiment of Opinions , 2004, COLING.

[8] Bing Liu,et al. Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data , 2006, Data-Centric Systems and Applications.

[9] Hua Xu,et al. Sentiment classification for Chinese reviews based on key substring features , 2009, 2009 International Conference on Natural Language Processing and Knowledge Engineering.

[10] Bing Liu,et al. Mining Opinion Features in Customer Reviews , 2004, AAAI.

[11] Alistair Kennedy,et al. SENTIMENT CLASSIFICATION of MOVIE REVIEWS USING CONTEXTUAL VALENCE SHIFTERS , 2006, Comput. Intell..

[12] Hua Xu,et al. Feature Subsumption for Sentiment Classification in Multiple Languages , 2010, PAKDD.

[13] Bo Pang,et al. Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.