Combining Convolutional Neural Networks and Word Sentiment Sequence Features for Chinese Text Sentiment Classification

Combining Convolutional Neural Networks and Word Sentiment Sequence Features for Chinese Text Sentiment Classification Zhao Chen1, Ruifeng Xu1, Lin Gui1, Qin Lu2 (1. School of Computer Science and Technology, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, Guangdong, 518000, China; 2. Depart of Computing, The Hong Kong Polytechnic University, Hong Kong, China) Abstract: Recently, the classification approach based on word embedding and convolutional neural networks achieved good results in sentiment classification task. This approach is mainly based on the contextual features of the word embeddings without the consideration of the polarity of the words. Meanwhile, this approach lacks of the use of manually compiled sentiment lexicon resources. Target to these problems, this paper proposes a novel sentiment classification method which incorporates existing sentiment lexicon and convolution neural networks. In this word, the words in text are abstractly represented by using existing sentiment words. The convolutional neural networks are used to extract sequence features from the abstracted word embeddings. Finally, the sequence features are applied to sentiment classification. The evaluations on Chinese Opinion Analysis Evaluation 2014 dataset show that our proposed approach outperforms the convolutional neural networks model with word embedding features and Naïve Bayes Support Vector Machines.

[1]  Christopher D. Manning,et al.  Baselines and Bigrams: Simple, Good Sentiment and Topic Classification , 2012, ACL.

[2]  Jin Zhang,et al.  An empirical study of sentiment analysis for chinese documents , 2008, Expert Syst. Appl..

[3]  Geoffrey E. Hinton,et al.  A Scalable Hierarchical Distributed Language Model , 2008, NIPS.

[4]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[5]  Jeffrey Pennington,et al.  Semi-Supervised Recursive Autoencoders for Predicting Sentiment Distributions , 2011, EMNLP.

[6]  Tong Zhang,et al.  Effective Use of Word Order for Text Categorization with Convolutional Neural Networks , 2014, NAACL.

[7]  Ming Zhou,et al.  Learning Sentiment-Specific Word Embedding for Twitter Sentiment Classification , 2014, ACL.

[8]  Christopher Potts,et al.  Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank , 2013, EMNLP.

[9]  Danushka Bollegala,et al.  Using Multiple Sources to Construct a Sentiment Sensitive Thesaurus for Cross-Domain Sentiment Classification , 2011, ACL.

[10]  Christopher Potts,et al.  Learning Word Vectors for Sentiment Analysis , 2011, ACL.

[11]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[12]  Kam-Fai Wong,et al.  Coarse-Fine Opinion Mining - WIA in NTCIR-7 MOAT Task , 2008, NTCIR.

[13]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.