Product Related Information Sentiment-Content Analysis Based on Convolutional Neural Networks for the Chinese Micro-Blog

Sentiment analysis is a hot topic in natural language processing. People usually concern the sentiments in a specific domain. By focusing on specific domain such as micro-blogs of a specific product, we can discover the sentiment distribution of customs about this product. Unlike traditional sentiment analysis which divides the sentiments into positive/negative and neutral categories, we divided the product related information into three types: positive/negative about the product and product advertisements. The reason for this kind division is that ordinary positive category in sentiment analysis may contains product advertisements which do not reflect custom's opinions. This phenomenon is very common in micro-blog where there are lots of product promotions and propagandas. In this paper, we propose a sentiment analysis method based on Convolutional Neural Network (CNN). The proposed CNN model combines Chinese word-level embedding and Chinese character-level embedding as input features of the sentiment analysis. In the experiment, Chinese micro-blog datasets of weight loss products and mobile phones are used to show that our sentiment analysis method has better performance than the lexicon based method.

[1]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[2]  Lijun Wang,et al.  Improved KNN classification algorithms research in text categorization , 2012, 2012 2nd International Conference on Consumer Electronics, Communications and Networks (CECNet).

[3]  Tong Zhang,et al.  Semi-supervised Convolutional Neural Networks for Text Categorization via Region Embedding , 2015, NIPS.

[4]  Ming Zhou,et al.  Building Large-Scale Twitter-Specific Sentiment Lexicon : A Representation Learning Approach , 2014, COLING.

[5]  Peng Wang,et al.  Semantic Clustering and Convolutional Neural Network for Short Text Categorization , 2015, ACL.

[6]  Yelong Shen,et al.  A Latent Semantic Model with Convolutional-Pooling Structure for Information Retrieval , 2014, CIKM.

[7]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[8]  Huang He,et al.  Sentiment analysis of Sina Weibo based on semantic sentiment space model , 2013, 2013 International Conference on Management Science and Engineering 20th Annual Conference Proceedings.

[9]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Yong Wang,et al.  Design of Text Categorization System Based on SVM , 2012 .

[11]  Cícero Nogueira dos Santos,et al.  Deep Convolutional Neural Networks for Sentiment Analysis of Short Texts , 2014, COLING.

[12]  Phil Blunsom,et al.  A Convolutional Neural Network for Modelling Sentences , 2014, ACL.

[13]  Jordán Pascual Espada,et al.  Machine learning approach for text and document mining , 2014, ArXiv.

[14]  Tao Chen,et al.  Combining Convolutional Neural Network and Support Vector Machine for Sentiment Classification , 2015, SMP.

[15]  Yelong Shen,et al.  Learning semantic representations using convolutional neural networks for web search , 2014, WWW.

[16]  Gjorgji Madjarov,et al.  Twitter Sentiment Analysis Using Deep Convolutional Neural Network , 2015, HAIS.

[17]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[18]  Myong Kee Jeong,et al.  Support vector-based feature selection using Fisher's linear discriminant and Support Vector Machine , 2010, Expert Syst. Appl..