Constructing a Chinese Conversation Corpus for Sentiment Analysis

Sentiment analysis plays an important role in many applications. This paper introduces our ongoing work related to the sentiment analysis on Chinese conversation. The main purpose is to construct a Chinese conversation corpus for sentiment analysis and provide a benchmark result on this corpus. To explore the effectiveness of machine learning based approaches for sentiment analysis on Chinese conversation, we firstly collected conversational data from some online English learning websites and our instant messages, and manually annotated it with three sentiment polarities and 22 fine-grained emotion classes. Then we applied multiple representative classification methods to evaluate the corpus. The evaluation results provide good suggestions for the future research. And we will release the corpus with gold standards publicly for research purposes.

[1]  Zhiyuan Liu,et al.  Neural Sentiment Classification with User and Product Attention , 2016, EMNLP.

[2]  Phil Blunsom,et al.  A Convolutional Neural Network for Modelling Sentences , 2014, ACL.

[3]  Lei Zhang,et al.  Sentiment Classification with Convolutional Neural Networks: An Experimental Study on a Large-Scale Chinese Conversation Corpus , 2016, 2016 12th International Conference on Computational Intelligence and Security (CIS).

[4]  Diyi Yang,et al.  Hierarchical Attention Networks for Document Classification , 2016, NAACL.

[5]  Erik Cambria,et al.  A Review of Sentiment Analysis Research in Chinese Language , 2017, Cognitive Computation.

[6]  Ting Liu,et al.  Document Modeling with Gated Recurrent Neural Network for Sentiment Classification , 2015, EMNLP.

[7]  Christopher Potts,et al.  Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank , 2013, EMNLP.

[8]  Ting Liu,et al.  Creating a Fine-Grained Corpus for Chinese Sentiment Analysis , 2015, IEEE Intelligent Systems.

[9]  Changqin Quan,et al.  Construction of a Blog Emotion Corpus for Chinese Emotional Expression Analysis , 2009, EMNLP.

[10]  Xiang Zhang,et al.  Character-level Convolutional Networks for Text Classification , 2015, NIPS.

[11]  Changliang Li,et al.  Compositional Recurrent Neural Networks for Chinese Short Text Classification , 2016, 2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI).

[12]  Jie Cao,et al.  Hybrid Attention Networks for Chinese Short Text Classification , 2017, Computación y Sistemas.

[13]  Liheng Xu,et al.  Overview of Topic-based Chinese Message Polarity Classification in SIGHAN 2015 , 2015, SIGHAN@IJCNLP.

[14]  Jun Zhao,et al.  Recurrent Convolutional Neural Networks for Text Classification , 2015, AAAI.

[15]  Tomas Mikolov,et al.  Bag of Tricks for Efficient Text Classification , 2016, EACL.

[16]  Ramlan Mahmod,et al.  A systematic literature review for topic detection in chat conversation for cyber-­crime investigation , 2014 .

[17]  Bo Xu,et al.  Recursive Deep Learning for Sentiment Analysis over Social Data , 2014, 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT).

[18]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[19]  Ron Sun,et al.  Anatomy of the Mind: a Quick Overview , 2017, Cognitive Computation.

[20]  Bo Xu,et al.  Hierarchical Hybrid Attention Networks for Chinese Conversation Topic Classification , 2017, ICONIP.