Creating a Fine-Grained Corpus for Chinese Sentiment Analysis

Writing comments on products or news has become a popular activity in social media. The amount of opinionated text available online has been growing rapidly, increasing the need for techniques that can analyze opinions expressed in such text so that reviews can be easily absorbed by users. To date, most techniques depend on annotated corpora. However, existing corpora are almost sentence-level works that ignore important global sentiment information in other sentences. Given the rise of advanced applications, more fine-grained corpora are needed, even at the sentence level. The authors aim to create a fine-grained corpus for Chinese sentiment analysis, and more importantly, explore new sentiment analysis tasks by analyzing the annotated corpus. The proposed fine-grained annotation scheme not only introduces cross-sentence and global sentiment information (such as "target entity"') but also includes new sentence-level elements (such as "implicit aspect"). Based on this scheme, this corpus can provide a more fine-grained platform for researchers to study algorithms for advanced applications. In addition, an in-depth analysis on the annotated corpus is made and several important but ignored tasks, such as the target-aspect pair extraction task, are explored, which can give useful hints about future directions.

[1]  Iryna Gurevych,et al.  Sentence and Expression Level Annotation of Opinions in User-Generated Discourse , 2010, ACL.

[2]  Diana Inkpen,et al.  uOttawa: System description for SemEval 2013 Task 2 Sentiment Analysis in Twitter , 2013, *SEMEVAL.

[3]  Lei Zhang,et al.  Entity discovery and assignment for opinion mining applications , 2009, KDD.

[4]  Chun Chen,et al.  Opinion Word Expansion and Target Extraction through Double Propagation , 2011, CL.

[5]  Björn W. Schuller,et al.  New Avenues in Opinion Mining and Sentiment Analysis , 2013, IEEE Intelligent Systems.

[6]  Christian S. Jensen,et al.  The use of categorization information in language models for question retrieval , 2009, CIKM.

[7]  Meng Wang,et al.  Domain-Assisted Product Aspect Hierarchy Generation: Towards Hierarchical Organization of Unstructured Consumer Reviews , 2011, EMNLP.

[8]  Navneet Kaur,et al.  Opinion mining and sentiment analysis , 2016, 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom).

[9]  Lori Lamel,et al.  Annotation and Detection of Emotion in a Task-oriented Human-Human Dialog Corpus , 2007 .

[10]  Theresa Wilson Fine-grained subjectivity and sentiment analysis: recognizing the intensity, polarity, and attitudes of private states , 2008 .

[11]  Lei Zhang,et al.  Sentiment Analysis and Opinion Mining , 2017, Encyclopedia of Machine Learning and Data Mining.

[12]  Arjun Mukherjee,et al.  Aspect Extraction through Semi-Supervised Modeling , 2012, ACL.

[13]  Cristina Bosco,et al.  Developing Corpora for Sentiment Analysis: The Case of Irony and Senti-TUT , 2013, IEEE Intelligent Systems.

[14]  Xiaoyan Zhu,et al.  Movie review mining and summarization , 2006, CIKM '06.

[15]  Guodong Zhou,et al.  Opinion Target Extraction Using a Shallow Semantic Parsing Framework , 2012, AAAI.

[16]  Xinying Xu,et al.  Hidden sentiment association in chinese web opinion mining , 2008, WWW.

[17]  Bing Qin,et al.  Integrating Intra-and Inter-document Evidences for Improving Sentence Sentiment Classification , 2010 .

[18]  Jon Atle Gulla,et al.  Sentiment Learning on Product Reviews via Sentiment Ontology Tree , 2010, ACL.

[19]  Bing Qin,et al.  Integrating Intra- and Inter-document Evidences for Improving Sentence Sentiment Classification: Integrating Intra- and Inter-document Evidences for Improving Sentence Sentiment Classification , 2010 .

[20]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[21]  Shlomo Argamon,et al.  Extracting Appraisal Expressions , 2007, NAACL.

[22]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[23]  Hong Yu,et al.  Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying the Polarity of Opinion Sentences , 2003, EMNLP.