论文信息 - Collective Opinion Target Extraction in Chinese Microblogs

Collective Opinion Target Extraction in Chinese Microblogs

Microblog messages pose severe challenges for current sentiment analysis techniques due to some inherent characteristics such as the length limit and informal writing style. In this paper, we study the problem of extracting opinion targets of Chinese microblog messages. Such fine-grained word-level task has not been well investigated in microblogs yet. We propose an unsupervised label propagation algorithm to address the problem. The opinion targets of all messages in a topic are collectively extracted based on the assumption that similar messages may focus on similar opinion targets. Topics in microblogs are identified by hashtags or using clustering algorithms. Experimental results on Chinese microblogs show the effectiveness of our framework and algorithms.

Xiaojun Wan | Jianguo Xiao | Xinjie Zhou

[1] Peter A. Flach,et al. Evaluation Measures for Multi-class Subgroup Discovery , 2009, ECML/PKDD.

[2] Iryna Gurevych,et al. Extracting Opinion Targets in a Single and Cross-Domain Setting with Conditional Random Fields , 2010, EMNLP.

[3] Jason Baldridge,et al. Twitter Polarity Classification with Label Propagation over Lexical Links and the Follower Graph , 2011, ULNLP@EMNLP.

[4] Dan Klein,et al. Learning Accurate, Compact, and Interpretable Tree Annotation , 2006, ACL.

[5] Bo Pang,et al. Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[6] Junlan Feng,et al. Robust Sentiment Detection on Twitter from Biased and Noisy Data , 2010, COLING.

[7] Richard Johansson,et al. Syntactic and Semantic Structure for Opinion Expression Detection , 2010, CoNLL.

[8] Qiang Yang,et al. Cross-Domain Co-Extraction of Sentiment and Topic Lexicons , 2012, ACL.

[9] Xiaoyan Zhu,et al. Movie review mining and summarization , 2006, CIKM '06.

[10] Johan Bollen,et al. Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[11] Gerard Salton,et al. A vector space model for automatic indexing , 1975, CACM.