论文信息 - Experiments of Opinion Analysis on the Corpora MPQA and NTCIR-6

Experiments of Opinion Analysis on the Corpora MPQA and NTCIR-6

This paper describes the algorithms and linguistic features used in our participating system for the opinion analysis pilot task at NTCIR-6. It presents and discusses the results of our system on the opinion analysis task. It also presents our experiments of opinion analysis on the two corpora MPQA and NTCIR-6, by using our learning based system. Our system was base on the SVM learning. It achieved state of the art results on the MPQA corpus for the two problems, opinionated sentence recognition and opinion holder extraction. The results using the NTCIR-6 English corpus for both training and testing are certainly among the first ones. Our results on the opinionated sentence recognition sub-task of the NTCIR-6 were encouraging. The results on the English evaluation of the NTCIR-6 opinion analysis task were obtained from the models learned from the MPQA corpus. The lower results on the NTCIR-6 opinion holder extraction subtask, in comparison with those using each corpus for both training and testing, may possibly show that there exist substantial differences between the MPQA corpus and the NTCIR-6 English corpus.

Kalina Bontcheva | Hamish Cunningham | Yaoyong Li

[1] Janyce Wiebe,et al. Learning Subjective Language , 2004, CL.

[2] Siddharth Patwardhan,et al. Feature Subsumption for Opinion Analysis , 2006, EMNLP.

[3] Kalina Bontcheva,et al. Perceptron Learning for Chinese Word Segmentation , 2005, SIGHAN@IJCNLP 2005.

[4] Hsin-Hsi Chen,et al. Opinion Extraction, Summarization and Tracking in News and Blog Corpora , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[5] Ellen Riloff,et al. Learning subjective nouns using extraction pattern bootstrapping , 2003, CoNLL.

[6] Claire Cardie,et al. Identifying Sources of Opinions with Conditional Random Fields and Extraction Patterns , 2005, HLT.

[7] Claire Cardie,et al. Annotating Expressions of Opinions and Emotions in Language , 2005, Lang. Resour. Evaluation.

[8] Wei-Hao Lin,et al. Which Side are You on? Identifying Perspectives at the Document and Sentence Levels , 2006, CoNLL.

[9] Kalina Bontcheva,et al. SVM Based Learning System for Information Extraction , 2004, Deterministic and Statistical Methods in Machine Learning.

[10] D. Litman,et al. NRRC Summer Workshop on Multiple-Perspective Question Answering Final Report , 2002 .

[11] John Shawe-Taylor,et al. The SVM With Uneven Margins and Chinese Document Categorization , 2003, PACLIC.