Paper Information - A Shallow Approach to Subjectivity Classification

A Shallow Approach to Subjectivity Classification

We present a shallow linguistic approach to subjectivity classification. Using multinomial kernel machines, we demonstrate that a data representation based on counting character n-grams is able to improve on results previously attained on the MPQA corpus using word-based n-grams and syntactic information. We compare two types of string-based representations: key substring groups and character n-grams. We find that word-spanning character n-grams significantly reduce the bias of a classifier, and boost its accuracy.

Copyright © 2008, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
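To make the representation described in the abstract concrete, here is a minimal Python sketch of counting character n-grams that are allowed to span word boundaries, followed by normalization of the counts into a multinomial distribution. The function names (`char_ngrams`, `ngram_distribution`, `geodesic_distance`) are hypothetical and chosen for illustration. The distance function uses the standard geodesic distance on the multinomial simplex, which is one common basis for kernels on multinomial data; it is an assumption here and may not match the exact kernel used in the paper.

```python
import math
from collections import Counter


def char_ngrams(text, n):
    """Return all overlapping character n-grams of `text`.

    Spaces are kept, so n-grams may span word boundaries
    (e.g. "t g" in "not good").
    """
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def ngram_distribution(text, n):
    """Count character n-grams and L1-normalize the counts.

    The result is a dict mapping each n-gram to its relative frequency,
    i.e. a point on the multinomial simplex.
    """
    counts = Counter(char_ngrams(text, n))
    total = sum(counts.values())
    return {gram: count / total for gram, count in counts.items()}


def geodesic_distance(p, q):
    """Geodesic distance between two multinomial distributions.

    Assumption: d(p, q) = 2 * arccos(sum_i sqrt(p_i * q_i)), the standard
    Fisher-information geodesic; not confirmed as the paper's exact choice.
    """
    shared = set(p) & set(q)
    affinity = sum(math.sqrt(p[g] * q[g]) for g in shared)
    # Clamp to guard against floating-point values slightly above 1.
    return 2.0 * math.acos(min(1.0, affinity))


if __name__ == "__main__":
    p = ngram_distribution("not good", 3)
    q = ngram_distribution("not bad", 3)
    print(sorted(p))
    print(geodesic_distance(p, q))
```

In this sketch, the trigrams "ot ", "t g", and " go" are the word-spanning features: they encode how adjacent words join, which a purely word-internal n-gram extractor would miss.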

Wessel Kraaij | Stephan Raaijmakers
