论文信息 - Automatic Opinion Polarity Classification of Movie Reviews

Automatic Opinion Polarity Classification of Movie Reviews

One approach to assessing overall opinion polarity (OvOP) of reviews, a concept defined in this paper, is the use of supervised machine learning mechanisms. In this paper, the impact of lexical filtering, applied to reviews, on the accuracy of two statistical classifiers (Naive Bayes and Markov Model) with respect to OvOP identification is observed. Two kinds of lexical filters, one based on hypernymy as provided by WordNet (Fellbaum, 1998), and one hand-crafted filter based on part-of-speech (POS) tags, are evaluated. A ranking criterion based on a function of the probability of having positive or negative polarity is introduced and verified as being capable of achieving 100% accuracy with 10% recall. Movie reviews are used for training and evaluation of each statistical classifier, achieving 80% accuracy.

[1] Beatrice Santorini,et al. Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[2] Kathleen R. McKeown,et al. Predicting the semantic orientation of adjectives , 1997 .

[3] Michael L. Littman,et al. Measuring praise and criticism: Inference of semantic orientation from association , 2003, TOIS.

[4] Bo Pang,et al. Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[5] Janyce Wiebe,et al. Learning Subjective Adjectives from Corpora , 2000, AAAI/IAAI.

[6] Robert M. Losee,et al. Natural language processing in support of decision-making: phrases and part-of-speech tagging , 2001, Inf. Process. Manag..

[7] Janyce Wiebe,et al. Effects of Adjective Orientation and Gradability on Sentence Subjectivity , 2000, COLING.

[8] Eric Brill,et al. Transformation-Based Error-Driven Learning and Natural Language Processing: A Case Study in Part-of-Speech Tagging , 1995, CL.

[9] Peter D. Turney. Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[10] Vasileios Hatzivassiloglou,et al. Predicting the Semantic Orientation of Adjectives , 1997, ACL.

[11] Peter Oram. WordNet: An electronic lexical database. Christiane Fellbaum (Ed.). Cambridge, MA: MIT Press, 1998. Pp. 423. , 2001, Applied Psycholinguistics.

[12] Janyce Wiebe,et al. Development and Use of a Gold-Standard Data Set for Subjectivity Classifications , 1999, ACL.