Sentiment analyzer: extracting sentiments about a given topic using natural language processing techniques

We present sentiment analyzer (SA) that extracts sentiment (or opinion) about a subject from online text documents. Instead of classifying the sentiment of an entire document about a subject, SA detects all references to the given subject, and determines sentiment in each of the references using natural language processing (NLP) techniques. Our sentiment analysis consists of 1) a topic specific feature term extraction, 2) sentiment extraction, and 3) (subject, sentiment) association by relationship analysis. SA utilizes two linguistic resources for the analysis: the sentiment lexicon and the sentiment pattern database. The performance of the algorithms was verified on online product review articles ("digital camera" and "music" reviews), and more general documents including general Webpages and news articles.

[1]  Cynthia Whissell,et al.  THE DICTIONARY OF AFFECT IN LANGUAGE , 1989 .

[2]  George A. Miller,et al.  Nouns in WordNet: A Lexical Inheritance System , 1990 .

[3]  Marti A. Hearst Direction-based text interpretation as an information access refinement , 1992 .

[4]  Ted Dunning,et al.  Accurate Methods for the Statistics of Surprise and Coincidence , 1993, CL.

[5]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[6]  Warren Sack,et al.  On the Computation of Point of View , 1994, AAAI.

[7]  Adwait Ratnaparkhi,et al.  A Maximum Entropy Model for Part-Of-Speech Tagging , 1996, EMNLP.

[8]  Vasileios Hatzivassiloglou,et al.  Predicting the Semantic Orientation of Adjectives , 1997, ACL.

[9]  Boris Katz,et al.  From Sentence Processing to Information Access on the World Wide Web , 1997 .

[10]  Loren Terveen,et al.  PHOAKS: a system for sharing recommendations , 1997, CACM.

[11]  L. Rovinelli,et al.  Emotion and Style in 30-Second Television Advertisements Targeted at Men, Women, Boys, and Girls , 1998, Perceptual and motor skills.

[12]  Eugene Charniak,et al.  Finding Parts in Very Large Corpora , 1999, ACL.

[13]  Janyce Wiebe,et al.  Learning Subjective Adjectives from Corpora , 2000, AAAI/IAAI.

[14]  Pero Subasic,et al.  Affect analysis of text using fuzzy semantic typing , 2001, IEEE Trans. Fuzzy Syst..

[15]  John D. Lafferty,et al.  Model-based feedback in the language modeling approach to information retrieval , 2001, CIKM '01.

[16]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[17]  Hang Li,et al.  Mining Open Answers in Questionnaire Data , 2001, IEEE Intell. Syst..

[18]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[19]  Satoshi Morinaga,et al.  Mining product reputations on the Web , 2002, KDD.

[20]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[21]  Yi Zhang,et al.  Exact Maximum Likelihood Estimation for Word Mixtures , 2002 .

[22]  David M. Pennock,et al.  Mining the peanut gallery: opinion extraction and semantic classification of product reviews , 2003, WWW '03.

[23]  Third IEEE International Conference on Data Mining , 2003, Third IEEE International Conference on Data Mining.