Sentix: An Aspect and Domain Sensitive Sentiment Lexicon

Sentiment lexicons have often been used to aid sentiment analysis. Most of these sentiment lexicons are general-purpose lexicons which assign a fixed polarity to every word. However, it has been noted that the polarity of words depends on both the aspect and domain, thus a general-purpose sentiment lexicon would not be able to accurately classify the sentiment of words. This paper proposes a method to automatically construct an aspect and domain sensitive sentiment lexicon which assigns polarity to a word depending on its aspect and domain, and make available Sentix which is an aspect and domain sensitive sentiment lexicon spanning over 200 product domains. Experimental results have shown that our lexicon produces significantly better results compared to other commonly used lexicons. We also observe the long tail distribution behavior of product aspects, and propose the possibility of aspect ranking by comparing the number of domains and number of sentiment words present for an aspect.

[1]  Carlo Strapparava,et al.  WordNet Affect: an Affective Extension of WordNet , 2004, LREC.

[2]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[3]  Hiroshi Kanayama,et al.  Fully Automatic Lexicon Expansion for Domain-oriented Sentiment Analysis , 2006, EMNLP.

[4]  Kam-Fai Wong,et al.  Learning Domain-Specific Sentiment Lexicons for Predicting Product Sales , 2011, 2011 IEEE 8th International Conference on e-Business Engineering.

[5]  Vasileios Hatzivassiloglou,et al.  Predicting the Semantic Orientation of Adjectives , 1997, ACL.

[6]  Meng Wang,et al.  Aspect Ranking: Identifying Important Product Aspects from Online Consumer Reviews , 2011, ACL.

[7]  Michalis Faloutsos,et al.  On power-law relationships of the Internet topology , 1999, SIGCOMM '99.

[8]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[9]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.

[10]  Andrea Esuli,et al.  SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining , 2006, LREC.

[11]  M. de Rijke,et al.  UvA-DARE ( Digital Academic Repository ) Using WordNet to measure semantic orientations of adjectives , 2004 .

[12]  Michael L. Littman,et al.  Measuring praise and criticism: Inference of semantic orientation from association , 2003, TOIS.

[13]  Mitsuru Ishizuka,et al.  SentiFul: Generating a reliable lexicon for sentiment analysis , 2009, 2009 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops.

[14]  Graeme Hirst,et al.  Lexical chains as representations of context for the detection and correction of malapropisms , 1995 .

[15]  Dan I. Moldovan,et al.  Automatic Discovery of Part-Whole Relations , 2006, CL.

[16]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[17]  Suk Hwan Lim,et al.  Extracting and Ranking Product Features in Opinion Documents , 2010, COLING.

[18]  Yue Lu,et al.  Automatic construction of a context-aware sentiment lexicon: an optimization approach , 2011, WWW.

[19]  Soo-Min Kim,et al.  Determining the Sentiment of Opinions , 2004, COLING.

[20]  Janyce Wiebe,et al.  Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2005, HLT.

[21]  Songbo Tan,et al.  Adapting information bottleneck method for automatic construction of domain-oriented sentiment lexicon , 2010, WSDM '10.

[22]  Marshall S. Smith,et al.  The general inquirer: A computer approach to content analysis. , 1967 .

[23]  Claire Cardie,et al.  Adapting a Polarity Lexicon using Integer Linear Programming for Domain-Specific Sentiment Classification , 2009, EMNLP.

[24]  Angela Fahrni,et al.  Old Wine or Warm Beer : Target-Specific Sentiment Analysis of Adjectives , .