Context-specific sentiment lexicon expansion via minimal user interaction

One of the important factors in the performance of sentiment analysis methods is having a comprehensive sentiment lexicon. However, since sentiment words have different polarities not only in different domains, but also in different contexts within the same domain, constructing such context-specific sentiment lexicons is not an easy task. The high costs of manually constructing such lexicons motivate researchers to create automatic methods for finding sentiment words and assigning their polarities. However, existing methods may encounter ambiguous cases with contradictory evidence which are hard to automatically resolve. To address this problem, we aim to engage the user in the process of polarity assignment and improve the quality of the generated lexicon via minimal user effort. A novel visualization is employed to present the results of the automatic algorithm, i.e., the extracted sentiment pairs along with their polarities. User interactions are provided to facilitate the supervision process. The results of our user study demonstrate (1) involving the user in the polarity assignment process improves the quality of the generated lexicon significantly, and (2) participants in the study preferred our visual interface and conveyed that it is easier to use compared to a text-based interface.

[1]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.

[2]  Andrea Esuli,et al.  SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining , 2006, LREC.

[3]  Daniel A. Keim,et al.  Feature-Based Visual Sentiment Analysis of Text Document Streams , 2012, TIST.

[4]  Grzegorz Kondrak,et al.  A Comparison of Sentiment Analysis Techniques: Polarizing Movie Blogs , 2008, Canadian Conference on AI.

[5]  N. Saitou,et al.  The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.

[6]  Jean Véronis,et al.  Visualising a Text with a Tree Cloud , 2009 .

[7]  Philip S. Yu,et al.  A holistic lexicon-based approach to opinion mining , 2008, WSDM '08.

[8]  Vlado Keselj,et al.  Text Similarity Using Google Tri-grams , 2012, Canadian Conference on AI.

[9]  Yue Lu,et al.  Automatic construction of a context-aware sentiment lexicon: an optimization approach , 2011, WWW.

[10]  Chun Chen,et al.  Opinion Word Expansion and Target Extraction through Double Propagation , 2011, CL.

[11]  Maurizio Vichi,et al.  Studies in Classification Data Analysis and knowledge Organization , 2011 .

[12]  Qiang Yang,et al.  Cross-Domain Co-Extraction of Sentiment and Topic Lexicons , 2012, ACL.

[13]  Claire Cardie,et al.  Adapting a Polarity Lexicon using Integer Linear Programming for Domain-Specific Sentiment Classification , 2009, EMNLP.

[14]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[15]  Michelle L. Gregory,et al.  User-directed Sentiment Analysis: Visualizing the Affective Content of Documents , 2006 .

[16]  Yücel Saygin,et al.  An Aspect-Lexicon Creation and Evaluation Tool for Sentiment Analysis Researchers , 2012, ECML/PKDD.

[17]  Delip Rao,et al.  Semi-Supervised Polarity Lexicon Induction , 2009, EACL.