Polarity reinforcement: Sentiment polarity identification by means of social semantics

We propose a combination of machine learning and socially constructed concepts for the task of sentiment polarity identification. Detecting words with polarity is difficult not only due to limitations in current sentiment dictionaries but also due to the colloquial terms that are often used. Current approaches disregard the dynamics of language, i.e. that new words are often created comprising different polarities. In fact, the online community is very creative in coining terms about certain subjects such as “tweetup” (a request by a user to meet with friends via Twitter) or “whack” (Street slang, meaning bad). Our approach utilizes a user generated dictionary of urban term definitions as a resource for polarity concepts. Therefore, we are not only able to map newly created words to their respective polarity but also enhance common expressions with additional features and reinforce the polarity, strengthening our initial finding. We empirically show that the use of polarity reinforcement improves the sentiment classification.

[1]  Evgeniy Gabrilovich,et al.  Feature Generation for Text Categorization Using World Knowledge , 2005, IJCAI.

[2]  Andrea Esuli,et al.  SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining , 2006, LREC.

[3]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[4]  Takashi Inui,et al.  Extracting Semantic Orientations of Words using Spin Model , 2005, ACL.

[5]  Stefan Kopp,et al.  The virtual human Max – Modeling embodied conversation , 2006 .

[6]  Grzegorz Kondrak,et al.  A Comparison of Sentiment Analysis Techniques: Polarizing Movie Blogs , 2008, Canadian Conference on AI.

[7]  Ipke Wachsmuth,et al.  Affective computing with primary and secondary emotions in a virtual human , 2009, Autonomous Agents and Multi-Agent Systems.

[8]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[9]  Janyce Wiebe,et al.  Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2005, HLT.

[10]  Bo Pang,et al.  Seeing Stars: Exploiting Class Relationships for Sentiment Categorization with Respect to Rating Scales , 2005, ACL.

[11]  Stefan Kopp,et al.  Why emotions should be integrated into conversational agents , 2007 .

[12]  Alexander Mehler,et al.  A Short Note on Social-Semiotic Networks from the Point of View of Quantitative Semantics , 2008, Social Web Communities.

[13]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[14]  Nigel Collier,et al.  Sentiment Analysis using Support Vector Machines with Diverse Information Sources , 2004, EMNLP.

[15]  Alexander Mehler,et al.  Social Semantics And Its Evaluation By Means of Closed Topic Models: An SVM-Classification Approach Using Semantic Feature Replacement By Topic Generalization , 2009 .

[16]  Kathleen R. McKeown,et al.  Predicting the semantic orientation of adjectives , 1997 .

[17]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[18]  W. B. Cavnar,et al.  N-gram-based text categorization , 1994 .

[19]  Jin Zhang,et al.  An empirical study of sentiment analysis for chinese documents , 2008, Expert Syst. Appl..

[20]  Evgeniy Gabrilovich,et al.  Overcoming the Brittleness Bottleneck using Wikipedia: Enhancing Text Categorization with Encyclopedic Knowledge , 2006, AAAI.

[21]  Hsinchun Chen,et al.  Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums , 2008, TOIS.

[22]  M. de Rijke,et al.  UvA-DARE ( Digital Academic Repository ) Using WordNet to measure semantic orientations of adjectives , 2004 .

[23]  Thorsten Joachims,et al.  Learning to classify text using support vector machines - methods, theory and algorithms , 2002, The Kluwer international series in engineering and computer science.

[24]  Carlo Strapparava,et al.  WordNet Affect: an Affective Extension of WordNet , 2004, LREC.

[25]  David M. Pennock,et al.  Mining the peanut gallery: opinion extraction and semantic classification of product reviews , 2003, WWW '03.

[26]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[27]  Alexander Mehler,et al.  Social Semantics and Its Evaluation by Means of Semantic Relatedness and Open Topic Models , 2009, 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology.

[28]  J. Fellous,et al.  Emotions: from brain to robot , 2004, Trends in Cognitive Sciences.

[29]  Stephan Bloehdorn,et al.  Boosting for Text Classification with Semantic Features , 2004, WebKDD.

[30]  Khurshid Ahmad,et al.  Sentiment Polarity Identification in Financial News: A Cohesion-based Approach , 2007, ACL.

[31]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[32]  C. Fellbaum An Electronic Lexical Database , 1998 .

[33]  Alexander Mehler,et al.  The Feature Difference Coefficient: Classification by Means of Feature Distributions , 2009 .