Combining lexicon and learning based approaches for concept-level sentiment analysis

In this paper, we present the anatomy of pSenti --- a concept-level sentiment analysis system that seamlessly integrates into opinion mining lexicon-based and learning-based approaches. Compared with pure lexicon-based systems, it achieves significantly higher accuracy in sentiment polarity classification as well as sentiment strength detection. Compared with pure learning-based systems, it offers more structured and readable results with aspect-oriented explanation and justification, while being less sensitive to the writing style of text. Our extensive experiments on two real-world datasets (CNET software reviews and IMDB movie reviews) confirm the superiority of the proposed hybrid approach over state-of-the-art systems like SentiStrength.

[1]  Alice H. Oh,et al.  Aspect and sentiment unification model for online review analysis , 2011, WSDM '11.

[2]  Björn W. Schuller,et al.  Learning and Knowledge-Based Sentiment Analysis in Movie Review Key Excerpts , 2010, COST 2102 Training School.

[3]  Sabine Bergler,et al.  When Specialists and Generalists Work Together: Overcoming Domain Dependence in Sentiment Tagging , 2008, ACL.

[4]  Shlomo Argamon,et al.  Stylistic text classification using functional lexical features: Research Articles , 2007 .

[5]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[6]  Kathleen R. McKeown,et al.  Predicting the semantic orientation of adjectives , 1997 .

[7]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[8]  Jonathon Read,et al.  Using Emoticons to Reduce Dependency in Machine Learning Techniques for Sentiment Classification , 2005, ACL.

[9]  Mike Thelwall,et al.  Sentiment in short strength detection informal text , 2010 .

[10]  Philip S. Yu,et al.  A holistic lexicon-based approach to opinion mining , 2008, WSDM '08.

[11]  Erik Cambria,et al.  Sentic Computing: Techniques, Tools, and Applications , 2012 .

[12]  Kristian J. Hammond,et al.  Domain Specific Affective Classification of Documents , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[13]  Yulan He,et al.  Incorporating Sentiment Prior Knowledge for Weakly Supervised Sentiment Analysis , 2012, TALIP.

[14]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[15]  Ellen Riloff,et al.  Learning Extraction Patterns for Subjective Expressions , 2003, EMNLP.

[16]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[17]  Bing Liu,et al.  Opinion Feature Extraction Using Class Sequential Rules , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[18]  Yulan He,et al.  Joint sentiment/topic model for sentiment analysis , 2009, CIKM.

[19]  Mike Thelwall,et al.  Sentiment strength detection for the social web , 2012, J. Assoc. Inf. Sci. Technol..

[20]  E. Cambria,et al.  Sentic Computing , 2015, Cognitive Computation.

[21]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[22]  Erik Cambria,et al.  Sentic Computing: Exploitation of Common Sense for the Development of Emotion-Sensitive Systems , 2009, COST 2102 Training School.

[23]  Erik Cambria,et al.  Sentic Computing for social media marketing , 2012, Multimedia Tools and Applications.

[24]  Hong Yu,et al.  Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying the Polarity of Opinion Sentences , 2003, EMNLP.