A context-based algorithm for sentiment analysis

With netizens continuing to express a range of opinions and making assessments online, it has become a challenge to mine sentiments accurately from the ever-multiplying Big Data. We present a context-driven sentiment analysis scheme with the objective of refining the degree of subjectivity during sentiment analysis. The essence of our scheme is to capture in a stable manner, the mutual influence of the sentiments of neighbouring words on the sentiment of each word in a document. A parametric influence function combines the native sentiment score of each word with the context-derived sentiment score obtained from surrounding words. We apply a genetic algorithm to fine tune the parameters of the influence function so as to obtain the best possible accuracy for a given corpus. The experimental results on hotel reviews extracted from Tripadvisor.com show an average accuracy of 73.2% which is 3.6% more than the results obtained from the baseline sentiment analysis approach using native scores obtained from SentiWordNet. Though the improvement is small, it re-affirms our belief that contextual information provides valuable reinforcement of sentiment scores especially with regard to the borderline cases where words show near neutral sentiments. We also present a comparison with alternative sentiment analysis approaches that shows the strength of our proposed context-based sentiment analysis approach.

[1]  Annie Zaenen,et al.  Contextual Valence Shifters , 2006, Computing Attitude and Affect in Text.

[2]  Markus Zanker,et al.  Classification of Customer Reviews based on Sentiment Analysis , 2012, ENTER.

[3]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[4]  Dilek Z. Hakkani-Tür,et al.  Probabilistic model-based sentiment analysis of twitter messages , 2010, 2010 IEEE Spoken Language Technology Workshop.

[5]  Vimla Jethani,et al.  Sentiment Analysis of Movie Reviews , 2017 .

[6]  Jeonghee Yi,et al.  Sentiment analysis: capturing favorability using natural language processing , 2003, K-CAP '03.

[7]  Kathleen R. McKeown,et al.  Predicting the semantic orientation of adjectives , 1997 .

[8]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[9]  Simon Fong,et al.  Opinion mining over twitterspace: Classifying tweets programmatically using the R approach , 2012, Seventh International Conference on Digital Information Management (ICDIM 2012).

[10]  Arno Scharl,et al.  A Context-Dependent Supervised Learning Approach to Sentiment Detection in Large Textual Databases , 2010, J. Inf. Data Manag..

[11]  Stan Matwin,et al.  Hierarchical Classification Approach to Emotion Recognition in Twitter , 2012, 2012 11th International Conference on Machine Learning and Applications.

[12]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[13]  Andrea Esuli,et al.  PageRanking WordNet Synsets: An Application to Opinion Mining , 2007, ACL.

[14]  Andrea Esuli,et al.  SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining , 2006, LREC.

[15]  Michael Gamon,et al.  Sentiment classification on customer feedback data: noisy data, large feature vectors, and the role of linguistic analysis , 2004, COLING.

[16]  Uzay Kaymak,et al.  Determining negation scope and strength in sentiment analysis , 2011, 2011 IEEE International Conference on Systems, Man, and Cybernetics.

[17]  Andrea Esuli,et al.  Determining the semantic orientation of terms through gloss classification , 2005, CIKM '05.