Cross-Domain Sentiment Analysis on Social Media Interactions using Senti-Lexicon based Hybrid Features

Analyzing the sentiment information from the social media interactions is a rapidly growing research area. Several studies in the literature focus on modeling the sentiment information using linguistics, generic word counts and even the contextual information, including the presence of punctuations, elongated words, emoticons, etc. In this paper, we experiment on the effectiveness of lexicon information in combination with other information, for the effective analysis of sentiment in social interactions. The objective of this study is to experimentally verify how senti-lexicons can take part in the process of modeling the sentiment information even in cross-domain sentiment analysis. In general, this paper explores the effectiveness of several feature vectors including the generic Bag of Word (BoW), linguistic (N-Gram and Part-of-Speech (POS)) and the lexicon features (number of positive and negative words). Other than the traditional features we generate hybrid features by combining the lexicon features with the BoW and linguistic features. We conduct the experiments on sentiment classification using supervised models like Linear SVC (L-SVC), Multi-Layer Perceptron (MLP), Multinomial Naïve Bayes (MNB) and Decision Tree (DT). The experiments are conducted on three different types of sentiment document datasets - the Amazon food review dataset, student opinion tweet dataset, and the Large Movie Review Dataset v1.0. We also verify the efficacy of these features in cross-domain sentiment analysis. Experiments show that hybridizing the BoW, linguistic N-Gram and POS method with lexicon features improves the accuracy of sentiment classification even for cross-domain sentiment analysis.

[1]  Marshall S. Smith,et al.  The general inquirer: A computer approach to content analysis. , 1967 .

[2]  Qiang Yang,et al.  Cross-domain sentiment classification via spectral feature alignment , 2010, WWW '10.

[3]  Yanghui Rao,et al.  Contextual Sentiment Topic Model for Adaptive Social Emotion Classification , 2016, IEEE Intelligent Systems.

[4]  Alistair Kennedy,et al.  SENTIMENT CLASSIFICATION of MOVIE REVIEWS USING CONTEXTUAL VALENCE SHIFTERS , 2006, Comput. Intell..

[5]  João Francisco Valiati,et al.  Document-level sentiment classification: An empirical comparison between SVM and ANN , 2013, Expert Syst. Appl..

[6]  Santanu Kumar Rath,et al.  Classification of sentiment reviews using n-gram machine learning approach , 2016, Expert Syst. Appl..

[7]  Ming Zhou,et al.  Learning Sentiment-Specific Word Embedding for Twitter Sentiment Classification , 2014, ACL.

[8]  M. Bradley,et al.  Affective Norms for English Words (ANEW): Instruction Manual and Affective Ratings , 1999 .

[9]  Varun Grover,et al.  Can online product reviews be more helpful? Examining characteristics of information content by product type , 2015, Decis. Support Syst..

[10]  Eric P. Xing,et al.  Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , 2014, ACL 2014.

[11]  Danushka Bollegala,et al.  Using Multiple Sources to Construct a Sentiment Sensitive Thesaurus for Cross-Domain Sentiment Classification , 2011, ACL.

[12]  Mohammad Salehan,et al.  Predicting the performance of online consumer reviews: A sentiment mining approach to big data analytics , 2014, Decis. Support Syst..

[13]  Chihli Hung,et al.  Word sense disambiguation based sentiment lexicons for sentiment classification , 2016, Knowl. Based Syst..

[14]  Nirmalie Wiratunga,et al.  Contextual sentiment analysis for social media genres , 2016, Knowl. Based Syst..

[15]  Ryan Shaun Joazeiro de Baker,et al.  Educational Data Mining and Learning Analytics: Applications to Constructionist Research , 2014, Technology, Knowledge and Learning.

[16]  Panos Panagiotopoulos,et al.  Beyond positive or negative: Qualitative sentiment analysis of social media reactions to unexpected stressful events , 2016, Comput. Hum. Behav..

[17]  Philip J. Stone,et al.  Extracting Information. (Book Reviews: The General Inquirer. A Computer Approach to Content Analysis) , 1967 .

[19]  Patrick Paroubek,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2010, LREC.

[20]  Janyce Wiebe,et al.  Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2005, HLT.

[21]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[22]  Dekang Lin,et al.  Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1 , 2011 .

[23]  Christopher Potts,et al.  Learning Word Vectors for Sentiment Analysis , 2011, ACL.

[24]  Harith Alani,et al.  Contextual semantics for sentiment analysis of Twitter , 2016, Inf. Process. Manag..

[25]  Rui Xia,et al.  Feature Ensemble Plus Sample Selection: Domain Adaptation for Sentiment Classification , 2013, IEEE Intelligent Systems.

[26]  James W. Pennebaker,et al.  Linguistic Inquiry and Word Count (LIWC2007) , 2007 .

[27]  Alton Yeow-Kuan Chua,et al.  Helpfulness of user-generated reviews as a function of review sentiment, product type and information quality , 2016, Comput. Hum. Behav..

[28]  Vibhu O. Mittal,et al.  Comparative Experiments on Sentiment Classification for Online Product Reviews , 2006, AAAI.

[29]  VelcinJulien,et al.  Sentiment analysis on social media for stock movement prediction , 2015 .

[30]  David M. Pennock,et al.  Mining the peanut gallery: opinion extraction and semantic classification of product reviews , 2003, WWW '03.

[31]  Maite Taboada,et al.  Lexicon-Based Methods for Sentiment Analysis , 2011, CL.

[32]  Stewart Massie,et al.  Lexicon based feature extraction for emotion text classification , 2017, Pattern Recognit. Lett..

[33]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[34]  Stewart Massie,et al.  Generating a Word-Emotion Lexicon from #Emotional Tweets , 2014, *SEMEVAL.

[35]  Eric Gilbert,et al.  VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text , 2014, ICWSM.

[36]  William E. Grieb The general inquirer: A computer approach to content analysis: Philip J. Stone, Dexter C. Dunphy, Marshall S. Smith, Daniel M. Ogilvie, with associates. The MIT Press, Cambridge, Massachusetts, 1966. 651 pp. plus xx , 1968 .

[37]  Julien Velcin,et al.  Sentiment analysis on social media for stock movement prediction , 2015, Expert Syst. Appl..