Combining Semantic and Prior Polarity for Boosting Twitter Sentiment Analysis

Twitter sentiment analysis offers organizations an ability to monitor public feeling towards the products and events related to them in real time. Most existing researches for Twitter sentiment analysis are focused on the extraction of sentiment feature of lexical and syntactic feature that are expressed explicitly through words, emoticons, exclamation marks etc, although sentiment implicitly expressed via latent contextual semantic relations, dependencies among words in tweets are ignored. In this paper, we introduce distributed representation of sentence that can capture co-occurrence statistics and contextual semantic relations of words in tweets, and represent a tweet via a fixed size feature vector. We used the feature vector as sentence semantic feature for the tweet. We combined semantic feature, prior polarity score feature and n-grams feature as sentiment feature set of tweets, and incorporated the feature set into Support Vector Machines(SVM) model training and predicting sentiment classification label. We used six Twitter datasets in our evaluation and compared the performance against n-grams model baseline. Results show the superior performance of our method in accuracy sentiment classification.

[1]  Mike Thelwall,et al.  Sentiment strength detection for the social web , 2012, J. Assoc. Inf. Sci. Technol..

[2]  Mike Thelwall,et al.  Twitter, MySpace, Digg: Unsupervised Sentiment Analysis in Social Media , 2012, TIST.

[3]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[4]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[5]  Ming Zhou,et al.  Coooolll: A Deep Learning System for Twitter Sentiment Classification , 2014, *SEMEVAL.

[6]  Luis Alfonso Ureña López,et al.  A knowledge‐based approach for polarity classification in Twitter , 2014, J. Assoc. Inf. Sci. Technol..

[7]  Finn Årup Nielsen,et al.  A New ANEW: Evaluation of a Word List for Sentiment Analysis in Microblogs , 2011, #MSM.

[8]  S. Albayrak,et al.  Language-Independent Twitter Sentiment Analysis , 2012 .

[9]  Harith Alani,et al.  Semantic Sentiment Analysis of Twitter , 2012, SEMWEB.

[10]  Vasudeva Varma,et al.  Mining Sentiments from Tweets , 2012, WASSA@ACL.

[11]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[12]  Janyce Wiebe,et al.  Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2005, HLT.

[13]  Nick Bassiliades,et al.  Ontology-based sentiment analysis of twitter posts , 2013, Expert Syst. Appl..

[14]  Huan Liu,et al.  Unsupervised sentiment analysis with emotional signals , 2013, WWW.

[15]  Alexander Dekhtyar,et al.  Information Retrieval , 2018, Lecture Notes in Computer Science.

[16]  Harith Alani,et al.  Evaluation Datasets for Twitter Sentiment Analysis: A survey and a new dataset, the STS-Gold , 2013, ESSEM@AI*IA.

[17]  Vaibhavi N Patodkar,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2016 .

[18]  Saif Mohammad,et al.  Sentiment Analysis of Short Informal Texts , 2014, J. Artif. Intell. Res..

[19]  Nancy Ide,et al.  Distant Supervision for Emotion Classification with Discrete Binary Values , 2013, CICLing.

[20]  Owen Rambow,et al.  Sentiment Analysis of Twitter Data , 2011 .

[21]  Johanna D. Moore,et al.  Twitter Sentiment Analysis: The Good the Bad and the OMG! , 2011, ICWSM.