Real-Time Sentiment Analysis of 2019 Election Tweets using Word2vec and Random Forest Model

Sentiment analysis of social media data consists of attitudes, assessments, and emotions which can be considered a way human think. Understanding and classifying the large collection of documents into positive and negative aspects are a very difficult task. Social networks such as Twitter, Facebook, and Instagram provide a platform in order to gather information about people’s sentiments and opinions. Considering the fact that people spend hours daily on social media and share their opinion on various different topics helps us analyze sentiments better. More and more companies are using social media tools to provide various services and interact with customers. Sentiment Analysis (SA) classifies the polarity of given tweets to positive and negative tweets in order to understand the sentiments of the public. This paper aims to perform sentiment analysis of real-time 2019 election twitter data using the feature selection model word2vec and the machine learning algorithm random forest for sentiment classification. Word2vec with Random Forest improves the accuracy of sentiment analysis significantly compared to traditional methods such as BOW and TF-IDF. Word2vec improves the quality of features by considering contextual semantics of words in a text hence improving the accuracy of machine learning and sentiment analysis.

[1]  Santoshi Kumari,et al.  Real time analysis of top trending event on Twitter: Lexicon based approach , 2017, 2017 8th International Conference on Computing, Communication and Networking Technologies (ICCCNT).

[2]  Kang Liu,et al.  Book Review: Sentiment Analysis: Mining Opinions, Sentiments, and Emotions by Bing Liu , 2015, CL.

[3]  ShamsfardMehrnoush,et al.  Using Linked Data for polarity classification of patients' experiences , 2015 .

[4]  Cláudio de Souza Baptista,et al.  A Comparison of SVM Versus Naive-Bayes Techniques for Sentiment Analysis in Tweets: A Case Study with the 2013 FIFA Confederations Cup , 2014, WebMedia.

[5]  Johanna D. Moore,et al.  Twitter Sentiment Analysis: The Good the Bad and the OMG! , 2011, ICWSM.

[6]  Geetika Gautam,et al.  Sentiment analysis of twitter data using machine learning approaches and semantic analysis , 2014, 2014 Seventh International Conference on Contemporary Computing (IC3).

[7]  Ji Zhang,et al.  Coupling topic modelling in opinion mining for social media analysis , 2017, WI.

[8]  Marina Sokolova,et al.  Word2Vec and Doc2Vec in Unsupervised Sentiment Analysis of Clinical Discharge Summaries , 2018, ArXiv.

[9]  C Narendra Babu,et al.  Real time analysis of social media data to understand people emotions towards national parties , 2017, 2017 8th International Conference on Computing, Communication and Networking Technologies (ICCCNT).

[10]  Owen Rambow,et al.  Sentiment Analysis of Twitter Data , 2011 .

[11]  Roman Kern,et al.  Polarity Classification for Target Phrases in Tweets: A Word2Vec Approach , 2016, ESWC.

[12]  Shrikanth S. Narayanan,et al.  A System for Real-time Twitter Sentiment Analysis of 2012 U.S. Presidential Election Cycle , 2012, ACL.

[13]  Mehrnoush Shamsfard,et al.  Using Linked Data for polarity classification of patients' experiences , 2015, J. Biomed. Informatics.

[14]  R. Rajasree,et al.  Sentiment analysis in twitter using machine learning techniques , 2013, 2013 Fourth International Conference on Computing, Communications and Networking Technologies (ICCCNT).