Lexicon-Based Sentiment Analysis for Movie Review Tweets

Sentiment analysis is a computational process to identify and classify subjective information such as positive, negative and neutral from the source material. It is able to extract feeling and emotion from a piece of a sentence. This technology has been widely used to extract valuable information from people’s views on social media. Hence, this project aims to classify movie reviews into positives, negatives and neutral polarity using lexicon-based method which used R as the language and development framework. Twitter data is used as the source material. Firstly, tweets were extracted using RStudio and Twitter API. Then data pre-processing was done by removing all the stop words and noises. Next was the tokenization process, which separates the words and matches the separated words with positive and negative words vocabulary. Finally, the result of the sentiment analysis is produced into positive, negative and neutral polarities. The results were evaluated using standard evaluation metrics that are the precision, recall, F1 score and accuracy. After all, it is found that the basic lexicon-based method is able to classify sentiment quite well with 52% accuracy. Apparently, the accuracy value achieved in our experiment is not impressive enough, but it is worth corresponding to the simplicity and minimal cost of development for sentiment analysis on Twitter data for movies.

[1]  Philip Treleaven,et al.  Twitter Sentiment Analysis , 2015, ArXiv.

[2]  Yaxin Bi,et al.  Improved lexicon-based sentiment analysis for social media analytics , 2015, Security Informatics.

[3]  Anju Bala,et al.  Analyzing Twitter sentiments through big data , 2016, 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom).

[4]  Muhammad Zubair Asghar,et al.  Lexicon-Based Sentiment Analysis in the Social Web , 2014 .

[5]  Dibakar Ray Lexicon Based Sentiment Analysis of Twitter Data , 2017 .

[6]  Santanu Kumar Rath,et al.  Classification of Sentimental Reviews Using Machine Learning Techniques , 2015 .

[7]  Ebru Akcapinar Sezer,et al.  Assessment of Feature Selection Metrics for Sentiment Analyses: Turkish Movie Reviews , 2014 .

[8]  J. S. Teja,et al.  Sentiment Analysis of Movie Reviews Using Machine Learning Algorithms-A Survey , 2018 .

[9]  Zainab Abu Bakar,et al.  Interface features of semantic web search engine , 2013, 2013 IEEE Conference on e-Learning, e-Management and e-Services.

[10]  Giovanni Semeraro,et al.  A Comparison of Lexicon-based Approaches for Sentiment Analysis of Microblog Posts , 2014, DART@AI*IA.

[11]  Saleem Alhabash,et al.  A Tale of Four Platforms: Motivations and Uses of Facebook, Twitter, Instagram, and Snapchat Among College Students? , 2017 .

[12]  Girish K. Patnaik,et al.  Analyzing Sentiment of Movie Review Data using Naive Bayes Neural Classifier , 2014 .

[13]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[14]  Huy Nguyen,et al.  Twitter Sentiment Analysis Using Machine Learning Techniques , 2020, ICCSAMA.

[15]  Walaa Medhat,et al.  Sentiment analysis algorithms and applications: A survey , 2014 .

[16]  Johanna D. Moore,et al.  Twitter Sentiment Analysis: The Good the Bad and the OMG! , 2011, ICWSM.

[17]  Linh Vu,et al.  A lexicon-based method for Sentiment Analysis using social network data , 2017 .

[18]  T. D. V. Kiran,et al.  Twitter sentiment analysis of game reviews using machine learning techniques , 2016 .

[19]  Burairah Hussin,et al.  Opinion Mining of Movie Review using Hybrid Method of Support Vector Machine and Particle Swarm Optimization , 2013 .

[20]  Haoran Xie,et al.  Sentiment Classification Using Negative and Intensive Sentiment Supplement Information , 2019, Data Science and Engineering.

[21]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[22]  Vineet Yadav,et al.  Serendio: Simple and Practical lexicon based approach to Sentiment Analysis , 2013, *SEMEVAL.

[23]  Owen Rambow,et al.  Sentiment Analysis of Twitter Data , 2011 .

[24]  Jamila El Alami,et al.  Impact of corpus domain for sentiment classification: An evaluation study using supervised machine learning techniques , 2017 .

[25]  Doaa Mohey El Din Mohamed Hussein,et al.  A survey on sentiment analysis challenges , 2016, Journal of King Saud University - Engineering Sciences.