Application of machine learning techniques to sentiment analysis

Today, we live in a ‘data age’. Due to rapid increase in the amount of user-generated data on social media platforms like Twitter, several opportunities and new open doors have been prompted for organizations that endeavour hard to keep a track on customer reviews and opinions about their products. Twitter is a huge fast emergent micro-blogging social networking platform for users to express their views about politics, products sports etc. These views are useful for businesses, government and individuals. Hence, tweets can be used as a valuable source for mining public's opinion. Sentiment analysis is a process of automatically identifying whether a user-generated text expresses positive, negative or neutral opinion about an entity (i.e. product, people, topic, event etc). The objective of this paper is to give step-by-step detail about the process of sentiment analysis on twitter data using machine learning. This paper also provides details of proposed approach for sentiment analysis. This work proposes a Text analysis framework for twitter data using Apache spark and hence is more flexible, fast and scalable. Naïve Bayes and Decision trees machine learning algorithms are used for sentiment analysis in the proposed framework.

[1]  Lei Zhang,et al.  Combining lexicon-based and learning-based methods for twitter sentiment analysis , 2011 .

[2]  Patrizia Grifoni,et al.  Approaches, Tools and Applications for Sentiment Analysis Implementation , 2015 .

[3]  R. Rajasree,et al.  Sentiment analysis in twitter using machine learning techniques , 2013, 2013 Fourth International Conference on Computing, Communications and Networking Technologies (ICCCNT).

[4]  Rajeshwari M. Goudar,et al.  Notice of Violation of IEEE Publication PrinciplesA Speedy Data Uploading Approach for Twitter Trend and Sentiment Analysis Using HADOOP , 2015, 2015 International Conference on Computing Communication Control and Automation.

[5]  Bing Liu,et al.  Sentiment Analysis and Opinion Mining , 2012, Synthesis Lectures on Human Language Technologies.

[6]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[7]  Víctor Robles,et al.  Feature selection for multi-label naive Bayes classification , 2009, Inf. Sci..

[8]  Walaa Medhat,et al.  Sentiment analysis algorithms and applications: A survey , 2014 .

[9]  Diego Klabjan,et al.  Semantic Properties of Customer Sentiment in Tweets , 2014, 2014 28th International Conference on Advanced Information Networking and Applications Workshops.

[10]  Hang-Bong Kang,et al.  Text sentiment classification for SNS-based marketing using domain sentiment dictionary , 2012, 2012 IEEE International Conference on Consumer Electronics (ICCE).

[11]  Ambuj Kumar Agarwal,et al.  Real-time Sentiment Analysis of Big Data Applications Using Twitter Data with Hadoop Framework , 2018 .

[12]  Wiltrud Kessler Turney: Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classication of Reviews , 2012 .

[13]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[14]  Ram Mohana Reddy Guddeti,et al.  NLP based sentiment analysis on Twitter data using ensemble classifiers , 2015, 2015 3rd International Conference on Signal Processing, Communication and Networking (ICSCN).

[15]  Ram Mohana Reddy Guddeti,et al.  Semantic sentiment analysis using context specific grammar , 2015, International Conference on Computing, Communication & Automation.