Twitter sentiment analysis using hybrid Spider Monkey optimization method

The use of social media, over the past few years, has escalated enormously. Social media has formed a platform for the availability of abundant data. Thousands of people express their perceptions through social media. Sentiment Analysis (SA) of such views and perceptions is very substantial to measure public notion on a peculiar/specific subject matter of concern. SA is a remarkable field of data mining concerned with identification and translation of sentiments accessible on social media. Twitter is a microblogging site in which users can post updates (tweets) to friends (followers). This paper proposes a mechanism for extracting the sentiments from the tweets posted on Twitter. Tweets can be classified as positive, neutral or negative. The metaheuristic-based clustering techniques are superior to conventional techniques due to the subjective behaviour of tweets. A hybrid strategy, named as Hybrid Spider Monkey optimization with k-means clustering, is introduced to obtain the optimal cluster-heads of the dataset. The accuracy of the proposed method is determined on two datasets, namely, sender2 and twitter. To analyse the authenticity of the proposed method, a comparative analysis is performed with a few significant Nature-Inspired Algorithms such as Spider-Monkey optimization, Particle-Swarm algorithm, Genetic-Algorithm and Differential Evolution.

[1]  Anil K. Jain Data clustering: 50 years beyond K-means , 2010, Pattern Recognit. Lett..

[2]  Annapurna Bhargava,et al.  Optimal placement and sizing of capacitor using Limaçon inspired spider monkey optimization algorithm , 2016, Memetic Comput..

[3]  Melanie Mitchell,et al.  An introduction to genetic algorithms , 1996 .

[4]  W. John Wilbur,et al.  The automatic identification of stop words , 1992, J. Inf. Sci..

[5]  Wu Yang,et al.  Hybrid approach framework for sentiment classification on microblogging , 2017, 2017 Computing Conference.

[6]  Minyi Guo,et al.  Emoticon Smoothed Language Models for Twitter Sentiment Analysis , 2012, AAAI.

[7]  Owen Rambow,et al.  Sentiment Analysis of Twitter Data , 2011 .

[8]  Huma Parveen,et al.  Sentiment analysis on Twitter Data-set using Naive Bayes algorithm , 2016, 2016 2nd International Conference on Applied and Theoretical Computing and Communication Technology (iCATccT).

[9]  Rincy Jose,et al.  Prediction of election result by enhanced sentiment analysis on twitter data using classifier ensemble Approach , 2016, 2016 International Conference on Data Mining and Advanced Computing (SAPIENCE).

[10]  Yuhui Shi,et al.  Particle swarm optimization: developments, applications and resources , 2001, Proceedings of the 2001 Congress on Evolutionary Computation (IEEE Cat. No.01TH8546).

[11]  Amir Hossein Gandomi,et al.  Cuckoo search algorithm: a metaheuristic approach to solve structural optimization problems , 2011, Engineering with Computers.

[12]  Harish Sharma,et al.  Spider Monkey Optimization algorithm for numerical optimization , 2014, Memetic Computing.

[13]  Alan C. Bovik,et al.  Mean squared error: Love it or leave it? A new look at Signal Fidelity Measures , 2009, IEEE Signal Processing Magazine.

[14]  Georgina Cosma,et al.  A novel approach based on an extended cuckoo search algorithm for the classification of tweets which contain Emoticon and Emoji , 2017, 2017 2nd International Conference on Knowledge Engineering and Applications (ICKEA).

[15]  Rupal Bhargava,et al.  Sentiment analysis for mixed script Indic sentences , 2016, 2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI).

[16]  R. Juday Optimal realizable filters and the minimum Euclidean distance principle. , 1993, Applied optics.

[17]  Shrikanth S. Narayanan,et al.  A System for Real-time Twitter Sentiment Analysis of 2012 U.S. Presidential Election Cycle , 2012, ACL.

[18]  Eytan Domany,et al.  Resampling Method for Unsupervised Estimation of Cluster Validity , 2001, Neural Computation.

[19]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[20]  Rodney A. Brooks,et al.  The ACRONYM Model-Based Vision System , 1979, IJCAI.

[21]  Nikos A. Vlassis,et al.  The global k-means clustering algorithm , 2003, Pattern Recognit..

[22]  Metin Bilgin,et al.  Sentiment analysis on Twitter data with semi-supervised Doc2Vec , 2017, 2017 International Conference on Computer Science and Engineering (UBMK).

[23]  Walaa Medhat,et al.  Sentiment analysis algorithms and applications: A survey , 2014 .

[24]  Teng-Sheng Moh,et al.  Prediction of Indian election using sentiment analysis on Hindi Twitter , 2016, 2016 IEEE International Conference on Big Data (Big Data).

[25]  Sara Tedmori,et al.  Arabic Sentiment Classification: A Hybrid Approach , 2017, 2017 International Conference on New Trends in Computing Sciences (ICTCS).

[26]  Konstantinos Tserpes,et al.  Comparing Methods for Twitter Sentiment Analysis , 2014, KDIR.

[27]  Irina Rish,et al.  An empirical study of the naive Bayes classifier , 2001 .

[28]  Mehrdad Jalali,et al.  A new sentiment classification method based on hybrid classification in Twitter , 2015, 2015 International Congress on Technology, Communication and Knowledge (ICTCK).

[29]  Estevam R. Hruschka,et al.  Tweet sentiment analysis with classifier ensembles , 2014, Decis. Support Syst..

[30]  P. N. Suganthan,et al.  Differential Evolution Algorithm With Strategy Adaptation for Global Numerical Optimization , 2009, IEEE Transactions on Evolutionary Computation.

[31]  Marko Jurmu,et al.  Detection, classification and visualization of place-triggered geotagged tweets , 2012, UbiComp.

[32]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[33]  Avinash Chandra Pandey,et al.  Twitter sentiment analysis using hybrid cuckoo search method , 2017, Inf. Process. Manag..

[34]  Kodrat Iman Satoto,et al.  Sentiment analysis on Twitter posts: An analysis of positive or negative opinion on GoJek , 2017, 2017 4th International Conference on Information Technology, Computer, and Electrical Engineering (ICITACEE).

[35]  Clement T. Yu,et al.  Construction of a sentimental word dictionary , 2010, CIKM '10.

[36]  Bayu Distiawan Trisedya,et al.  Stock price prediction using linear regression based on sentiment analysis , 2015, 2015 International Conference on Advanced Computer Science and Information Systems (ICACSIS).