Multi-Class Sentiment Analysis in Twitter: What if Classification is Not the Answer

With the rapid growth of online social media content, and the impact these have made on people’s behavior, many researchers have been interested in studying these media platforms. A major part of their work focused on sentiment analysis and opinion mining. These refer to the automatic identification of opinions of people toward specific topics by analyzing their posts and publications. Multi-class sentiment analysis, in particular, addresses the identification of the exact sentiment conveyed by the user rather than the overall sentiment polarity of his text message or post. That being the case, we introduce a task different from the conventional multi-class classification, which we run on a data set collected from Twitter. We refer to this task as “quantification.” By the term “quantification,” we mean the identification of all the existing sentiments within an online post (i.e., tweet) instead of attributing a single sentiment label to it. For this sake, we propose an approach that automatically attributes different scores to each sentiment in a tweet, and selects the sentiments with the highest scores which we judge as conveyed in the text. To reach this target, we added to our previously introduced tool SENTA the necessary components to run and perform such a task. Throughout this work, we present the added components; we study the feasibility of quantification, and propose an approach to perform it on a data set made of tweets for 11 different sentiment classes. The data set was manually labeled and the results of the automatic analysis were checked against the human annotation. Our experiments show the feasibility of this task and reach an F1 score equal to 45.9%.

[1]  Hsin-Hsi Chen,et al.  Emotion Classification of Online News Articles from the Reader's Perspective , 2008, 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[2]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[3]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[4]  Ruth Page,et al.  The linguistics of self-branding and micro-celebrity in Twitter: The role of hashtags , 2012 .

[5]  Multiclass Sentiment Prediction using Yelp Business Reviews , 2015 .

[6]  Preslav Nakov,et al.  Semantic Sentiment Analysis of Twitter Data , 2017, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..

[7]  Andrea Esuli,et al.  Optimizing Text Quantifiers for Multivariate Loss Functions , 2015, TKDD.

[8]  Harith Alani,et al.  Semantic Sentiment Analysis of Twitter , 2012, SEMWEB.

[9]  Ali Selamat,et al.  Hybrid sentiment classification on twitter aspect-based sentiment analysis , 2018, Applied Intelligence.

[10]  Maria Virvou,et al.  Sentiment analysis of Facebook statuses using Naive Bayes classifier for language learning , 2013, IISA 2013.

[11]  Ari Rappoport,et al.  Semi-Supervised Recognition of Sarcasm in Twitter and Amazon , 2010, CoNLL.

[12]  Brendan T. O'Connor,et al.  From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series , 2010, ICWSM.

[13]  Boi Faltings,et al.  A :) Is Worth a Thousand Words: How People Attach Sentiment to Emoticons and Words in Tweets , 2013, 2013 International Conference on Social Computing.

[14]  Ee-Peng Lim,et al.  Who is Retweeting the Tweeters? Modeling, Originating, and Promoting Behaviors in the Twitter Network , 2012, TMIS.

[15]  Akshi Kumar,et al.  Sentiment Analysis on Twitter , 2012 .

[16]  Timothy W. Finin,et al.  Why we twitter: understanding microblogging usage and communities , 2007, WebKDD/SNA-KDD '07.

[17]  Juan José del Coz,et al.  Quantification-oriented learning based on reliable classifiers , 2015, Pattern Recognit..

[18]  Krishna P. Gummadi,et al.  Measuring User Influence in Twitter: The Million Follower Fallacy , 2010, ICWSM.

[19]  Masrah Azrifah Azmi Murad,et al.  Term weighting scheme effect in sentiment analysis of online movie reviews , 2018 .

[20]  Luis Alfonso Ureña López,et al.  Sentiment analysis in Twitter , 2012, Natural Language Engineering.

[21]  P. Radha Krishna,et al.  Analyzing Internet Slang for Sentiment Mining , 2010, 2010 Second Vaagdevi International Conference on Information Technology for Real World Problems.

[22]  Jun Xu,et al.  Emotion prediction of news articles from reader's perspective based on multi-label classification , 2012, 2012 International Conference on Machine Learning and Cybernetics.

[23]  Sandeep Joshi,et al.  Various Approaches to Aspect-based Sentiment Analysis , 2018, ArXiv.

[24]  Chung-Hsien Wu,et al.  Emoticon recommendation in microblog using affective trajectory model , 2014, Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific.

[25]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[26]  Nicholas Griffin,et al.  The tweeter matters: Factors that affect false memory from Twitter , 2017, Comput. Hum. Behav..

[27]  José Hernández-Orallo,et al.  Quantification via Probability Estimators , 2010, 2010 IEEE International Conference on Data Mining.

[28]  Giovanni Montana,et al.  Community detection in multiplex networks using Locally Adaptive Random walks , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[29]  Sami Faïz,et al.  Prominent users detection during specific events by learning On- and Off-topic features of user activities , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[30]  George Forman,et al.  Quantifying counts and costs via classification , 2008, Data Mining and Knowledge Discovery.

[31]  L. Ranathunga,et al.  Sentiment analysis: Measuring sentiment strength of call centre conversations , 2015, 2015 IEEE International Conference on Electrical, Computer and Communication Technologies (ICECCT).

[32]  Wei Gao,et al.  Tweet sentiment: From classification to quantification , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[33]  Fabrício Benevenuto,et al.  You followed my bot! Transforming robots into influential users in Twitter , 2013, First Monday.

[34]  Owen Rambow,et al.  Sentiment Analysis of Twitter Data , 2011 .

[35]  Michael Trusov,et al.  Determining Influential Users in Internet Social Networks , 2010 .

[36]  M. Gentzkow,et al.  Social Media and Fake News in the 2016 Election , 2017 .

[37]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[38]  Tomoaki Ohtsuki,et al.  A Pattern-Based Approach for Multi-Class Sentiment Analysis in Twitter , 2017, IEEE Access.

[39]  E. Kassens-Noor Twitter as a teaching practice to enhance active and informal learning in higher education: The case of sustainable tweets , 2012 .

[40]  Tomoaki Ohtsuki,et al.  Sentiment Analysis in Twitter for Multiple Topics : How to Detect the Polarity of Tweets Regardless of Their Topic (知的環境とセンサネットワーク) , 2015, AmI 2015.

[41]  Tomoaki Ohtsuki,et al.  Sarcasm Detection in Twitter: "All Your Products Are Incredibly Amazing!!!" - Are They Really? , 2014, 2015 IEEE Global Communications Conference (GLOBECOM).

[42]  Hsin-Hsi Chen,et al.  What emotions do news articles trigger in their readers? , 2007, SIGIR.

[43]  Kurt Junshean Espinosa,et al.  Optimizing N-gram based text feature selection in sentiment analysis for commercial products in Twitter through polarity lexicons , 2014, IISA 2014, The 5th International Conference on Information, Intelligence, Systems and Applications.

[44]  Mahinder Pal Singh Bhatia,et al.  Quantifying modified opinion strength: A fuzzy inference system for Sentiment Analysis , 2013, 2013 International Conference on Advances in Computing, Communications and Informatics (ICACCI).

[45]  K. Ghag,et al.  Comparative analysis of the techniques for Sentiment Analysis , 2013, 2013 International Conference on Advances in Technology and Engineering (ICATE).

[46]  Tomoaki Ohtsuki,et al.  Sentiment analysis: From binary to multi-class classification: A pattern-based approach for multi-class sentiment analysis in Twitter , 2016, 2016 IEEE International Conference on Communications (ICC).

[47]  Amandeep Dhir,et al.  Tweeters on Campus: Twitter a Learning Tool in Classroom? , 2013, J. Univers. Comput. Sci..

[48]  J. Fernando Sánchez-Rada,et al.  Enhancing deep learning sentiment analysis with ensemble techniques in social applications , 2020 .

[49]  Bartosz Krawczyk,et al.  Sentiment Classification from Multi-class Imbalanced Twitter Data Using Binarization , 2017, HAIS.

[50]  Mika V. Mäntylä,et al.  The evolution of sentiment analysis - A review of research topics, venues, and top cited papers , 2016, Comput. Sci. Rev..